JPH10143516A

JPH10143516A - Database management system, database search method, and recording medium

Info

Publication number: JPH10143516A
Application number: JP8296927A
Authority: JP
Inventors: Yoshihiro Mori; 吉弘森
Original assignee: Kubota Corp
Current assignee: Kubota Corp
Priority date: 1996-11-08
Filing date: 1996-11-08
Publication date: 1998-05-29

Abstract

(57) [Abstract]

[Problem] To provide a database management system, a database search method, and a recording medium used to implement them, which can search using an optimal search order of the search conditions, obtained by a method that does not burden the computer and can be executed at any time.

[Solution] A database management system that searches, under given search conditions, a database consisting of a data group having a plurality of attributes, each attribute having one or more classification items. The system comprises comparing means (S26) for comparing, across attributes, the degree of dispersion of each attribute's data amount over its classification items; setting means (S27) for setting, based on the comparison result of the comparing means (S26), the search order used when a plurality of classification items of different attributes are given as search conditions, such that priority is given to the classification item of the attribute with the larger degree of dispersion; and search means (S28) for searching according to the search order set by the setting means (S27).

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical field of the invention] The present invention relates to a database management system, a database search method, and a recording medium used to implement them, for searching, under given search conditions, a database consisting of a data group having a plurality of attributes, each attribute having one or more classification items.

[0002]

[Prior art] Consider a database storing 500,000 personal records for a certain city, with the attributes gender, occupation, and address, and suppose it is searched with the condition "male, and occupation is jewelry processing". If the search starts from the condition on the classification item "male" of the gender attribute, then, as shown in FIG. 7(a), the classification items "male" and "female" can each be expected to match about 250,000 records. The system must first read out about 250,000 "male" records and then search those records using the classification item "jewelry processing" as the condition, which takes a long time.

[0003] On the other hand, if the search starts from the condition on the classification item "jewelry processing" of the occupation attribute, then, as shown in FIG. 7(b), the number of matching records can be expected to be small (a few tens at most). The system first reads out these few tens of records and then searches them using the classification item "male" as the condition, which finishes quickly. In other words, when a database is searched with multiple search conditions, the required time can differ greatly depending on the order of the conditions. This happens when one of the search conditions matches a large amount of data and the search starts from that classification item. (For every attribute, the data amounts of its classification items sum to the same total, here 500,000.)
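The effect of condition order can be illustrated with a small sketch. It assumes a toy in-memory table of 1,000 synthetic records (not the 500,000 of the example) and a hypothetical helper, `staged_search`, that applies the conditions one after another and reports how many records remain after each stage. None of these names come from the patent.

```python
def staged_search(records, conditions):
    """Apply (attribute, item) conditions in the given order.

    Returns the final matches and the number of records remaining
    after each stage, i.e. the size of each intermediate result.
    """
    current = records
    sizes = []
    for attribute, item in conditions:
        current = [r for r in current if r[attribute] == item]
        sizes.append(len(current))
    return current, sizes


# Synthetic data: half the records are "male"; only four are jewelers.
records = [
    {
        "gender": "male" if i % 2 == 0 else "female",
        "occupation": "jewelry processing" if i < 4 else "clerk",
    }
    for i in range(1000)
]

gender_first = [("gender", "male"), ("occupation", "jewelry processing")]
occupation_first = [("occupation", "jewelry processing"), ("gender", "male")]

matches_a, sizes_a = staged_search(records, gender_first)
matches_b, sizes_b = staged_search(records, occupation_first)

print(sizes_a)  # [500, 2]
print(sizes_b)  # [4, 2]
```

Both orders return the same two records, but the gender-first order carries 500 records into the second stage, while the occupation-first order carries only 4.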

[0004]

[Problem to be solved by the invention] Conventional methods handle such cases poorly. A simultaneous multi-condition search, for example, runs several searches at once in order to use the idle part of the computer's capacity, but it is not efficient when several users access the computer at the same time and no capacity is idle. Another method builds a data-amount distribution table for each search condition and searches from the condition with the smallest data amount. This requires counting all data in the database for every search condition; building the table takes a great deal of time and cannot be done at any time, so the method is not effective.

[0005] The present invention was made in view of the above circumstances. Its object is to provide a database management system, a database search method, and a recording medium used to implement them, which can search using an optimal search order of the search conditions, obtained by a method that does not burden the computer and can be executed at any time.

[0006]

[Means for solving the problem] A database management system according to the first invention searches, under given search conditions, a database consisting of a data group having a plurality of attributes, each attribute having one or more classification items. It is characterized by comprising setting means for setting the search order used when a plurality of classification items of different attributes are given as search conditions, and search means for searching according to the search order set by the setting means.

[0007] In the database management system according to the first invention, the setting means sets the search order used when a plurality of classification items of different attributes are given as search conditions, and the search means searches according to that order. An optimal search order of the search conditions is thus obtained, and the search can be carried out in a short time without burdening the computer.

[0008] A database management system according to the second invention comprises comparing means for comparing, across the attributes, the degree of dispersion of each attribute's data amount over its classification items. It is characterized in that the setting means, based on the comparison result of the comparing means, gives search priority to the classification item of the attribute with the larger degree of dispersion.

[0009] A database management system according to the third invention is characterized by comprising calculating means for calculating the degree of dispersion for each attribute.

[0010] A database management system according to the fourth invention is characterized in that the degree of dispersion is obtained as follows: the terms of an arithmetic progression with a predetermined common difference are assigned to the classification items in a predetermined order that depends on the data amount of each item; the data amounts are treated as frequencies and the terms as classes; and, for each attribute, the variance of the resulting frequency distribution over the classes is calculated.

[0011] A database management system according to the fifth invention is characterized in that the variance is a sample variance obtained by estimation.

[0012] A database search method according to the sixth invention searches, under given search conditions, a database consisting of a data group having a plurality of attributes, each attribute having one or more classification items. It is characterized by comparing, across the attributes, the degree of dispersion of each attribute's data amount over its classification items; setting the search order used when a plurality of classification items of different attributes are given as search conditions so that priority is given to the classification item of the attribute with the larger degree of dispersion; and searching according to that search order.

[0013] In the database management system according to the second invention and the database search method according to the sixth invention, the comparing means compares, across the attributes, the degree of dispersion of each attribute's data amount over its classification items. The setting means then uses the comparison result to set the search order for a plurality of classification items of different attributes so that priority is given to the classification item of the attribute with the larger degree of dispersion, and the search means searches according to that order. The search can therefore start from an attribute such as the residential area shown in FIG. 7(c): the data are split across many classification items, the differences among the items' data amounts are small so that the degree of dispersion is large, and there is no risk of searching a classification item with a large data amount. The search can thus be completed in a short time.

[0014] In the database management system according to the third invention, the calculating means calculates the degree of dispersion of each attribute's data amount over its classification items. The comparing means compares these degrees of dispersion across the attributes, the setting means uses the comparison result to set the search order so that priority is given to the classification item of the attribute with the larger degree of dispersion, and the search means searches according to that order. Because the degree of dispersion can be calculated for each attribute, the search can start from an attribute such as the residential area shown in FIG. 7(c), in which the data are split across many classification items, the differences among the items' data amounts are small so that the degree of dispersion is large, and there is no risk of searching a classification item with a large data amount. The search can thus be completed in a short time.

[0015] A database search method according to the seventh invention is characterized in that the degree of dispersion is obtained by assigning the terms of an arithmetic progression with a predetermined common difference to the classification items in a predetermined order that depends on the data amount of each item, treating the data amounts as frequencies and the terms as classes, and calculating, for each attribute, the variance of the resulting frequency distribution over the classes.

[0016] In the database management system according to the fourth invention and the database search method according to the seventh invention, the calculating means assigns the terms of an arithmetic progression with a predetermined common difference to the classification items in a predetermined order that depends on the data amount of each item, treats the data amounts as frequencies and the terms as classes, and calculates, for each attribute, the variance of the frequency distribution over the classes. Because the degree of dispersion can be calculated for each attribute in this way, the search can start from an attribute such as the residential area shown in FIG. 7(c), in which the data are split across many classification items, the differences among the items' data amounts are small so that the degree of dispersion is large, and there is no risk of searching a classification item with a large data amount. The search can thus be completed in a short time.

[0017] A database search method according to the eighth invention is characterized in that the variance is a sample variance obtained by estimation.

[0018] In the database management system according to the fifth invention and the database search method according to the eighth invention, the calculating means calculates the variance, which is the degree of dispersion of each attribute's data amount over its classification items, as a sample variance estimated from randomly sampled data. The comparing means compares the degrees of dispersion across the attributes, the setting means uses the comparison result to set the search order so that priority is given to the classification item of the attribute with the larger degree of dispersion, and the search means searches according to that order. Because the degree of dispersion can be calculated in a short time, the search can be completed quickly without burdening the computer.

[0019] A recording medium according to the ninth invention stores a computer program for searching, under given search conditions, a database consisting of a data group having a plurality of attributes, each attribute having one or more classification items. It is characterized in that the program includes a step of comparing, across the attributes, the degree of dispersion of each attribute's data amount over its classification items; a step of setting, based on the comparison result of that step, the search order used when a plurality of classification items of different attributes are given as search conditions so that priority is given to the classification item of the attribute with the larger degree of dispersion; and a step of searching according to that search order.

[0020] A computer controlled by the computer program stored on this recording medium compares, across the attributes, the degree of dispersion of each attribute's data amount over its classification items, uses the comparison result to set the search order for a plurality of classification items of different attributes so that priority is given to the classification item of the attribute with the larger degree of dispersion, and searches according to that order. The search can therefore start from an attribute such as the residential area shown in FIG. 7(c), in which the data are split across many classification items, the differences among the items' data amounts are small so that the degree of dispersion is large, and there is no risk of searching a classification item with a large data amount. The search can thus be completed in a short time.

[0021] A recording medium according to the tenth invention is characterized in that the program includes a step of assigning the terms of an arithmetic progression with a predetermined common difference to the classification items in a predetermined order that depends on the data amount of each item, and a step of treating the data amounts as frequencies and the terms as classes and calculating, for each attribute, the variance of the frequency distribution over the classes as the degree of dispersion.

[0022] In a computer controlled by the computer program stored on this recording medium, the calculating means assigns the terms of an arithmetic progression with a predetermined common difference to the classification items in a predetermined order that depends on the data amount of each item, treats the data amounts as frequencies and the terms as classes, and calculates, for each attribute, the variance of the frequency distribution over the classes as the degree of dispersion. Because the degree of dispersion can be calculated for each attribute in this way, the search can start from an attribute such as the residential area shown in FIG. 7(c), in which the data are split across many classification items, the differences among the items' data amounts are small so that the degree of dispersion is large, and there is no risk of searching a classification item with a large data amount. The search can thus be completed in a short time.

[0023] A recording medium according to the eleventh invention is characterized in that the variance is a sample variance obtained by estimation.

[0024] In a computer controlled by the computer program stored on this recording medium, the calculating means calculates the variance, which is the degree of dispersion of each attribute's data amount over its classification items, as a sample variance estimated from randomly sampled data. The comparing means compares the degrees of dispersion across the attributes, the setting means uses the comparison result to set the search order so that priority is given to the classification item of the attribute with the larger degree of dispersion, and the search means searches according to that order. Because the degree of dispersion can be calculated in a short time, the search can be completed quickly without burdening the computer.

[0025]

[Embodiments of the invention] The present invention is described below with reference to the drawings showing its embodiment. FIG. 1 is a block diagram showing the configuration of an embodiment of the database management system and database search method according to the present invention. The database management system comprises a CPU 10, a ROM 11, a RAM 12, a database 13 consisting of a magnetic disk and the data recorded on it, and a CRT control circuit 14 that generates the signals to be displayed on a CRT 15 and controls the display of the CRT 15.

[0026] The database management system also comprises a keyboard 16 for user input, a printer 17, a disk drive 18 for driving a flexible disk, and a bus 17 that interconnects the CPU 10, ROM 11, RAM 12, database 13, CRT control circuit 14, keyboard 16, printer 17, and disk drive 18.

[0027] The disk drive 18 is loaded with a flexible disk FD (see FIG. 6) on which a computer program for searching the database 13 is recorded. This computer program controls the CPU 10 once all or part of it has been read into the RAM 12.

[0028] The database management system configured in this way is described below. The system calculates, at a predetermined time, the degree of dispersion of each attribute's data amount over its classification items. To do so, a distribution table in which the classification items appear in arbitrary order, such as the one in FIG. 7(b), is rearranged as shown in FIG. 4(a): the classification item with the largest data amount $y_m$ is placed at the center, and the remaining items are distributed outward from the center in descending order of data amount. The terms $\dots, x_{m-2}, x_{m-1}, x_m, x_{m+1}, x_{m+2}, \dots, x_n$ of an arithmetic progression with a predetermined common difference (where $n$ is the total number of classification items of the attribute) are then assigned to the items in sequence to quantify them. Taking the data amount $y$ as the frequency and each term $x$ as a class, the variance of the frequency distribution over the classes $x$ is calculated for each attribute and used as the degree of dispersion.
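A minimal sketch of this quantification step follows. It rests on two assumptions that the text does not fix: items are placed alternately to the right and left of the center, and the arithmetic progression is 1, 2, 3, ... (first term 1, common difference 1). The function names `center_out` and `quantify` and the occupation counts are illustrative only.

```python
from collections import deque


def center_out(item_counts):
    """Arrange (item, count) pairs with the largest count in the middle.

    Items are taken in descending order of count and placed alternately
    to the right and to the left (an assumed placement rule).
    """
    arranged = deque()
    ranked = sorted(item_counts.items(), key=lambda kv: kv[1], reverse=True)
    for rank, pair in enumerate(ranked):
        if rank % 2 == 0:
            arranged.append(pair)
        else:
            arranged.appendleft(pair)
    return list(arranged)


def quantify(item_counts, first=1, step=1):
    """Assign terms of an arithmetic progression to the arranged items.

    Returns (x, item, y) triples: class value, item name, data amount.
    """
    return [
        (first + step * i, item, count)
        for i, (item, count) in enumerate(center_out(item_counts))
    ]


# Hypothetical data amounts per occupation.
occupation_counts = {"clerk": 50, "farmer": 20, "teacher": 20, "jeweler": 5, "pilot": 5}
for x, item, y in quantify(occupation_counts):
    print(x, item, y)
```

With this arrangement the data amounts read 5, 20, 50, 20, 5 from left to right, so the frequency distribution over the class values is roughly bell-shaped.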

[0029] When the data amount $y$ is sufficiently large, the frequency distribution over the classes $x$ can be regarded as a normal distribution, so the variance can be obtained as a sample variance by estimation. For the estimation, $Y_t$ records, a sample of reasonable size, are extracted at random from the attribute's total data amount $y_t$ and counted per classification item. In the same way as in FIG. 4(a), the classification item with the largest count $Y_m$ is placed at the center, and the remaining items are distributed outward from the center in descending order of count. The terms $\dots, X_{m-2}, X_{m-1}, X_m, X_{m+1}, X_{m+2}, \dots, X_p$ of an arithmetic progression with a predetermined common difference (where $p$ is the number of classification items to which the randomly extracted data belong) are assigned to the items in sequence. Taking the count $Y$ as the frequency and each term $X$ as a class, the sample variance of the frequency distribution over the classes $X$ is then calculated.
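The sampling and counting step might look like the following sketch. It assumes the attribute's values are available as an in-memory list and uses a fixed seed only to make the illustration repeatable; `sample_counts` is a hypothetical helper, and the region names and sizes are made up.

```python
import random
from collections import Counter


def sample_counts(values, sample_size, seed=None):
    """Draw sample_size values without replacement and count them per item."""
    rng = random.Random(seed)
    sample = rng.sample(values, sample_size)
    return Counter(sample)


# Synthetic attribute column: 10,000 values spread over four items.
values = ["north"] * 4000 + ["south"] * 3000 + ["east"] * 2000 + ["west"] * 1000
counts = sample_counts(values, sample_size=500, seed=0)

print(sum(counts.values()))  # 500, the sample size Y_t
print(len(counts))           # p, the number of items present in the sample
```

Only the sampled records are counted, so the cost of this step depends on the sample size rather than on the size of the whole database.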

[0030] The database management system thus calculates the degree of dispersion of each attribute's data amount over its classification items. It then compares these degrees of dispersion across the attributes, sets the search order used when a plurality of classification items of different attributes are given as search conditions so that priority is given to the classification item of the attribute with the larger degree of dispersion, and searches according to that order.

[0031] FIG. 2 is a flowchart showing how the database management system calculates the degree of dispersion of each attribute's data amount over its classification items. When, for example, the database has been updated (S10), the system confirms that there is no access to the database (S12). It then extracts at random $Y_t$ records, a predetermined number determined from the attribute's total data amount $y_t$, and counts them per classification item (S14).

[0032] Next, in the same way as in FIG. 4(a), the system places the classification item with the largest counted data amount $Y_m$ (S14) at the center and distributes the remaining items outward from the center in descending order of their data amounts $\dots, Y_{m-2}, Y_{m-1}, Y_m, Y_{m+1}, Y_{m+2}, \dots, Y_p$. It then assigns to the items, in sequence, the terms $\dots, X_{m-2}, X_{m-1}, X_m, X_{m+1}, X_{m+2}, \dots, X_p$ of an arithmetic progression with a predetermined common difference, where $p$ is the number of classification items to which the randomly extracted data belong (S16).

[0033] Next, the system calculates the mean $X_a$ of the quantified classification items $\dots, X_{m-2}, X_{m-1}, X_m, X_{m+1}, X_{m+2}, \dots, X_p$ to which the randomly extracted data belong, using

$$X_a = \frac{\sum_{i=1}^{p} X_i \, Y_i}{Y_t} \tag{1}$$

[0034] Next, using the mean $X_a$ calculated by equation (1), the system calculates the sample variance $s^2$ of the frequency distribution over the classes $X$ by

$$s^2 = \frac{\sum_{i=1}^{p} (X_i - X_a)^2 \, Y_i}{Y_t} \tag{2}$$

and takes $s^2$ as the degree of dispersion of the attribute's data amount $y$ over its classification items (classes $x$) (S18). The system repeats this for all attributes (S19), calculating the sample variance $s^2$ by the method described above and taking it as the degree of dispersion of each attribute's data amount $y$ over its classification items (classes $x$) (S18).
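Equations (1) and (2) translate directly into code. The sketch below assumes the class values X_i and counts Y_i are given as parallel lists; `sample_variance` is an illustrative name, and the input values are made up. Note that equation (2) divides by Y_t rather than Y_t - 1, and the sketch follows the text.

```python
def sample_variance(xs, ys):
    """Return (X_a, s2) for class values xs weighted by counts ys.

    X_a = sum(X_i * Y_i) / Y_t              ... equation (1)
    s2  = sum((X_i - X_a)**2 * Y_i) / Y_t   ... equation (2)
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    total = sum(ys)  # Y_t, the number of sampled records
    if total == 0:
        raise ValueError("at least one record is required")
    mean = sum(x * y for x, y in zip(xs, ys)) / total
    variance = sum((x - mean) ** 2 * y for x, y in zip(xs, ys)) / total
    return mean, variance


mean, s2 = sample_variance([1, 2, 3, 4, 5], [5, 20, 50, 20, 5])
print(mean, s2)  # 3.0 0.8
```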

[0035] FIG. 3 is a flowchart showing the search operation of the database management system. When the user, looking at the CRT 15, selects the data search function with the keyboard 16 (S20), the system displays a data search screen on the CRT 15 (S22). Looking at this screen, the user enters with the keyboard 16 a plurality of classification items of different attributes as the search conditions, for example "male, and occupation is jewelry processing" (see FIG. 7) (S24).

【００３６】データベース管理システムは、検索するデ
ータ項目が入力されると（Ｓ２４）、検索条件である分
類項目が属する属性（「性」、「職業」）毎の標本分散
を比較し（Ｓ２６）、標本分散が大である属性（「職
業」）の分類項目（「宝石加工」）の検索順位を１番と
し、標本分散が２番目に大である属性（この場合は、標
本分散が小である「性」）の検索順位を２番として設定
する（Ｓ２７）。When the data items to be searched have been entered (S24), the database management system compares the sample variances of the attributes ("sex", "occupation") to which the classification items given as search conditions belong (S26). It sets the search order of the classification item ("jewelry processing") of the attribute with the largest sample variance ("occupation") to first, and the search order of the attribute with the second largest sample variance (in this case "sex", whose sample variance is smaller) to second (S27).

【００３７】次に、この設定した（Ｓ２７）検索順位に
従って、検索順位が１番の属性（「職業」）の分類項目
（「宝石加工」）により検索を行い、次に、この検索し
たデータから、検索順位が２番である属性（「性」）の
分類項目（「男」）により検索を行う（Ｓ２８）。次
に、この検索したデータをＣＲＴ１５に表示する（Ｓ３
０）。Next, following the search order set in S27, a search is performed with the classification item ("jewelry processing") of the attribute ranked first ("occupation"); then the data retrieved by that search are searched with the classification item ("male") of the attribute ranked second ("sex") (S28). The retrieved data are then displayed on the CRT 15 (S30).
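The ordering and narrowing described in steps S26 to S28 can be sketched as follows. This is a minimal in-memory illustration, not the patent's implementation: the record layout, the function names `order_conditions` and `search`, and the dispersion values are all hypothetical, and the per-attribute dispersion is assumed to have been computed beforehand.

```python
from typing import Dict, List, Tuple

Record = Dict[str, str]


def order_conditions(conditions: Dict[str, str],
                     dispersion: Dict[str, float]) -> List[Tuple[str, str]]:
    """S26-S27: rank conditions so the attribute with the largest dispersion is first."""
    return sorted(conditions.items(),
                  key=lambda kv: dispersion[kv[0]],
                  reverse=True)


def search(records: List[Record],
           conditions: Dict[str, str],
           dispersion: Dict[str, float]) -> List[Record]:
    """S28: apply the conditions in rank order, each one narrowing the previous result."""
    result = records
    for attribute, item in order_conditions(conditions, dispersion):
        result = [r for r in result if r[attribute] == item]
    return result


# Hypothetical data and precomputed sample variances per attribute.
records = [
    {"sex": "male", "occupation": "jewelry processing"},
    {"sex": "female", "occupation": "jewelry processing"},
    {"sex": "male", "occupation": "farming"},
    {"sex": "male", "occupation": "teaching"},
]
dispersion = {"sex": 0.25, "occupation": 6.0}
conditions = {"sex": "male", "occupation": "jewelry processing"}

order = order_conditions(conditions, dispersion)
hits = search(records, conditions, dispersion)
print(order, hits)
```

Filtering on "occupation" first leaves two records for the second pass, whereas filtering on "sex" first would leave three; on a large database this difference in intermediate result size is the saving the description is aiming at.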

【００３８】この場合、図７（ａ）に示したような、多
量のデータが、少ない分類項目に集中した分類項目毎の
度数（データ量）分布表の（標本）分散は、図５（ｃ）
に示すように小さくなる。（ｂ）に示したような、多量
のデータが、多くの分類項目に分かれ、それぞれのデー
タ量に差がある、分類項目毎の度数分布表の（標本）分
散は、図５（ｂ）に示すよう中程度の値となる。（ｃ）
に示したような、多量のデータが、多くの分類項目に分
かれ、それぞれのデータ量に差が余り無い、分類項目毎
の度数分布表の（標本）分散は、図５（ａ）に示すよう
に大きくなる。In this case, when a large amount of data is concentrated in a few classification items, as shown in FIG. 7(a), the (sample) variance of the frequency (data amount) distribution table over the classification items is small, as shown in FIG. 5(c). When a large amount of data is divided among many classification items whose data amounts differ, as shown in (b), the (sample) variance of the frequency distribution table takes a medium value, as shown in FIG. 5(b). When a large amount of data is divided among many classification items whose data amounts differ little, as shown in (c), the (sample) variance of the frequency distribution table is large, as shown in FIG. 5(a).
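The three cases can be checked numerically with made-up frequency tables. The sketch below assumes one of the quantifications the description allows: classification items are ranked by descending data amount and assigned the classes 0, 1, 2, .... The function name `dispersion_of` and all data amounts are hypothetical and are not taken from FIG. 5 or FIG. 7.

```python
from typing import Sequence


def dispersion_of(freqs: Sequence[int]) -> float:
    """Sample variance of a frequency table whose classes are ranks 0, 1, 2, ..."""
    ranked = sorted(freqs, reverse=True)  # largest data amount gets class 0
    total = sum(ranked)
    mean = sum(x * y for x, y in enumerate(ranked)) / total
    return sum((x - mean) ** 2 * y for x, y in enumerate(ranked)) / total


# Hypothetical data amounts per classification item (1000 records each).
concentrated = [980, 20]                                   # few items, data concentrated
uneven = [400, 250, 150, 80, 50, 30, 20, 10, 6, 4]         # many items, unequal amounts
even = [100] * 10                                          # many items, similar amounts

for name, table in [("concentrated", concentrated), ("uneven", uneven), ("even", even)]:
    print(name, dispersion_of(table))
```

Under these assumptions the concentrated table gives the smallest variance, the uneven table a middle value, and the even table the largest, matching the qualitative ordering described above.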

【００３９】図６は、本発明に係る記録媒体の実施の形
態を説明するための説明図である。この実施の形態で
は、所定の時期に、属性毎に、各分類項目に、分類項目
毎のデータ量に応じた所定順に、所定の等差を有する等
差数列の各項を割り当てるステップ（Ｓ２）と、データ
量を度数とし、等差数列の各項をクラスとして、属性毎
にクラス毎の度数分布の標本分散を演算して散らばり程
度とするステップ（Ｓ４）と、属性が異なる複数の分類
項目を検索条件として検索するとき、属性毎のデータ量
の各分類項目への散らばり程度の大小を比較するステッ
プ（Ｓ６）と、その比較結果により、属性が異なる複数
の分類項目を検索条件として検索する場合の検索順位
を、散らばり程度が大である属性の分類項目に優先順位
を与えるように設定するステップ（Ｓ７）と、その設定
した検索順位に従って、散らばり程度が大である属性の
分類項目から検索するステップ（Ｓ８）とを含むコンピ
ュータプログラムＰＲを記録してあるフレキシブルディ
スクＦＤが、ディスクドライブＤＤに装填されている。FIG. 6 is an explanatory diagram for explaining an embodiment of the recording medium according to the present invention. In this embodiment, a flexible disk FD on which a computer program PR is recorded is loaded in a disk drive DD. The computer program PR includes: a step (S2) of assigning, at a predetermined time and for each attribute, the terms of an arithmetic progression having a predetermined common difference to the classification items in a predetermined order according to the data amount of each classification item; a step (S4) of calculating, for each attribute, the sample variance of the frequency distribution over the classes, with the data amount as the frequency and each term of the arithmetic progression as a class, and taking it as the degree of dispersion; a step (S6) of comparing, when a search is made with a plurality of classification items of different attributes as search conditions, the degrees of dispersion of the data amount of each attribute over its classification items; a step (S7) of setting, based on the comparison result, the search order for such a search so that priority is given to the classification items of attributes with a large degree of dispersion; and a step (S8) of searching, in accordance with the set search order, starting from the classification item of the attribute with the largest degree of dispersion.

【００４０】コンピュータプログラムＰＲは、コンピュ
ータＣＰに読み込まれ、コンピュータＣＰを制御する。
その他の構成及び動作は、上述したデータベース管理シ
ステム、データベース検索方法の実施の形態と同様であ
るので、説明を省略する。The computer program PR is read by the computer CP and controls the computer CP.
Other configurations and operations are the same as those of the above-described embodiment of the database management system and the database search method, and thus the description is omitted.

【００４１】尚、上述した実施の形態では、無作為抽出
したデータが属する数量化した分類項目の平均値Ｘ_aを
求めたが、データ量が最多の分類項目Ｘ_mを平均値とし
て演算しても、略同様の結果を得ることができる。この
場合、Ｘ_m＝０として数量化した分類項目を整数‥‥−
２，−１，０，＋１，＋２‥‥とすれば、演算が簡略化
できる。また、上述した実施の形態では、（標本）分散
ｓ²を散らばり程度として利用したが、（標本）標準偏
差ｓを散らばり程度として利用しても同じことである。In the embodiment described above, the average value $X_a$ of the quantified classification items to which the randomly sampled data belong was calculated, but substantially the same result can be obtained by using the classification item $X_m$ with the largest data amount as the average value. In this case, if $X_m = 0$ and the quantified classification items are the integers ..., −2, −1, 0, +1, +2, ..., the calculation can be simplified. Also, in the embodiment described above the (sample) variance $s^2$ was used as the degree of dispersion, but the same holds when the (sample) standard deviation $s$ is used instead.
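Why substituting $X_m$ for $X_a$ gives "substantially the same result" can be seen from a standard identity for second moments. The derivation below is offered as an explanatory sketch using the symbols of expressions (1) and (2); the reference value $c$ is introduced here for illustration and does not appear in the original.

```latex
% For any reference value c, the second moment about c splits into
% the sample variance plus the squared offset of c from the mean:
\frac{1}{Y_t}\sum_{i=1}^{p}(X_i - c)^2\,Y_i
  = \frac{1}{Y_t}\sum_{i=1}^{p}(X_i - X_a)^2\,Y_i + (X_a - c)^2
  = s^2 + (X_a - c)^2 .

% Choosing c = X_m = 0 (the classification item with the largest data amount):
\frac{1}{Y_t}\sum_{i=1}^{p} X_i^2\,Y_i = s^2 + X_a^2 .
```

The simplified quantity therefore exceeds $s^2$ by $(X_a - X_m)^2$, which is small whenever the mean lies close to the most populated classification item, and it avoids the separate pass needed to compute $X_a$. Because $s$ is a monotonically increasing function of $s^2$, ranking attributes by standard deviation gives the same order as ranking them by variance.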

【００４２】また、分類項目を、図４（ｂ）に示すよう
に、データ量が多い順に並べ、データ量が最多の分類項
目Ｘ_m＝０として、数量化した分類項目を整数‥‥０，
１，２，３，４‥‥とし、上述した方法で演算を行って
も、属性毎のデータ量の各分類項目への散らばり程度を
求めることができ、上述した実施の形態と同様の結果を
得ることができる。Alternatively, as shown in FIG. 4(b), the classification items may be arranged in descending order of data amount, with the classification item having the largest data amount set to $X_m = 0$ and the quantified classification items set to the integers 0, 1, 2, 3, 4, .... Performing the calculation by the method described above then also yields the degree of dispersion of the data amount of each attribute over its classification items, and the same result as in the embodiment described above can be obtained.
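The descending-order quantification can be sketched as below. The function name `quantify_descending`, its `start` and `step` parameters, and the example data amounts are hypothetical; `step` stands for the common difference of the arithmetic progression, which is 1 in the example of this paragraph.

```python
from typing import Dict, List, Tuple


def quantify_descending(counts: Dict[str, int],
                        start: float = 0,
                        step: float = 1) -> List[Tuple[str, float, int]]:
    """Rank classification items by descending data amount and assign
    the terms start, start + step, start + 2*step, ... as their classes."""
    ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
    return [(item, start + k * step, amount)
            for k, (item, amount) in enumerate(ranked)]


# Hypothetical data amounts for the classification items of one attribute.
counts = {"farming": 50, "jewelry processing": 5, "teaching": 30}

table = quantify_descending(counts)
for item, x, y in table:
    print(item, x, y)  # classification item, class X_i, data amount Y_i
```

The resulting (class, data amount) pairs are exactly the inputs needed by expressions (1) and (2).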

【００４３】[0043]

【発明の効果】本発明の第１発明に係るデータベース管
理システムに係るデータベース管理システムによれば、
検索条件の最適な検索順位を得て、コンピュータに負担
をかけずに短時間で検索することができる。According to the database management system of the first invention, the optimum search order of the search conditions is obtained, and the search can be performed in a short time without placing a burden on the computer.

【００４４】第２発明に係るデータベース管理システム
及び第６発明に係るデータベース検索方法によれば、多
数の分類項目にデータが分かれ、各分類項目のデータ量
の差が小さくてデータ量の各分類項目への散らばり程度
が大きく、また、大きいデータ量の分類項目を検索する
虞が無い属性から検索できるので、短い時間で検索を終
了することができる。According to the database management system of the second invention and the database search method of the sixth invention, the search can start from an attribute whose data are divided among many classification items, whose classification items differ little in data amount so that the degree of dispersion of the data amount over the classification items is large, and for which there is no risk of searching a classification item with a large data amount. The search can therefore be completed in a short time.

【００４５】第３，４発明に係るデータベース管理シス
テム及び第７発明に係るデータベース検索方法によれ
ば、属性毎のデータ量の各分類項目への散らばり程度を
演算できるので、多数の分類項目にデータが分かれ、各
分類項目のデータ量の差が小さくてデータ量の各分類項
目への散らばり程度が大きく、また、大きいデータ量の
分類項目を検索する虞が無い属性から検索することがで
き、短い時間で検索を終了することができる。According to the database management system of the third and fourth inventions and the database search method of the seventh invention, the degree of dispersion of the data amount of each attribute over its classification items can be calculated. The search can therefore start from an attribute whose data are divided among many classification items, whose classification items differ little in data amount so that the degree of dispersion is large, and for which there is no risk of searching a classification item with a large data amount, and the search can be completed in a short time.

【００４６】第５発明に係るデータベース管理システム
及び第８発明に係るデータベース検索方法によれば、属
性毎のデータ量の各分類項目への散らばり程度を、短時
間に演算できるので、コンピュータの能力に負担を掛け
ることなく、また、短い時間で検索を終了することがで
きる。According to the database management system of the fifth invention and the database search method of the eighth invention, the degree of dispersion of the data amount of each attribute over its classification items can be calculated in a short time, so the search can be completed in a short time without burdening the capacity of the computer.

【００４７】第９発明に係る記録媒体が記録してあるコ
ンピュータプログラムにより制御されるコンピュータに
よれば、多数の分類項目にデータが分かれ、各分類項目
のデータ量の差が小さくデータ量の各分類項目への散ら
ばり程度が大きく、また、大きいデータ量の分類項目を
検索する虞が無い属性から検索できるので、短い時間で
検索を終了することができる。According to a computer controlled by the computer program recorded on the recording medium of the ninth invention, the search can start from an attribute whose data are divided among many classification items, whose classification items differ little in data amount so that the degree of dispersion of the data amount over the classification items is large, and for which there is no risk of searching a classification item with a large data amount. The search can therefore be completed in a short time.

【００４８】第１０発明に係る記録媒体が記録してある
コンピュータプログラムにより制御されるコンピュータ
によれば、属性毎のデータ量の各分類項目への散らばり
程度を演算できるので、多数の分類項目にデータが分か
れ、各分類項目のデータ量の差が小さくデータ量の各分
類項目への散らばり程度が大きく、また、大きいデータ
量の分類項目を検索する虞が無い属性から検索すること
ができ、短い時間で検索を終了することができる。According to a computer controlled by the computer program recorded on the recording medium of the tenth invention, the degree of dispersion of the data amount of each attribute over its classification items can be calculated. The search can therefore start from an attribute whose data are divided among many classification items, whose classification items differ little in data amount so that the degree of dispersion is large, and for which there is no risk of searching a classification item with a large data amount, and the search can be completed in a short time.

【００４９】第１１発明に係る記録媒体が記録してある
コンピュータプログラムにより制御されるコンピュータ
によれば、属性毎のデータ量の各分類項目への散らばり
程度を、短時間に演算できるので、コンピュータの能力
に負担を掛けることなく、また、短い時間で検索を終了
することができる。According to a computer controlled by the computer program recorded on the recording medium of the eleventh invention, the degree of dispersion of the data amount of each attribute over its classification items can be calculated in a short time, so the search can be completed in a short time without burdening the capacity of the computer.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明に係るデータベース管理システム及びデ
ータベース検索方法の実施の形態の構成を示すブロック
図である。FIG. 1 is a block diagram showing a configuration of an embodiment of a database management system and a database search method according to the present invention.

【図２】本発明に係るデータベース管理システムの、属
性毎のデータ量の各分類項目への散らばり程度を演算す
る動作を示すフローチャートである。FIG. 2 is a flowchart showing the operation of the database management system according to the present invention for calculating the degree of dispersion of the data amount for each attribute to each classification item.

【図３】本発明に係るデータベース管理システムの検索
動作を示すフローチャートである。FIG. 3 is a flowchart showing a search operation of the database management system according to the present invention.

【図４】属性毎のデータ量の各分類項目への散らばり程
度を分散として演算する方法を説明するための説明図で
ある。FIG. 4 is an explanatory diagram for describing a method of calculating the degree of dispersion of the data amount for each attribute to each classification item as a variance.

【図５】分散の大小を説明するための説明図である。FIG. 5 is an explanatory diagram for explaining the magnitude of the variance.

【図６】本発明に係る記録媒体の実施の形態を説明する
ための説明図である。FIG. 6 is an explanatory diagram illustrating an embodiment of a recording medium according to the present invention.

【図７】データベースの、データの属性、分類項目及び
データ量の関係を説明するためのデータ量の分布図であ
る。FIG. 7 is a data amount distribution diagram for explaining the relationship among data attributes, classification items, and data amount in a database.

[Explanation of symbols]

１０ＣＰＵ１２ＲＡＭ１３データベース１５ＣＲＴ１８，ＤＤディスクドライブＣＰコンピュータＦＤフレキシブルディスクＰＲコンピュータプログラム 10: CPU; 12: RAM; 13: database; 15: CRT; 18, DD: disk drive; CP: computer; FD: flexible disk; PR: computer program

Claims

[Claims]

1. A database management system for searching, according to a predetermined search condition, a database consisting of a data group having a plurality of attributes, each attribute having one or a plurality of classification items, the database management system comprising: setting means for setting a search order for a search in which a plurality of classification items having different attributes are used as search conditions; and search means for searching according to the search order set by the setting means.

2. The database management system according to claim 1, further comprising comparing means for comparing the degree of dispersion of the data amount of each attribute over its classification items, wherein the setting means, based on a comparison result of the comparing means, gives a priority search order to the classification items of an attribute whose degree of dispersion is large.

3. The database management system according to claim 2, further comprising a calculation unit that calculates the degree of dispersion for each attribute.

4. The database management system according to claim 2, wherein the degree of dispersion is the variance of the frequency distribution over classes, calculated for each attribute by assigning to the classification items, in a predetermined order according to the data amount of each classification item, the terms of an arithmetic progression having a predetermined common difference, with the data amount as the frequency and each of the terms as a class.

5. The database management system according to claim 4, wherein the variance is a sample variance obtained by an estimation method.

6. A database search method for searching, under a predetermined search condition, a database consisting of a data group having a plurality of attributes, each attribute having one or a plurality of classification items, wherein the degrees of dispersion of the data amount of each attribute over its classification items are compared, a search order for a search in which a plurality of classification items having different attributes are used as search conditions is set so as to give priority to the classification items of the attribute having the larger degree of dispersion, and the search is performed according to the search order.

7. The database search method according to claim 6, wherein the degree of dispersion is the variance of the frequency distribution over classes, calculated for each attribute by assigning to the classification items, in a predetermined order according to the data amount of each classification item, the terms of an arithmetic progression having a predetermined common difference, with the data amount as the frequency and each of the terms as a class.

8. The database search method according to claim 7, wherein the variance is a sample variance obtained by an estimation method.

9. A recording medium storing a computer program for searching, under a predetermined search condition, a database consisting of a data group having a plurality of attributes, each attribute having one or a plurality of classification items, the computer program comprising: a step of comparing the degrees of dispersion of the data amount of each attribute over its classification items; a step of setting, based on the comparison result of that step, the search order for a search in which a plurality of classification items having different attributes are used as search conditions so as to give priority to the classification items of the attribute having a large degree of dispersion; and a step of searching according to the search order.

10. The recording medium according to claim 9, wherein the computer program further comprises: a step of assigning to the classification items, in a predetermined order according to the data amount of each classification item, the terms of an arithmetic progression having a predetermined common difference; and a step of calculating, for each of the attributes, the variance of the frequency distribution over classes to obtain the degree of dispersion.

11. The recording medium according to claim 10, wherein the variance is a sample variance obtained by an estimation method.