JP2003030231A

JP2003030231A - Document search service providing method and apparatus, document search service providing program, and storage medium storing document search service providing program

Info

Publication number: JP2003030231A
Application number: JP2001210873A
Authority: JP
Inventors: Koichi Ushijima; 浩一牛島
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2001-07-11
Filing date: 2001-07-11
Publication date: 2003-01-31

Abstract

(57)【要約】【課題】特定の話題に興味を持つ人々が各人の推薦す
る最新のＷＷＷページ情報を共有することを容易にする
と共に、特定の話題に特化したＷＷＷページ検索サービ
スを構築する。【解決手段】本発明は、情報登録者端末から入力され
たＷＷＷサイト情報及び登録者名をＷＷＷサイト情報Ｄ
Ｂに記憶し、指定されたＷＷＷサイトから該ＷＷＷサイ
トに含まれるＷＷＷページ群を取得し、検索用ＤＢに格
納し、検索時に、検索キーワードに関連するＷＷＷペー
ジ群を検索用ＤＢから検索し、ＷＷＷサイト毎にグルー
プ化し、グループ毎に、ＷＷＷサイト情報と、情報登録
者の名前を付加したページを作成し、検索利用者端末に
表示する。 (57) [Summary] [Problem] To make it easy for people who are interested in a specific topic to share the latest WWW page information recommended by each person, and to provide a WWW page search service specialized for a specific topic. To construct. According to the present invention, WWW site information and a registrant name input from an information registrant terminal are converted to WWW site information D.
B, obtains a group of WWW pages included in the specified WWW site from the designated WWW site, stores it in the search DB, and searches a group of WWW pages related to the search keyword from the search DB at the time of search. Grouping is performed for each WWW site, and a page is created for each group, to which WWW site information and the name of the information registrant are added, and displayed on the search user terminal.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、文書検索サービス
提供方法及び装置及び文書検索サービス提供プログラム
及び文書検索サービス提供プログラムを格納した記憶媒
体に係り、特に、ネットワーク上に存在する文書集合か
ら特定の条件に適合する文書を探し出す検索システムに
おいて、検索対象となる文書集合を収集するための文書
検索サービス提供方法及び装置及び文書検索サービス提
供プログラム及び文書検索サービス提供プログラムを格
納した記憶媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document search service providing method and apparatus, a document search service providing program, and a storage medium storing the document search service providing program. The present invention relates to a document search service providing method and apparatus for collecting a set of documents to be searched, a document search service providing program, and a storage medium storing the document search service providing program in a search system that searches for documents that match a condition.

【０００２】[0002]

【従来の技術】従来、特定の話題に興味を持つ人々の間
で、どこのＷＷＷサイトにどのような情報があるかとい
う情報を共有するために、リンク集と呼ばれるページを
作成する方法が採られている。リンク集は図１０のよう
なものであり、ＷＷＷのハイパーリンク機能を用いて、
関連するＷＷＷサイトへのリンクを配置し、リンク先の
タイトルやロゴ、簡単な紹介文などを付加したものであ
る。利用者はこの紹介文を見て、どのサイトに所望の情
報があるかを判断することができる。2. Description of the Related Art Conventionally, a method of creating a page called a link collection has been adopted in order to share information about which WWW site has what information among people who are interested in a particular topic. Has been. The link collection is as shown in Fig. 10. Using the hyperlink function of WWW,
Links to related WWW sites are placed, and the titles, logos, and brief introductions of the links are added. The user can judge which site has desired information by looking at this introduction.

【０００３】しかし、従来のリンク集を用いる方法で
は、関連サイトが多くなるにつれてリンク集から所望の
サイトを探すことが困難となっている。そこで考えられ
たのが、“Ｙａｈｏｏ”に代表されるディレクトリ型検
索サービスである。これは、人手により、ＷＷＷサイト
のカテゴリ分けや説明文の入力を行なっており、これを
元にＷＷＷサイトの検索を行なうシステムである。ディ
レクトリ型検索サービスはいわば、すべての話題を扱う
大規模なリンク集に検索機能を付加したものであると言
える。However, in the conventional method using the link collection, it becomes difficult to find a desired site from the link collection as the number of related sites increases. Then, a directory-type search service represented by "Yahoo" was considered. This is a system for manually categorizing WWW sites and inputting explanations, and searching for WWW sites based on this. It can be said that a directory-type search service is, in a sense, a large collection of links that deal with all topics, with a search function added.

【０００４】一方、増加し続けるＷＷＷページを探し出
すためのサービスとして、ロボット型検索と呼ばれる検
索サービスも広く利用されている。ロボット型検索サー
ビスは、ＷＷＷページのリンクを辿りながら自動的にＷ
ＷＷページを収集する「情報収集ロボット」あるいは
「クローラー」と呼ばれるプログラムを使用している。
これらは、機械的に情報の収集が行なわれるため、リン
ク集やディレクトリ型検索サービスのようにカテゴリや
紹介文を提供することはできないが、新しく現れたＷＷ
Ｗページを自動的に発見することが可能であることや、
ＷＷＷページそのものを検索対象としているためにディ
レクトリ型検索サービスでは発見できないようなＷＷＷ
ページも見つけ出すことができる。また、情報収集ロボ
ットにより絶えず情報が更新されているため、最新の情
報に基づいた検索を行なうことができる。On the other hand, as a service for searching for an ever-increasing number of WWW pages, a search service called robot type search is also widely used. The robot-type search service automatically follows the WWW page links
It uses a program called "information gathering robot" or "crawler" that collects WW pages.
Since they collect information mechanically, they cannot provide categories or introductory sentences like link collections and directory-type search services, but they are new to WW.
W pages can be automatically discovered,
WWW that cannot be found by the directory-type search service because the WWW page itself is the search target
You can also find the page. Also, since the information collecting robot constantly updates the information, it is possible to perform a search based on the latest information.

【０００５】ロボット型検索サービスのこのような利点
を生かしながら、特定の話題のみに特化した検索サービ
スを提供することも行なわれている。これは、ロボット
が巡回するＷＷＷサイトを、特定の話題に関連したＷＷ
Ｗサイトのみに限定することによって実現されている。While making use of such advantages of the robot type search service, it is also provided to provide a search service specialized only for a specific topic. This is a WWW site where robots travel around, and a WW related to a specific topic.
It is realized by limiting to W site only.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、上記従
来のリンク集やディレクトリ検索サービスでは、更新を
手作業に負っているため、情報が古くなりやすく、リン
ク先が既に存在しなくなっている、いわゆるリンク切れ
や、新しいＷＷＷサイトの情報が存在しない、という状
態が発生し易い。特に、ディレクトリサービスでは、デ
ータ登録時の情報を検索対象としているため、最近のキ
ーワードでは検索できないことも多い。However, in the above-described conventional link collections and directory search services, since updating is manually performed, information is apt to become old, and so-called link destinations no longer exist. It is easy for a situation such as a disconnection or the absence of new WWW site information. In particular, since the directory service targets the information at the time of data registration as a search target, it is often impossible to search by the recent keywords.

【０００７】また、ロボット型検索サービスでは、情報
の更新は、頻繁に行なわれるものの、特定の話題に検索
対象を制限することができず、検索結果に利用者にとっ
て必要のないＷＷＷページが大量に含まれることにな
る。また、紹介文やカテゴリなどの付加情報がないた
め、利用者が検索結果の中からどのリンクを選べば良い
かを判断することが難しくなっている。Further, in the robot type search service, although information is updated frequently, the search target cannot be limited to a specific topic, and a large number of WWW pages that are not needed by the user are included in the search results. Will be included. Moreover, since there is no additional information such as an introduction sentence and a category, it is difficult for the user to determine which link should be selected from the search results.

【０００８】また、ロボットが巡回するＷＷＷサイトを
制限して話題を特定する方法では、ロボット型検索サー
ビスの特徴である新しいＷＷＷサイトを発見する機能が
生きず、やはり情報が古くなるという問題が発生する。[0008] Further, in the method of limiting the WWW sites that the robot crawls to identify a topic, the function of discovering a new WWW site, which is a feature of the robot type search service, does not work, and information also becomes old. To do.

【０００９】本発明は、上記の点に鑑みなされたもの
で、特定の話題に興味を持つ人々が各人の推薦する最新
のＷＷＷページ情報を共有することを容易にすると共
に、特定の話題に特化したＷＷＷページ検索サービスを
構築するための文書検索サービス提供方法及び装置及び
文書検索サービス提供プログラム及び文書検索サービス
提供プログラムを格納した記憶媒体を提供することを目
的とする。The present invention has been made in view of the above points, and makes it easy for people who are interested in a particular topic to share the latest WWW page information recommended by each person, and An object of the present invention is to provide a document search service providing method and apparatus for constructing a specialized WWW page search service, a document search service providing program, and a storage medium storing the document search service providing program.

【００１０】[0010]

【課題を解決するための手段】図１は、本発明の原理を
説明するための図である。FIG. 1 is a diagram for explaining the principle of the present invention.

【００１１】本発明（請求項１）は、コンピュータネッ
トワーク上に存在するＷＷＷページの集合から所望の文
書を発見するための文書検索サービス提供方法におい
て、情報登録者の情報登録者端末からのアクセスに対し
て、情報登録者の認証を行い（ステップ１）、情報登録
者端末に対して、検索対象となるＷＷＷサイトの名前、
ＵＲＬ、紹介文を含むＷＷＷサイト情報の入力を促し
（ステップ２）、情報登録者端末から入力されたＷＷＷ
サイト情報及び登録者名をＷＷＷサイト情報データベー
スに記憶し（ステップ３）、指定されたＷＷＷサイトか
ら該ＷＷＷサイトに含まれるＷＷＷページ群を取得し、
検索用データベースに格納し（ステップ４）、検索利用
者の検索利用者端末に対して、検索キーワードの入力を
促し（ステップ５）、検索利用者端末から入力された検
索キーワードに関連するＷＷＷページ群を検索用データ
ベースから検索し（ステップ６）、検索されたＷＷＷペ
ージ群をＷＷＷサイト毎にグループ化し（ステップ
７）、グループ毎に、ＷＷＷサイトの名前、紹介文を含
むＷＷＷサイト情報と、情報登録者の名前を付加したペ
ージを作成し（ステップ８）、作成されたページを検索
結果として検索利用者端末に表示する（ステップ９）。The present invention (Claim 1) provides a method for providing a document search service for finding a desired document from a set of WWW pages existing on a computer network, wherein the information registrant accesses from an information registrant terminal. On the other hand, the information registrant is authenticated (step 1), and the name of the WWW site to be searched is displayed on the information registrant terminal.
WWW input from the information registrant terminal prompting for input of WWW site information including URL and introduction text (step 2)
The site information and the registrant name are stored in the WWW site information database (step 3), the WWW page group included in the WWW site is acquired from the designated WWW site,
Stored in the search database (step 4), prompting the search user's search user terminal to enter the search keyword (step 5), and grouping WWW pages related to the search keyword input from the search user terminal. Is searched from the search database (step 6), the searched WWW page group is grouped by WWW site (step 7), and WWW site information including the name of WWW site and introductory text is registered for each group. A page to which the name of the person is added is created (step 8), and the created page is displayed as a search result on the search user terminal (step 9).

【００１２】本発明（請求項２）は、コンピュータネッ
トワーク上に存在するＷＷＷページの集合から所望の文
書を発見するための文書検索サービス提供方法におい
て、情報登録者の情報登録者端末からのアクセスに対し
て、情報登録者の認証を行い、情報登録者端末に対し
て、検索対象となるＷＷＷサイトの名前、ＵＲＬ、紹介
文を含むＷＷＷサイト情報の入力を促し、情報登録者端
末から入力されたＷＷＷサイト情報及び登録者名をＷＷ
Ｗサイト情報データベースに記憶し、指定されたＷＷＷ
サイトから該ＷＷＷサイトに含まれるＷＷＷページ群を
取得し、検索用データベースに格納する。The present invention (Claim 2) provides a method for providing a document search service for finding a desired document from a set of WWW pages existing on a computer network, wherein access is made from an information registrant terminal of an information registrant. On the other hand, the information registrant is authenticated, the information registrant terminal is prompted to enter the WWW site information including the name, URL, and introductory text of the WWW site to be searched. WWW site information and registrant name WW
Stored in the W site information database and designated WWW
A WWW page group included in the WWW site is acquired from the site and stored in the search database.

【００１３】本発明（請求項３）は、コンピュータネッ
トワーク上に存在するＷＷＷページの集合から所望の文
書を発見するための文書検索サービス提供方法におい
て、検索利用者の検索利用者端末に対して、検索キーワ
ードの入力を促し、検索利用者端末から入力された検索
キーワードに関連するＷＷＷページ群を検索用データベ
ースから検索し、検索されたＷＷＷページ群をＷＷＷサ
イト毎にグループ化し、グループ毎に、ＷＷＷサイトの
名前、紹介文を含むＷＷＷサイト情報と、情報登録者の
名前を付加したページを作成し、作成されたページを検
索結果として検索利用者端末に表示する。According to the present invention (claim 3), in a document search service providing method for finding a desired document from a set of WWW pages existing on a computer network, a search user terminal of a search user is provided with: It prompts the user to enter a search keyword, searches the search database for WWW page groups related to the search keyword input from the search user terminal, groups the searched WWW page groups for each WWW site, and for each group, WWW WWW site information including the site name and introductory text and a page to which the name of the information registrant is added are created, and the created page is displayed as a search result on the search user terminal.

【００１４】本発明（請求項４）は、所定の条件を満た
している場合に、ＷＷＷサイト情報データベースに保存
されているＷＷＷサイト情報を読み出し、ＷＷＷサイト
情報に含まれているＵＲＬからＷＷＷページ群を取得
し、取得したＷＷＷページを元に、検索用データベース
を更新する。According to the present invention (claim 4), when a predetermined condition is satisfied, WWW site information stored in the WWW site information database is read, and the WWW page group is read from the URL included in the WWW site information. Is acquired, and the search database is updated based on the acquired WWW page.

【００１５】図２は、本発明の原理構成図である。FIG. 2 is a block diagram showing the principle of the present invention.

【００１６】本発明（請求項５）は、コンピュータネッ
トワーク上に存在するＷＷＷページの集合から所望の文
書を発見するための文書検索サービス提供装置であっ
て、情報登録者の認証を行なう会員認証手段１１と、会
員認証手段１１により認証された情報登録者からＵＲ
Ｌ、タイトル、紹介文を含むＷＷＷサイト情報を受け付
けるＷＷＷサイト情報入力手段１２と、ＷＷＷサイト情
報入力手段１２からのＷＷＷサイト情報及び情報登録者
名を保持するＷＷＷサイト情報保持手段１５と、ＷＷＷ
サイト情報保持手段１５に保持されたＷＷＷサイト情報
に基づいて、ＷＷＷページ情報を収集するＷＷＷページ
収集手段１４と、ＷＷＷページ収集手段１４によって収
集されたＷＷＷページの情報を検索可能な形で保持する
ＷＷＷページ索引保持手段１６と、検索利用者の検索利
用者端末４０からのアクセスに対して検索キーワードの
入力を促し、該検索利用者端末４０から入力された検索
キーワードに関連するＷＷＷページ群を検索し、ＷＷＷ
サイト毎にグループ化し、ＷＷＷサイト情報及び情報登
録者名を付加した検索結果を該検索利用者端末４０に表
示する検索手段１７とを有する。The present invention (Claim 5) is a document retrieval service providing apparatus for finding a desired document from a set of WWW pages existing on a computer network, and a member authentication means for authenticating an information registrant. 11 and UR from the information registrant authenticated by the member authentication means 11.
WWW site information input means 12 for receiving WWW site information including L, title, and introduction text, WWW site information holding means 15 for holding WWW site information and information registrant name from WWW site information input means 12, and WWW
Based on the WWW site information stored in the site information storage unit 15, a WWW page collection unit 14 that collects WWW page information, and WWW page information collected by the WWW page collection unit 14 is stored in a searchable form. The WWW page index holding unit 16 and the search user's access to the search user terminal 40 are prompted to enter the search keyword, and the WWW page group related to the search keyword input from the search user terminal 40 is searched. And WWW
A search unit 17 is provided for grouping the sites and displaying the search results with the WWW site information and the information registrant name added on the search user terminal 40.

【００１７】本発明（請求項６）は、コンピュータネッ
トワーク上に存在するＷＷＷページの集合から所望の文
書を発見するための文書検索サービス提供装置であっ
て、情報登録者の認証を行なう会員認証手段と、会員認
証手段により認証された情報登録者からＵＲＬ、タイト
ル、紹介文を含むＷＷＷサイト情報を受け付けるＷＷＷ
サイト情報入力手段と、ＷＷＷサイト情報入力手段から
のＷＷＷサイト情報及び情報登録者名を保持するＷＷＷ
サイト情報保持手段と、ＷＷＷサイト情報保持手段に保
持されたＷＷＷサイト情報に基づいて、ＷＷＷページ情
報を収集するＷＷＷページ収集手段と、ＷＷＷページ収
集手段によって収集されたＷＷＷページの情報を検索可
能な形で保持するＷＷＷページ索引保持手段とを有す
る。The present invention (Claim 6) is a document retrieval service providing apparatus for discovering a desired document from a set of WWW pages existing on a computer network, and a member authenticating means for authenticating an information registrant. And WWW that accepts WWW site information including URL, title, and introductory text from the information registrant authenticated by the member authentication means.
Site information input means, WWW holding WWW site information and information registrant name from WWW site information input means
Site information holding means, WWW page collection means for collecting WWW page information based on the WWW site information held in the WWW site information holding means, and WWW page information collected by the WWW page collection means can be searched And WWW page index holding means for holding in a form.

【００１８】本発明（請求項７）は、コンピュータネッ
トワーク上に存在するＷＷＷページの集合から所望の文
書を発見するための文書検索サービス提供装置であっ
て、検索利用者の検索利用者端末からのアクセスに対し
て検索キーワードの入力を促し、該検索利用者端末から
入力された検索キーワードに関連するＷＷＷページ群を
検索し、ＷＷＷサイト毎にグループ化し、ＷＷＷサイト
情報及び情報登録者名を付加した検索結果を該検索利用
者端末に表示する検索手段とを有する。The present invention (Claim 7) is a document search service providing apparatus for finding a desired document from a set of WWW pages existing on a computer network, which is provided from a search user terminal of a search user. Prompting the user to enter a search keyword for access, searching for WWW page groups related to the search keyword input from the search user terminal, grouping by WWW site, and adding WWW site information and information registrant name. And a search means for displaying the search result on the search user terminal.

【００１９】本発明（請求項８）は、時間を計測するタ
イマと、タイマが一定の時間を計数する度に、ＷＷＷペ
ージ収集手段において収集されたＷＷＷページで、ＷＷ
Ｗページ索引保持手段に保持されているＷＷＷページ情
報を更新する更新手段とを更に有する。According to the present invention (claim 8), a timer for measuring time, and a WWW page collected by the WWW page collection means each time the timer counts a certain time, the WW is collected.
It further comprises update means for updating the WWW page information held in the W page index holding means.

【００２０】本発明（請求項９）は、コンピュータネッ
トワーク上に存在するＷＷＷページの集合から所望の文
書を発見するための文書検索サービス提供プログラムで
あって、情報登録者の認証を行なう会員認証プロセス
と、会員認証プロセスにより認証された情報登録者から
ＵＲＬ、タイトル、紹介文を含むＷＷＷサイト情報を受
け付けるＷＷＷサイト情報入力プロセスと、ＷＷＷサイ
ト情報入力プロセスからのＷＷＷサイト情報及び情報登
録者名を保持するＷＷＷサイト情報データベースに保持
されたＷＷＷサイト情報に基づいて、ＷＷＷページ情報
を収集するＷＷＷページ収集プロセスと、ＷＷＷページ
収集プロセスによって収集されたＷＷＷページの情報を
検索可能な形で保持するＷＷＷページ索引データベース
に格納するＷＷＷページ索引格納プロセスとを有する。The present invention (claim 9) is a document retrieval service providing program for discovering a desired document from a set of WWW pages existing on a computer network, which is a member authentication process for authenticating an information registrant. And WWW site information input process that receives WWW site information including URL, title, and introduction from the information registrant authenticated by the member authentication process, and WWW site information and information registrant name from the WWW site information input process A WWW page collection process that collects WWW page information based on the WWW site information stored in the WWW site information database, and a WWW page that retains the WWW page information collected by the WWW page collection process in a searchable form WWW page stored in the index database And a di-index storage process.

【００２１】本発明（請求項１０）は、コンピュータネ
ットワーク上に存在するＷＷＷページの集合から所望の
文書を発見するための文書検索サービス提供プログラム
であって、検索利用者の検索利用者端末からのアクセス
に対して検索キーワードの入力を促し、該検索利用者端
末から入力された検索キーワードに関連するＷＷＷペー
ジ群を検索し、ＷＷＷサイト毎にグループ化し、ＷＷＷ
サイト情報及び情報登録者名を付加した検索結果を該検
索利用者端末に表示する検索プロセスとを有する。本発
明（請求項１１）は、時間を計測するタイマが一定の時
間を計数する度に、ＷＷＷページ収集プロセスにおいて
収集されたＷＷＷページで、ＷＷＷページ索引データベ
ースに保持されているＷＷＷページ情報を更新する更新
プロセスとを更に有する。The present invention (claim 10) is a program for providing a document search service for finding a desired document from a set of WWW pages existing on a computer network, the program being provided from a search user terminal of a search user. Prompt the user to enter a search keyword for access, search for WWW page groups related to the search keyword input from the search user terminal, group by WWW site, WWW
And a search process for displaying the search result added with the site information and the information registrant name on the search user terminal. The present invention (Claim 11) updates the WWW page information held in the WWW page index database with the WWW page collected in the WWW page collection process every time the timer for counting time counts a certain time. And an update process to perform.

【００２２】本発明（請求項１２）は、コンピュータネ
ットワーク上に存在するＷＷＷページの集合から所望の
文書を発見するための文書検索サービス提供プログラム
を格納した記憶媒体であって、情報登録者の認証を行な
う会員認証プロセスと、会員認証プロセスにより認証さ
れた情報登録者からＵＲＬ、タイトル、紹介文を含むＷ
ＷＷサイト情報を受け付けるＷＷＷサイト情報入力プロ
セスと、ＷＷＷサイト情報入力プロセスからのＷＷＷサ
イト情報及び情報登録者名を保持するＷＷＷサイト情報
データベースに保持されたＷＷＷサイト情報に基づい
て、ＷＷＷページ情報を収集するＷＷＷページ収集プロ
セスと、ＷＷＷページ収集プロセスによって収集された
ＷＷＷページの情報を検索可能な形で保持するＷＷＷペ
ージ索引データベースに格納するＷＷＷページ索引格納
プロセスとを有する。The present invention (Claim 12) is a storage medium storing a document search service providing program for finding a desired document from a set of WWW pages existing on a computer network, the authentication of an information registrant. W including the URL, title, and introductory text from the information registrant authenticated by the member authentication process
Collect WWW page information based on the WWW site information input process that accepts WW site information, and the WWW site information stored in the WWW site information database that holds the WWW site information and the information registrant name from the WWW site information input process And a WWW page index storing process for storing the information of the WWW pages collected by the WWW page collecting process in a WWW page index database that holds the WWW page information in a searchable manner.

【００２３】本発明（請求項１３）は、コンピュータネ
ットワーク上に存在するＷＷＷページの集合から所望の
文書を発見するための文書検索サービス提供プログラム
を格納した記憶媒体であって、検索利用者の検索利用者
端末からのアクセスに対して検索キーワードの入力を促
し、該検索利用者端末から入力された検索キーワードに
関連するＷＷＷページ群を検索し、ＷＷＷサイト毎にグ
ループ化し、ＷＷＷサイト情報及び情報登録者名を付加
した検索結果を該検索利用者端末に表示する検索プロセ
スとを有する。本発明（請求項１４）は、時間を計測す
るタイマが一定の時間を計数する度に、ＷＷＷページ収
集プロセスにおいて収集されたＷＷＷページで、ＷＷＷ
ページ索引データベースに保持されているＷＷＷページ
情報を更新する更新プロセスとを更に有する。The present invention (Claim 13) is a storage medium storing a document search service providing program for finding a desired document from a set of WWW pages existing on a computer network. Prompt the user to enter a search keyword for access from the user terminal, search for WWW page groups related to the search keyword input from the search user terminal, group by WWW site, and register WWW site information and information. And a search process for displaying the search result with the person name added on the search user terminal. According to the present invention (claim 14), the WWW page collected in the WWW page collection process is used every time the timer for counting time counts a certain time.
And an update process for updating the WWW page information held in the page index database.

【００２４】上記のように、本発明では、インターネッ
ト上のホームページ等の情報の在処を提供する検索サー
ビス（例えば、サーチエンジン、Ｙａｈｏｏのようなデ
ィレクトリ）において、登録型とロボット検索型の両方
の長所を併せ持つシステムを構築することで、情報登録
者は、ＷＷＷサイトの情報をネットワーク上から自由に
登録することができる。これにより、検索サービスの対
象が固定されず、情報登録者の働きかけによって検索対
象を拡張することを可能とする。As described above, according to the present invention, in a search service (for example, a search engine, a directory such as Yahoo) that provides the location of information such as a home page on the Internet, the advantages of both the registration type and the robot search type are provided. By constructing a system having both, the information registrant can freely register the information of the WWW site from the network. As a result, the target of the search service is not fixed, and the search target can be expanded by the action of the information registrant.

【００２５】ここで、情報登録者は、検索利用者であっ
てもよい。この場合、検索サービスを利用する利用者コ
ミュニティ内の興味の変化に応じて検索対象を変化させ
ることができるようになり、検索サービスの利用価値を
向上させることができる。Here, the information registrant may be a search user. In this case, the search target can be changed according to the change of interest in the user community that uses the search service, and the utility value of the search service can be improved.

【００２６】また、本発明では、実際にＷＷＷサイトか
らＷＷＷページ群を取得し、検索対象としているため、
実際にＷＷＷページ中に現れるキーワードで検索できる
ようになる。また、実際には存在しないＵＲＬが入力さ
れたとしてもこの段階でＷＷＷページが取得できないた
め、リンク切れを未然に防ぐことができる。Further, in the present invention, since the WWW page group is actually acquired from the WWW site and is set as the search target,
You will be able to search by the keywords that actually appear in the WWW page. Further, even if a URL that does not actually exist is input, the WWW page cannot be acquired at this stage, so it is possible to prevent the link from being broken.

【００２７】また、本発明による検索結果は、ＷＷＷサ
イト毎に纏められ、ＷＷＷサイトを登録した情報登録者
の名前と紹介文が提示される。検索利用者は、この紹介
文により適切なＷＷＷサイトを選別することが可能とな
る。また、情報登録者の名前が提示されるため、コミュ
ニケーション内での信頼関係を利用した選別、例えば、
「ある分野の専門家であるＡさんの紹介ならば有効だろ
う」、あるいは、「自分と趣味の似ているＢさんの推薦
は重要だ」、という選別ができるようになる。また、本
発明では、情報登録者の名前を提示するため、コミュニ
ティの話題にそぐわないＷＷＷサイトを登録することに
対する抑止効果が期待できる。The search results according to the present invention are summarized for each WWW site, and the name and the introductory text of the information registrant who registered the WWW site are presented. A search user can select an appropriate WWW site by this introduction. Also, since the name of the information registrant is presented, selection using the trust relationship in communication, for example,
You will be able to select "It would be effective to introduce Mr. A, who is an expert in a certain field," or "It is important to recommend Mr. B who has a hobby similar to me." Further, in the present invention, since the name of the information registrant is presented, it is possible to expect a deterrent effect on registering a WWW site that does not fit the topic of the community.

【００２８】これらの機能より、コミュニティの活性化
が図られ、特定の話題に向けた検索サービスの情報を最
新に維持できるようになる。With these functions, the community is activated and the information of the search service for a specific topic can be kept up to date.

【００２９】また、本発明では、所定の条件を満たした
場合に、保存されているＷＷＷサイト情報を読み出し、
ＷＷＷサイト情報に含まれているＵＲＬからＷＷＷペー
ジ群を取得し、取得したＷＷＷページを元に検索用デー
タベースを更新することが可能であり、例えば、１日１
回のように、定期的にＷＷＷサイトから最新のＷＷＷペ
ージ情報を取得し、検索用データベースを更新する。こ
れにより、検索対象の情報を最新に保ち、かつリンク切
れを防ぐことが可能となる。Further, in the present invention, when the predetermined condition is satisfied, the stored WWW site information is read,
It is possible to acquire the WWW page group from the URL included in the WWW site information and update the search database based on the acquired WWW page.
Like the times, the latest WWW page information is periodically acquired from the WWW site and the search database is updated. This makes it possible to keep the information to be searched up to date and prevent broken links.

【００３０】[0030]

【発明の実施の形態】以下、図面と共に本発明の実施の
形態について説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings.

【００３１】図３は、本発明の一実施の形態における文
書検索サービス提供装置の構成を示す。FIG. 3 shows the configuration of a document search service providing apparatus according to an embodiment of the present invention.

【００３２】同図示す装置１０は、情報登録者端末２
０、ＷＷＷサーバ３０、検索利用者端末４０に接続され
ている。The device 10 shown in FIG.
0, WWW server 30, and search user terminal 40.

【００３３】情報登録者端末２０及び情報利用者端末４
０には、ＷＷＷブラウザがインストールされている。例
えば、マイクロソフト社の「インターネットエクスプロ
ーラ」等などで本装置１０にアクセスすることができ
る。Information registrant terminal 20 and information user terminal 4
0 has a WWW browser installed. For example, the device 10 can be accessed by "Internet Explorer" of Microsoft Corporation.

【００３４】装置１０は、サーバコンピュータ上に構成
されており、情報登録者端末２０、情報利用者端末４０
からの要求を受け付けるためにＨＴＴＰサーバを利用す
ることができる。例えば、フリーソフトの“ａｐａｃｈ
ｅ（アパッチ）”等を利用してもよい。The device 10 is constructed on a server computer, and has an information registrant terminal 20 and an information user terminal 40.
An HTTP server can be used to accept requests from. For example, the free software "apach"
e (Apache) ”or the like may be used.

【００３５】装置１０は、会員認証部１１、ＷＷＷサイ
ト情報入力部１２、タイマ１３、ＷＷＷページ収集部１
４、ＷＷＷサイト情報保持部１５、ＷＷＷぺージ索引保
持部１６、検索部１７より構成される。The device 10 includes a member authentication unit 11, a WWW site information input unit 12, a timer 13, and a WWW page collection unit 1.
4, WWW site information holding unit 15, WWW page index holding unit 16, and search unit 17.

【００３６】このうち、ＷＷＷサイト情報入力部１２、
検索部１７は、ＨＴＴＰサーバの機能拡張方法であるＣ
ＧＩインタフェースを使って作成することができる。Of these, the WWW site information input section 12,
The search unit 17 is C, which is an HTTP server function expansion method.
It can be created using the GI interface.

【００３７】会員認証部１１は、ＨＴＴＰサーバの認証
機構をそのまま利用してもよい。他に認証サーバを用意
して、認証結果を“Ｃｏｏｋｉｅ”などの技術を使って
伝えるという一般的な手法で実現してもよい。The member authentication section 11 may use the authentication mechanism of the HTTP server as it is. Alternatively, an authentication server may be prepared and the authentication result may be transmitted using a technique such as "Cookie".

【００３８】タイマ１３は、設定された時間間隔で信号
を出力し、ＷＷＷページ収集部１４を起動させることが
できる。タイマ１３は、例えば、ＵＮＩＸ（登録商標）
の“ｃｒｏｎ”などの機能を使って実現することができ
る。The timer 13 can output a signal at a set time interval to activate the WWW page collection unit 14. The timer 13 is, for example, UNIX (registered trademark).
It can be realized by using a function such as "cron" of.

【００３９】ＷＷＷページ収集部１４は、一般に情報収
集ロボットと呼ばれるものであり、フリーソフトウェア
の“ｗｇｅｔ”または、“ｈｔｔｐｄｏｗｎ”などのソ
フトウェアを使えばよい。The WWW page collection unit 14 is generally called an information collection robot, and may use software such as free software "wget" or "httpdown".

【００４０】ＷＷＷサイト情報保持部１５は、ＷＷＷサ
イト情報ＤＢ１５１を含んでいる。ＷＷＷサイト情報Ｄ
Ｂ１５１は、市販のＲＤＢＭＳを使って実現することが
できる。The WWW site information holding section 15 includes a WWW site information DB 151. WWW site information D
B151 can be realized by using a commercially available RDBMS.

【００４１】ＷＷＷページ索引保持部１６は、ＷＷＷペ
ージＤＢ１６１と、全文検索エンジン１６２を含んでい
る。ＷＷＷページＤＢ１６１は、ＷＷＷサイト情報ＤＢ
１５１と同様に、市販のＲＤＢＭＳを使って実現するこ
とができる。全文検索エンジン１６２は、全文検索機能
を提供するもので、“Ｎ−ｇｒａｍ”を使ったもの、形
態素解析を使ったもの、ベクトル空間法などの手法を使
ったものが一般に使用されている。The WWW page index holding unit 16 includes a WWW page DB 161 and a full-text search engine 162. The WWW page DB 161 is a WWW site information DB
Similar to 151, it can be realized by using a commercially available RDBMS. The full-text search engine 162 provides a full-text search function, and generally, the one using “N-gram”, the one using morphological analysis, and the one using a method such as a vector space method are generally used.

【００４２】次に、本発明の動作を説明する。Next, the operation of the present invention will be described.

【００４３】最初に、情報登録時の動作について説明す
る。First, the operation at the time of information registration will be described.

【００４４】図４は、本発明の一実施の形態における情
報登録動作のフローチャートである。FIG. 4 is a flowchart of the information registration operation according to the embodiment of the present invention.

【００４５】ステップ１０１）まず、情報登録者端末
２０からのアクセスに対して、ログイン即ち、会員認証
が完了しているかどうかを検査する。ログインが完了し
ていなければ、ステップ１１１に移行する。Step 101) First, for the access from the information registrant terminal 20, it is checked whether or not login, that is, member authentication is completed. If the login is not completed, the process proceeds to step 111.

【００４６】ステップ１０２）ログインが完了してい
れば、ＷＷＷサイト情報が入力されているかどうかを確
認する。ここで、ＷＷＷサイト情報が入力されていなけ
れば、ステップ１２１に移行する。ＷＷＷサイト情報が
入力されている場合には、ステップ１０３に移行する。Step 102) If the login is completed, it is confirmed whether the WWW site information has been entered. Here, if the WWW site information has not been input, the process proceeds to step 121. If the WWW site information has been input, the process proceeds to step 103.

【００４７】ステップ１０３）図５に示すＷＷＷ情報
登録画面（詳細は、ステップ１２１で後述する）で入力
されたＷＷＷサイト情報と認証によって取得された情報
登録者ＩＤをＷＷＷサイト情報ＤＢ１５１に格納する。Step 103) The WWW site information entered on the WWW information registration screen (details will be described later in step 121) shown in FIG. 5 and the information registrant ID obtained by the authentication are stored in the WWW site information DB 151.

【００４８】ステップ１０４）ＷＷＷページ収集部１
７により、指定されたＷＷＷサイトからＷＷＷページを
収集する。このとき、図５の「ハイパーリンクを辿る」
チェックボックス１０３の状態によってＷＷＷページ収
集部１７の動作を替える。Step 104) WWW page collection unit 1
7, collect WWW pages from the designated WWW site. At this time, "follow hyperlink" in FIG.
The operation of the WWW page collection unit 17 is changed depending on the state of the check box 103.

【００４９】ステップ１０５）ＷＷＷページ収集部１
７でＷＷＷページの取得が成功したかどうかを検査す
る。もし取得に失敗していれば、ステップ１３１に移行
する。取得に成功していればステップ１０６に移行す
る。Step 105) WWW page collection unit 1
In step 7, check whether the WWW page has been successfully acquired. If the acquisition has failed, the process proceeds to step 131. If the acquisition is successful, the process proceeds to step 106.

【００５０】ステップ１０６）情報の取得に成功して
いれば、「ご登録ありがとうございました」というメッ
セージを表示する。Step 106) If the information acquisition is successful, a message "Thank you for registration" is displayed.

【００５１】ステップ１０７）取得したＷＷＷページ
に基づいてＷＷＷページＤＢ１６１及び全文検索エンジ
ン１６２を更新し、終了する。Step 107) The WWW page DB 161 and the full-text search engine 162 are updated based on the acquired WWW page, and the process ends.

【００５２】ステップ１１１）ステップ１０１におい
て、ログインが完了していない場合には、ログイン画面
あるいはログインサーバへ情報登録者端末を移動させる
ように制御して終了する。Step 111) If the login is not completed in Step 101, control is performed to move the information registrant terminal to the login screen or the login server, and the process ends.

【００５３】ステップ１２１）ステップ１０２におい
て、ＷＷＷ情報が入力されていない場合には、ＷＷＷサ
イト登録画面を表示して終了する。図５の画面には、Ｗ
ＷＷサイトのタイトルを入力するテキストフィールド１
０１、ＵＲＬを入力するテキストフィールド１０２、指
定したＵＲＬの中に含まれるハイパーリンクをさらにた
どってそのＷＷＷサイトの内容も検索対象に含めるかど
うかを指定するチェックボックス１０３、紹介文を入力
するテキストエリア１０４、登録ボタン１０５が表示さ
れる。Step 121) When the WWW information is not input in Step 102, the WWW site registration screen is displayed and the process ends. The screen of FIG.
Text field 1 for entering title of WW site
01, a text field 102 for inputting a URL, a check box 103 for specifying whether or not to further search the contents of the WWW site by following the hyperlink included in the specified URL, and a text area for inputting an introduction sentence 104 and a registration button 105 are displayed.

【００５４】ステップ１３１）ステップ１０５におい
て、ＷＷＷページの取得に失敗している場合には、「指
定されたサイトから情報が取得できません」というメッ
セージを表示して終了する。Step 131) In step 105, if the acquisition of the WWW page has failed, a message "Information cannot be acquired from the specified site" is displayed and the process ends.

【００５５】ここで、データベースの構造について説明
する。Here, the structure of the database will be described.

【００５６】図６は、本発明の一実施の形態におけるＷ
ＷＷサイト情報ＤＢ及びＷＷＷページＤＢのテーブル構
造を示す。本装置１０は、ＷＷＷサイト情報ＤＢ１５１
とＷＷＷページＤＢ１６１の２つのデータベースを使用
している。FIG. 6 shows W in one embodiment of the present invention.
The table structure of WW site information DB and WWW page DB is shown. This device 10 has a WWW site information DB 151.
And WWW page DB 161 are used.

【００５７】ＷＷＷサイト情報ＤＢ１５１のＷＷＷサイ
ト情報は、それぞれサイトＩＤを付与して他のＷＷＷサ
イト情報と区別される。一方、ＷＷＷページＤＢ１６１
では、ページを区別するページＩＤの他に、そのページ
がどのＷＷＷサイトに含まれるかを示すサイトＩＤを持
っている。サイトＩＤは検索時のグループ化の際に使用
される。The WWW site information in the WWW site information DB 151 is given a site ID to be distinguished from other WWW site information. On the other hand, WWW page DB 161
Then, in addition to the page ID that distinguishes the page, it has a site ID indicating which WWW site the page is included in. The site ID is used for grouping at the time of search.

【００５８】次に、検索時の処理について説明する。Next, the processing at the time of retrieval will be described.

【００５９】図７は、本発明の一実施の形態における検
索動作のフローチャートである。FIG. 7 is a flowchart of the search operation according to the embodiment of the present invention.

【００６０】ステップ２０１）まず、検索利用者端末
４０からのアクセスに対し、検索キーワードが与えられ
ているかを確認する。検索キーワードが与えられていな
ければ、ステップ２１１に移行する。Step 201) First, it is confirmed whether or not a search keyword is given to the access from the search user terminal 40. If no search keyword is given, the process proceeds to step 211.

【００６１】ステップ２０２）検索キーワードが与え
られている場合には、全文検索エンジン１６２を使って
与えられた検索キーワードを含むＷＷＷページのページ
ＩＤの集合を取得する。Step 202) When a search keyword is given, the full-text search engine 162 is used to obtain a set of page IDs of WWW pages including the given search keyword.

【００６２】ステップ２０３）ステップ２０２におけ
る検索結果がある場合には、ステップ２０４に移行し、
ない場合には、ステップ２２１に移行する。Step 203) If there is a search result in Step 202, move to Step 204,
If not, the process proceeds to step 221.

【００６３】ステップ２０４）検索結果がある場合に
は、ＷＷＷぺージＤＢ１６１を使って、各ページＩＤを
サイトＩＤ毎にグループ化する。Step 204) If there is a search result, each page ID is grouped by site ID using the WWW page DB 161.

【００６４】ステップ２０５）各サイトＩＤ毎にサイ
トのスコアを計算する。このスコアは、全文検索エンジ
ンがＷＷＷページ毎に返却するスコアの和でも良いし、
単純にヒットしたＷＷＷページが何件ＷＷＷサイトに含
まれているかでもよい。Step 205) The score of the site is calculated for each site ID. This score may be the sum of the scores returned by the full-text search engine for each WWW page,
It may be simply how many WWW pages are hits included in the WWW site.

【００６５】ステップ２０６）ステップ２０５で求め
られたスコアの大きい順番にサイト情報を並べ替える。Step 206) The site information is sorted in descending order of the score obtained in Step 205.

【００６６】ステップ２０７）検索結果画面に表示可
能な件数に検索結果を切り詰める処理を行なう。１画面
に表示できるＷＷＷページの件数は、設定で変更できる
ように構築することが望ましい。Step 207) The search results are truncated to the maximum number that can be displayed on the search result screen. It is desirable to construct the number of WWW pages that can be displayed on one screen so that the number can be changed by setting.

【００６７】ステップ２０８）ステップ２０７で残っ
たＷＷＷサイト、ＷＷＷページに関する表示データをＷ
ＷＷサイト情報ＤＢ１５１及びＷＷＷページＤＢ１６１
から取得する。Step 208) The display data concerning the WWW site and WWW page remaining in Step 207 is W
WW site information DB 151 and WWW page DB 161
To get from.

【００６８】ステップ２０９）ステップ２０８で取得
した表示データを組み合わせた検索結果画面を作成し、
検索利用者端末４０に送信して終了する。Step 209) Create a search result screen combining the display data obtained in Step 208,
Send to the search user terminal 40 and end.

【００６９】ステップ２１１）ステップ２０１で検索
キーワードが入力されていない場合には、検索画面を表
示する。検索画面は、検索キーワードを入力するテキス
トフィールドと検索ボタンを含んでいる。Step 211) When the search keyword is not input in Step 201, the search screen is displayed. The search screen includes a text field for inputting a search keyword and a search button.

【００７０】ステップ２２１）ステップ２０３におい
て、検索結果がゼロの場合には、「検索件数は０です」
というメッセージを表示して終了する。Step 221) If the search result is zero in Step 203, "the number of searches is 0".
Is displayed and the process ends.

【００７１】図８は、本発明の一実施の形態における検
索結果画面の例を示す。検索結果は、ＷＷＷサイト毎に
タイトル２０３、情報登録者が入力したサイトの紹介文
２０４、情報登録者の名前（ＩＤ）２０５と、検索キー
ワードで照合に成功したＷＷＷページのタイトル群２０
６が表示されている。タイトル２０３とタイトル群２０
６はハイパーリンクになっている。タイトル２０３をク
リックすると、ＷＷＷサイトのトップページにジャンプ
する。また、タイトル群２０６は、検索にヒットしたＷ
ＷＷページそのものにジャンプする。また、情報登録者
ＩＤ２０６を情報登録者のメールアドレスへのハイパー
リンクにすることもできる。このように、本装置１０の
検索結果は、サイト別に紹介文を付与して提示されるの
で、どのＷＷＷページを見ればよいかが分かりやすい。
また、情報登録者名が提示されているので、Ａさんの推
薦なら間違いないだろうという判断材料を提供すること
ができる。FIG. 8 shows an example of the search result screen in the embodiment of the present invention. The search result is a title 203 for each WWW site, an introduction sentence 204 of the site input by the information registrant, a name (ID) 205 of the information registrant, and a title group 20 of WWW pages that have been successfully matched with the search keyword.
6 is displayed. Title 203 and title group 20
6 is a hyperlink. Click the title 203 to jump to the top page of the WWW site. In addition, the title group 206 is W that has hit the search.
Jump to the WW page itself. The information registrant ID 206 can also be a hyperlink to the information registrant's email address. In this way, since the search result of the device 10 is presented with an introduction sentence for each site, it is easy to understand which WWW page should be viewed.
In addition, since the information registrant name is presented, it is possible to provide a judgment material that it is certain that Mr. A's recommendation will be correct.

【００７２】次に、ＷＷＷページの更新処理について説
明する。Next, the WWW page update process will be described.

【００７３】図９は、本発明の一実施の形態におけるＷ
ＷＷページ情報更新のフローチャートである。FIG. 9 shows W in one embodiment of the present invention.
It is a flowchart of WW page information update.

【００７４】当該ＷＷＷページの更新は、タイマ１３に
よって起動されるものである。The updating of the WWW page is started by the timer 13.

【００７５】ステップ３０１）まず、ＷＷＷページ収
集部１４が一日に一度など、タイマ１３から定期的に処
理が起動されると、ＷＷＷサイト情報ＤＢ１５１から１
レコードずつＷＷＷサイト情報を取り出す。Step 301) First, when the process of the WWW page collection unit 14 is periodically activated from the timer 13 such as once a day, the WWW site information DB 151 is set to 1
Retrieve WWW site information record by record.

【００７６】ステップ３０２）ＷＷＷサイト情報ＤＢ
１５１からのレコードの取り出しが最後のレコードに到
達したかをチェックする。レコードの終わりでなけれ
ば、ステップ３０３に移行し、最終レコードである場合
には、ステップ３１１に移行する。Step 302) WWW site information DB
It is checked whether the record fetch from 151 has reached the last record. If it is not the end of the record, the process proceeds to step 303, and if it is the last record, the process proceeds to step 311.

【００７７】ステップ３０３）レコードの終わりでな
ければ、取り出されたレコードのＵＲＬからＷＷＷペー
ジ収集部１４を使って、ＷＷＷページを取得し、再び、
ステップ３０１の処理を行なう。Step 303) If it is not the end of the record, the WWW page is acquired from the URL of the retrieved record using the WWW page collection unit 14, and again,
The process of step 301 is performed.

【００７８】ステップ３１１）ステップ３０２におい
て、取り出されたレコードが最終レコードである場合に
は、ＷＷＷページ収集部１４において、収集されたＷＷ
Ｗページの情報を使って、ＷＷＷページＤＢ１６１及び
全文検索エンジン１６２の情報を更新し、終了する。Step 311) If the record retrieved in step 302 is the last record, the WW collected by the WWW page collection unit 14
The information on the WWW page DB 161 and the full-text search engine 162 is updated using the information on the W page, and the process ends.

【００７９】また、上記の実施の形態における図４、図
７、図９に示す各フローチャートの処理をプログラムと
して構築し、文書検索サービス提供装置として利用され
るコンピュータのＣＰＵにインストールする、または、
ネットワークを介して流通させることが可能である。Further, the processes of the flowcharts shown in FIGS. 4, 7, and 9 in the above-described embodiment are constructed as a program and installed in the CPU of the computer used as the document search service providing device, or
It can be distributed via a network.

【００８０】また、構築されたプログラムを文書検索サ
ービス提供装置として利用されるコンピュータに接続さ
れるハードディスク装置や、フロッピー（登録商標）デ
ィスク、ＣＤ−ＲＯＭ等の可搬記憶媒体に格納してお
き、本発明を実施する際にインストールすることによ
り、容易に本発明を実現できる。Further, the constructed program is stored in a hard disk device connected to a computer used as a document search service providing device, a portable storage medium such as a floppy (registered trademark) disk, or a CD-ROM, The present invention can be easily realized by installing it when carrying out the present invention.

【００８１】なお、本発明は、上記の実施の形態に限定
されることなく、特許請求の範囲内において種々変更・
応用が可能である。The present invention is not limited to the above-mentioned embodiments, and various modifications and changes are made within the scope of the claims.
It can be applied.

【００８２】[0082]

【発明の効果】上述のように、本発明によれば、特定の
話題に興味を持つ人々が各人の推薦する最新のＷＷＷペ
ージ情報を共有することを容易にすることができると共
に、特定の話題に特化したＷＷＷページ検索サービスを
構築することが可能である。As described above, according to the present invention, it is possible for people who are interested in a particular topic to share the latest WWW page information recommended by each person, and at the same time, a specific It is possible to build a WWW page search service specialized for a topic.

[Brief description of drawings]

【図１】本発明の原理を説明するための図である。FIG. 1 is a diagram for explaining the principle of the present invention.

【図２】本発明の原理構成図である。FIG. 2 is a principle configuration diagram of the present invention.

【図３】本発明の一実施の形態における文書検索サービ
ス提供装置の構成図である。FIG. 3 is a configuration diagram of a document search service providing apparatus according to an embodiment of the present invention.

【図４】本発明の一実施の形態における情報登録動作の
フローチャートである。FIG. 4 is a flowchart of an information registration operation according to the embodiment of the present invention.

【図５】本発明の一実施の形態におけるＷＷＷサイト情
報登録画面の例である。FIG. 5 is an example of a WWW site information registration screen according to the embodiment of the present invention.

【図６】本発明の一実施の形態におけるＷＷＷサイト情
報ＤＢ及びＷＷＷページＤＢテーブルの構造を示す図で
ある。FIG. 6 is a diagram showing structures of a WWW site information DB and a WWW page DB table in the embodiment of the present invention.

【図７】本発明の一実施の形態における検索動作のフロ
ーチャートである。FIG. 7 is a flowchart of a search operation according to the embodiment of the present invention.

【図８】本発明の一実施の形態における検索結果画面の
例である。FIG. 8 is an example of a search result screen in the embodiment of the present invention.

【図９】本発明の一実施の形態におけるＷＷＷページ情
報更新のフローチャートである。FIG. 9 is a flowchart of updating WWW page information according to the embodiment of the present invention.

【図１０】従来のリンク集の例である。FIG. 10 is an example of a conventional link collection.

[Explanation of symbols]

１０文書検索サービス提供装置１１会員認証手段、会員認証部１２ＷＷＷサイト情報入力手段、ＷＷＷサイト情報入
力部１３タイマ１４ＷＷＷページ収集手段、ＷＷＷページ収集部１５ＷＷＷサイト情報保持手段、ＷＷＷサイト情報保
持部１６ページ索引保持手段、ＷＷＷページ索引保持部１７検索手段２０情報登録者端末３０ＷＷＷサーバ４０検索利用者端末１０１タイトルを入力するテキストフィールド１０２ＵＲＬを入力するテキストフィールド１０３チェックボックス１０４紹介文を入力するテキストエリア１０５登録ボタン１５１ＷＷＷサイト情報ＤＢ１６１ＷＷＷページＤＢ１６２全文検索エンジン２０３タイトル２０４ＷＷＷサイトの紹介文２０５情報登録者の名前２０６情報登録者ＩＤ10 Document Retrieval Service Providing Device 11 Member Authentication Unit, Member Authentication Unit 12 WWW Site Information Input Unit, WWW Site Information Input Unit 13 Timer 14 WWW Page Collection Unit, WWW Page Collection Unit 15 WWW Site Information Storage Unit, WWW Site Information Storage Unit 16 page index holding means, WWW page index holding part 17 search means 20 information registrant terminal 30 WWW server 40 search user terminal 101 text field 102 for entering a title text field 103 for entering a URL 103 check box 104 for entering an introductory text Text area 105 Registration button 151 WWW site information DB 161 WWW page DB 162 Full-text search engine 203 Title 204 WWW site introduction sentence 205 Information registrant name 206 Information registrant ID

Claims

[Claims]

1. A method for providing a document search service for discovering a desired document from a set of WWW pages existing on a computer network, comprising: accessing an information registrant terminal from an information registrant terminal;
The information registrant is authenticated, the information registrant terminal is prompted to enter WWW site information including the name, URL, and introduction text of the WWW site to be searched, and the information registrant terminal inputs the WWW site information. WWW site information and registrant name are stored in the WWW site information database, WWW page groups included in the WWW site are acquired from the designated WWW site, stored in the search database, and the search user of the search user The terminal is prompted to enter a search keyword, the WWW page group related to the search keyword input from the search user terminal is searched from the search database, and the searched WWW page group is grouped for each WWW site. WW including the WWW site name and introductory text for each group
A document search service providing method comprising: creating a page to which W site information and the name of the information registrant are added, and displaying the created page as a search result on the search user terminal.

2. A document search service providing method for discovering a desired document from a set of WWW pages existing on a computer network, comprising: accessing an information registrant terminal from an information registrant terminal;
The information registrant is authenticated, the information registrant terminal is prompted to enter WWW site information including the name, URL, and introduction text of the WWW site to be searched, and the information registrant terminal inputs the WWW site information. A document search service characterized by storing WWW site information and a registrant name in a WWW site information database, acquiring a WWW page group included in the WWW site from a designated WWW site, and storing the WWW page group in a search database. How to provide.

3. A document search service providing method for finding a desired document from a set of WWW pages existing on a computer network, wherein a search user terminal of a search user is prompted to enter a search keyword, The WWW page group related to the search keyword input from the search user terminal is searched from the search database, the searched WWW page group is grouped for each WWW site, and the WWW site name and introduction text are grouped for each group. Including WW
A document search service providing method comprising: creating a page to which W site information and the name of the information registrant are added, and displaying the created page as a search result on the search user terminal.

4. A WWW stored in the WWW site information database when a predetermined condition is satisfied.
The site information is read, and the WWW is read from the URL included in the WWW site information.
Acquire a page group, based on the acquired WWW page,
3. The database for search is updated, claim 1 or 2.
Document search service provision method described.

5. A document search service providing apparatus for discovering a desired document from a set of WWW pages existing on a computer network, the member authenticating means for authenticating an information registrant, and the member authenticating means. UR from authenticated information registrant
WWW site information input means for receiving WWW site information including L, title, and introduction, WWW site information holding means for holding WWW site information and information registrant name from the WWW site information input means, and the WWW site information The WWW held by the holding means
W to collect WWW page information based on site information W
WW page collecting means, WWW page index holding means for holding the information of WWW pages collected by the WWW page collecting means in a searchable form, and a search keyword for access from a search user terminal of a search user. The WWW page group related to the search keyword input from the search user terminal, grouped by WWW site, and the search result obtained by adding the WWW site information and the information registrant name is used for the search. An apparatus for providing a document search service, comprising: a search unit for displaying on a person's terminal.

6. A document retrieval service providing apparatus for discovering a desired document from a set of WWW pages existing on a computer network, comprising: a member authenticating means for authenticating an information registrant; and the member authenticating means. UR from authenticated information registrant
WWW site information input means for receiving WWW site information including L, title, and introduction, WWW site information holding means for holding WWW site information and information registrant name from the WWW site information input means, and the WWW site information The WWW held by the holding means
W to collect WWW page information based on site information W
An apparatus for providing a document search service, comprising: a WW page collecting means; and a WWW page index holding means for holding information of WWW pages collected by the WWW page collecting means in a searchable form.

7. A document search service providing apparatus for finding a desired document from a set of WWW pages existing on a computer network, wherein a search keyword is provided for access from a search user terminal of a search user. Prompting for input, searching for WWW page groups related to the search keyword input from the search user terminal, grouping by WWW site, and adding the WWW site information and information registrant name to the search result. An apparatus for providing a document search service, comprising: a search unit for displaying on a terminal.

8. A timer for measuring time, and a WWW page collected by said WWW page collection means every time said timer counts a certain time, and a WWW page held in said WWW page index holding means 7. The document search service providing device according to claim 5, further comprising update means for updating information.

9. A document search service providing program for discovering a desired document from a set of WWW pages existing on a computer network, comprising: a member authentication process for authenticating an information registrant; and a member authentication process. WWW site information input process for receiving WWW site information including URL, title, and introductory text from an authenticated information registrant, and WWW site information database holding WWW site information and information registrant name from the WWW site information input process A WWW page collection process for collecting WWW page information based on the WWW site information stored in the WWW page information, and a WW collected by the WWW page collection process.
And a WWW page index storage process for storing W page information in a WWW page index database that retains information in a searchable form.

10. A document search service providing program for finding a desired document from a set of WWW pages existing on a computer network, wherein a search keyword is provided for access from a search user terminal of a search user. Prompting for input, searching for WWW page groups related to the search keyword input from the search user terminal, grouping by WWW site, and adding the WWW site information and information registrant name to the search result. A document search service providing program, comprising: a search process for displaying on a terminal.

11. An update for updating WWW page information held in the WWW page index database with a WWW page collected in the WWW page collection process every time a timer for measuring time counts a certain time. The document search service providing program according to claim 9, further comprising a process.

12. A storage medium storing a document search service providing program for discovering a desired document from a set of WWW pages existing on a computer network, and a member authentication process for authenticating an information registrant, WWW site information input process for receiving WWW site information including URL, title, and introduction from the information registrant authenticated by the member authentication process, and holding WWW site information and information registrant name from the WWW site information input process A WWW page collection process for collecting WWW page information based on the WWW site information held in the WWW site information database, and a WW collected by the WWW page collection process
A storage medium storing a document search service providing program, comprising: a WWW page index storage process for storing information of W pages in a searchable manner in a WWW page index database.

13. A storage medium storing a program for providing a document search service for finding a desired document from a set of WWW pages existing on a computer network, which is accessible from a search user terminal of a search user. A search result that prompts the user to enter a search keyword, searches for WWW page groups related to the search keyword input from the search user terminal, groups them by WWW site, and adds WWW site information and information registrant name. And a search process for displaying the document on the search user terminal.

14. An update for updating WWW page information held in the WWW page index database with a WWW page collected in the WWW page collection process every time a timer for counting time counts a certain time. The storage medium storing the document search service providing program according to claim 12, further comprising a process.