WO2002033585A1 - Procede de creation et d'utilisation d'un moteur de recherche sur un site internet - Google Patents
Procede de creation et d'utilisation d'un moteur de recherche sur un site internet Download PDFInfo
- Publication number
- WO2002033585A1 WO2002033585A1 PCT/CN2000/000343 CN0000343W WO0233585A1 WO 2002033585 A1 WO2002033585 A1 WO 2002033585A1 CN 0000343 W CN0000343 W CN 0000343W WO 0233585 A1 WO0233585 A1 WO 0233585A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- search
- information
- user
- search engine
- website
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
Definitions
- the invention relates to a method for constructing and using a search engine website, in particular to a control method for constructing and using a search engine website.
- Existing search engine websites use a combination of static pages and dynamic pages, and use the structure of a category directory plus a robot search program to construct a website to provide users with information query and retrieval services.
- the classified directory is a static webpage or a dynamic webpage generated after querying the database.
- the hierarchical database is used to access the website's database to provide customers with information.
- the robotic search program uses a customized search program to access the website's database for searching and The retrieval work is provided as a service to the user.
- Existing search engine websites are very convenient to use. You can search down and find information according to the category and level, or you can enter keywords to be searched by a robot search program.
- the search targets of existing search engine websites are basically web pages and websites. In order to find the information to the maximum, there are many choices on the homepage of a general search engine website, and the robot search program can also enter any keywords.
- the search result provided to the user after the search may be 0 or may be Many, even astronomical figures. For example: Enter the keyword "American University" on the search engine website YAH00 (Yahoo), and you will get more than 18,000 results. This is only the current result. Over time, there will be more and more, but the real ones are already registered. Of the US-based universities, including colleges, are only over 3,500, and if you need more results, you can also click the "More Results" webpage instruction, then you will get more "Results"!
- the biggest problem of the existing search engine website business model lies in the complete freedom of information in the construction and use of search engine websites, and the uncontrolled provision and acquisition.
- the "information highway” and “information ocean” can be easily accessed through the Internet, but at the same time they will be overwhelmed by a lot of useless information provided by the "information ocean”.
- For the construction of search engine websites because of the use of robot search programs, a lot of useless information is searched, which reduces the search engine's Work efficiency wasted computer network resources.
- the object of the present invention is to solve the problem of "spam" of the former search engine website, that is, the search results are too many and do not match the target. After a long search, the search has no results (0), and provides an accurate, authentic, and effective A controlled method for the construction and use of search engine websites that provide search information.
- the information service provider controls the source of the information.
- the source of information for a search engine website is limited to a limited target, which is an entity, that is, a concrete existence, and not just a website or web page.
- the website or webpage will exist as a special type of entity and will be subdivided.
- the information source is limited to the subject of economic activities-natural and legal persons, that is, the provider of the product or service.
- the product will exist as an extension of the main information in a subset of the related information.
- the information service provider uses a combination of static and dynamic pages to search, and entity information is stored in the database.
- the users of the website are under the control of the search engine provided by the information service provider.
- the search engine will identify this key field and find out the information in the database.
- the service provider provides a limited number of key fields with a limited target, and ultimately reaches the user's search target.
- an information service provider uses a catalog for searches, it can use static and dynamic pages to control the search process.
- the user must search from the main directory to the next lower sub-directory in a categorized manner based on the keywords provided by the information service provider until the required entity information is found.
- a robotic search program is used for searching. Status page controls the search process.
- the user must enter the restricted keywords according to the keyword input rules notified in advance by the information service provider (ICP).
- the robot search program retrieves the corresponding entity information from the database.
- FIG. 1 is a schematic diagram of a database structure of a search engine website according to an embodiment of the present invention.
- FIG. 2 is a schematic diagram of a search engine website according to an embodiment of the present invention. The best way to implement the invention
- the scheme of establishing a website mainly adopts virtual hosting and hosting.
- Virtual hosting is renting network space, using static pages or dynamic pages for searching, making all categories and entities and their attributes information into web pages, and then linking them together.
- Hosting should provide one or more servers.
- a World Wide Web (WWW SERVER) server and a database server (DATABASE SERVER) are used, and a combination of static and dynamic pages is used for searching.
- the information of categories and entities and their attributes are all stored in the database, and the background public gateway interface (CGI) program is used to access the database and control the access to entity information.
- CGI public gateway interface
- the embodiment of the present invention introduces a method of constructing and using a commercial search engine website by taking hosting as an example.
- Fig. 1 is a schematic diagram showing a database structure of a search engine website according to an embodiment of the present invention.
- the database uses a hierarchical model, that is, a tree structure, and the bottom of the tree structure is entity information.
- the root category in the figure is the largest category of entity classification
- the second category is a subclass of the root category
- the third category is a subclass of the second category, and so on until the specific entity information.
- Each layer of the data structure is a one-to-many relationship.
- Each fragment in the data structure is equivalent to a record in the database.
- Each fragment consists of multiple fields.
- the database administrator defines the name of a fragment and the name and data type (character, numeric, etc.) and length of each field in it as needed.
- Each fragment defines a key field to identify the fragment value, which should uniquely identify a fragment value.
- the prerequisite for a fragment to exist is that its parent fragment exists in the database record.
- the information source is limited to a natural person or a legal person, that is, the limited related information of a provider of goods or services.
- FIG. 2 it is a schematic diagram of an embodiment of a commercial search engine.
- a user needs to search for information about a manufacturer of running shoes.
- a category search is used as an example. The user selects a product category based on the root category provided by the information service provider, selects daily necessities in the next level, selects clothing, shoes and hats in the next level, and selects shoes in the next level. , And so on until the information of the manufacturer of the sports shoes required by the user is found.
- the user can only enter keywords defined by the information service provider.
- the user enters "sneakers”, and then the program searches the database of each level in Figure 1 from top to bottom until the relevant keywords are found. If no relevant keywords are found, the user is prompted through the page " Sorry, please enter keywords as required. If keywords related to sneakers are found, all categories belonging to "sneakers” will be taken out and returned to the user. In this way, the user sees a subset of "sports shoes” such as “soccer shoes” and “running shoes” and asks the user to make a selection.
- the program searches for subordinates of the running shoes in its next category The category of the set is then passed back to the user. If the user has selected running shoes.
- the next page will be information on the products and manufacturers of various running shoes, so that users can easily, quickly and accurately obtain product and manufacturer information under the guidance of information service providers.
- the method for constructing and using a search engine website of the present invention locates a search target as a limited target, that is, an entity through control of an information source, so that the search target is accurate and specific, and can accurately and quickly search for information required by network users. Therefore, a method is provided for solving the problem of a large amount of useless information, that is, "spam", which appears in the search process of a search engine website at present.
- the method for constructing and using the search engine website of the present invention can save the time of website users, improve the work efficiency of the search engine, and save the computer network resources of the information service provider.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2000/000343 WO2002033585A1 (fr) | 2000-10-20 | 2000-10-20 | Procede de creation et d'utilisation d'un moteur de recherche sur un site internet |
| AU2000278999A AU2000278999A1 (en) | 2000-10-20 | 2000-10-20 | Building-up and employing method for search network station |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2000/000343 WO2002033585A1 (fr) | 2000-10-20 | 2000-10-20 | Procede de creation et d'utilisation d'un moteur de recherche sur un site internet |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2002033585A1 true WO2002033585A1 (fr) | 2002-04-25 |
Family
ID=4574727
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2000/000343 Ceased WO2002033585A1 (fr) | 2000-10-20 | 2000-10-20 | Procede de creation et d'utilisation d'un moteur de recherche sur un site internet |
Country Status (2)
| Country | Link |
|---|---|
| AU (1) | AU2000278999A1 (fr) |
| WO (1) | WO2002033585A1 (fr) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110555159A (zh) * | 2018-03-30 | 2019-12-10 | 北大方正集团有限公司 | 网页检索方法、装置、设备及存储介质 |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0774722A2 (fr) * | 1995-11-17 | 1997-05-21 | Microsoft Corporation | Système de recouvrement d'informations |
| EP0829811A1 (fr) * | 1996-09-11 | 1998-03-18 | Nippon Telegraph And Telephone Corporation | Procédé et système pour le recouvrement d'informations |
| CN1235447A (zh) * | 1998-05-11 | 1999-11-17 | 龙卷风科技股份有限公司 | 万维网站的网页全文检索系统 |
| CN1245937A (zh) * | 1998-08-26 | 2000-03-01 | 英业达股份有限公司 | 同时进行多个搜寻引擎检索的方法 |
-
2000
- 2000-10-20 AU AU2000278999A patent/AU2000278999A1/en not_active Abandoned
- 2000-10-20 WO PCT/CN2000/000343 patent/WO2002033585A1/fr not_active Ceased
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0774722A2 (fr) * | 1995-11-17 | 1997-05-21 | Microsoft Corporation | Système de recouvrement d'informations |
| EP0829811A1 (fr) * | 1996-09-11 | 1998-03-18 | Nippon Telegraph And Telephone Corporation | Procédé et système pour le recouvrement d'informations |
| CN1235447A (zh) * | 1998-05-11 | 1999-11-17 | 龙卷风科技股份有限公司 | 万维网站的网页全文检索系统 |
| CN1245937A (zh) * | 1998-08-26 | 2000-03-01 | 英业达股份有限公司 | 同时进行多个搜寻引擎检索的方法 |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110555159A (zh) * | 2018-03-30 | 2019-12-10 | 北大方正集团有限公司 | 网页检索方法、装置、设备及存储介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| AU2000278999A1 (en) | 2002-04-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8185545B2 (en) | Task/domain segmentation in applying feedback to command control | |
| US8255541B2 (en) | Method and apparatus for utilizing user feedback to improve signifier mapping | |
| US6112202A (en) | Method and system for identifying authoritative information resources in an environment with content-based links between information resources | |
| US6006217A (en) | Technique for providing enhanced relevance information for documents retrieved in a multi database search | |
| US7987165B2 (en) | Indexing system and method | |
| US6947924B2 (en) | Group based search engine generating search results ranking based on at least one nomination previously made by member of the user group where nomination system is independent from visitation system | |
| Yuwono et al. | WISE: a world wide web resource database system | |
| KR100719009B1 (ko) | 데이터베이스 검색 시스템에서 관련 검색을 식별하기 위한장치 | |
| US6665837B1 (en) | Method for identifying related pages in a hyperlinked database | |
| US20050050023A1 (en) | Method, device and software for querying and presenting search results | |
| US7047246B2 (en) | Search and index hosting system | |
| US8321400B2 (en) | Method, device and software for querying and presenting search results | |
| US20020147880A1 (en) | Systems and methods for performing crawl searches and index searches | |
| US20110238662A1 (en) | Method and system for searching a wide area network | |
| US20030088553A1 (en) | Method for providing relevant search results based on an initial online search query | |
| US20100106701A1 (en) | Electronic document retrieval system | |
| WO2011102765A1 (fr) | Procédé et dispositif de recherche de réseau | |
| US7490082B2 (en) | System and method for searching internet domains | |
| WO2002033585A1 (fr) | Procede de creation et d'utilisation d'un moteur de recherche sur un site internet | |
| CN101133415A (zh) | 使用页面集而提供信息搜索服务的服务器、方法和系统 | |
| JPH11259486A (ja) | 閲覧用ホームページ作成方法及び閲覧用ホームページ作成装置 | |
| JP3933617B2 (ja) | 共有情報検索方法、共有情報検索プログラム、および情報共有システム | |
| Ozsoyoglu et al. | Web information resource discovery: Past, present, and future | |
| CA2537270A1 (fr) | Procede, dispositif et logiciel de demande et de presentation de resultats de recherche | |
| Balke | A roadmap to personalized information systems by cognitive expansion of queries |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| 122 | Ep: pct application non-entry in european phase | ||
| NENP | Non-entry into the national phase |
Ref country code: JP |