[go: up one dir, main page]

WO2002033585A1 - Procede de creation et d'utilisation d'un moteur de recherche sur un site internet - Google Patents

Procede de creation et d'utilisation d'un moteur de recherche sur un site internet Download PDF

Info

Publication number
WO2002033585A1
WO2002033585A1 PCT/CN2000/000343 CN0000343W WO0233585A1 WO 2002033585 A1 WO2002033585 A1 WO 2002033585A1 CN 0000343 W CN0000343 W CN 0000343W WO 0233585 A1 WO0233585 A1 WO 0233585A1
Authority
WO
WIPO (PCT)
Prior art keywords
search
information
user
search engine
website
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2000/000343
Other languages
English (en)
Chinese (zh)
Inventor
Wei Chen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING PDN XINREN INFORMATION TECHNOLOGY Co Ltd
Original Assignee
BEIJING PDN XINREN INFORMATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING PDN XINREN INFORMATION TECHNOLOGY Co Ltd filed Critical BEIJING PDN XINREN INFORMATION TECHNOLOGY Co Ltd
Priority to PCT/CN2000/000343 priority Critical patent/WO2002033585A1/fr
Priority to AU2000278999A priority patent/AU2000278999A1/en
Publication of WO2002033585A1 publication Critical patent/WO2002033585A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the invention relates to a method for constructing and using a search engine website, in particular to a control method for constructing and using a search engine website.
  • Existing search engine websites use a combination of static pages and dynamic pages, and use the structure of a category directory plus a robot search program to construct a website to provide users with information query and retrieval services.
  • the classified directory is a static webpage or a dynamic webpage generated after querying the database.
  • the hierarchical database is used to access the website's database to provide customers with information.
  • the robotic search program uses a customized search program to access the website's database for searching and The retrieval work is provided as a service to the user.
  • Existing search engine websites are very convenient to use. You can search down and find information according to the category and level, or you can enter keywords to be searched by a robot search program.
  • the search targets of existing search engine websites are basically web pages and websites. In order to find the information to the maximum, there are many choices on the homepage of a general search engine website, and the robot search program can also enter any keywords.
  • the search result provided to the user after the search may be 0 or may be Many, even astronomical figures. For example: Enter the keyword "American University" on the search engine website YAH00 (Yahoo), and you will get more than 18,000 results. This is only the current result. Over time, there will be more and more, but the real ones are already registered. Of the US-based universities, including colleges, are only over 3,500, and if you need more results, you can also click the "More Results" webpage instruction, then you will get more "Results"!
  • the biggest problem of the existing search engine website business model lies in the complete freedom of information in the construction and use of search engine websites, and the uncontrolled provision and acquisition.
  • the "information highway” and “information ocean” can be easily accessed through the Internet, but at the same time they will be overwhelmed by a lot of useless information provided by the "information ocean”.
  • For the construction of search engine websites because of the use of robot search programs, a lot of useless information is searched, which reduces the search engine's Work efficiency wasted computer network resources.
  • the object of the present invention is to solve the problem of "spam" of the former search engine website, that is, the search results are too many and do not match the target. After a long search, the search has no results (0), and provides an accurate, authentic, and effective A controlled method for the construction and use of search engine websites that provide search information.
  • the information service provider controls the source of the information.
  • the source of information for a search engine website is limited to a limited target, which is an entity, that is, a concrete existence, and not just a website or web page.
  • the website or webpage will exist as a special type of entity and will be subdivided.
  • the information source is limited to the subject of economic activities-natural and legal persons, that is, the provider of the product or service.
  • the product will exist as an extension of the main information in a subset of the related information.
  • the information service provider uses a combination of static and dynamic pages to search, and entity information is stored in the database.
  • the users of the website are under the control of the search engine provided by the information service provider.
  • the search engine will identify this key field and find out the information in the database.
  • the service provider provides a limited number of key fields with a limited target, and ultimately reaches the user's search target.
  • an information service provider uses a catalog for searches, it can use static and dynamic pages to control the search process.
  • the user must search from the main directory to the next lower sub-directory in a categorized manner based on the keywords provided by the information service provider until the required entity information is found.
  • a robotic search program is used for searching. Status page controls the search process.
  • the user must enter the restricted keywords according to the keyword input rules notified in advance by the information service provider (ICP).
  • the robot search program retrieves the corresponding entity information from the database.
  • FIG. 1 is a schematic diagram of a database structure of a search engine website according to an embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a search engine website according to an embodiment of the present invention. The best way to implement the invention
  • the scheme of establishing a website mainly adopts virtual hosting and hosting.
  • Virtual hosting is renting network space, using static pages or dynamic pages for searching, making all categories and entities and their attributes information into web pages, and then linking them together.
  • Hosting should provide one or more servers.
  • a World Wide Web (WWW SERVER) server and a database server (DATABASE SERVER) are used, and a combination of static and dynamic pages is used for searching.
  • the information of categories and entities and their attributes are all stored in the database, and the background public gateway interface (CGI) program is used to access the database and control the access to entity information.
  • CGI public gateway interface
  • the embodiment of the present invention introduces a method of constructing and using a commercial search engine website by taking hosting as an example.
  • Fig. 1 is a schematic diagram showing a database structure of a search engine website according to an embodiment of the present invention.
  • the database uses a hierarchical model, that is, a tree structure, and the bottom of the tree structure is entity information.
  • the root category in the figure is the largest category of entity classification
  • the second category is a subclass of the root category
  • the third category is a subclass of the second category, and so on until the specific entity information.
  • Each layer of the data structure is a one-to-many relationship.
  • Each fragment in the data structure is equivalent to a record in the database.
  • Each fragment consists of multiple fields.
  • the database administrator defines the name of a fragment and the name and data type (character, numeric, etc.) and length of each field in it as needed.
  • Each fragment defines a key field to identify the fragment value, which should uniquely identify a fragment value.
  • the prerequisite for a fragment to exist is that its parent fragment exists in the database record.
  • the information source is limited to a natural person or a legal person, that is, the limited related information of a provider of goods or services.
  • FIG. 2 it is a schematic diagram of an embodiment of a commercial search engine.
  • a user needs to search for information about a manufacturer of running shoes.
  • a category search is used as an example. The user selects a product category based on the root category provided by the information service provider, selects daily necessities in the next level, selects clothing, shoes and hats in the next level, and selects shoes in the next level. , And so on until the information of the manufacturer of the sports shoes required by the user is found.
  • the user can only enter keywords defined by the information service provider.
  • the user enters "sneakers”, and then the program searches the database of each level in Figure 1 from top to bottom until the relevant keywords are found. If no relevant keywords are found, the user is prompted through the page " Sorry, please enter keywords as required. If keywords related to sneakers are found, all categories belonging to "sneakers” will be taken out and returned to the user. In this way, the user sees a subset of "sports shoes” such as “soccer shoes” and “running shoes” and asks the user to make a selection.
  • the program searches for subordinates of the running shoes in its next category The category of the set is then passed back to the user. If the user has selected running shoes.
  • the next page will be information on the products and manufacturers of various running shoes, so that users can easily, quickly and accurately obtain product and manufacturer information under the guidance of information service providers.
  • the method for constructing and using a search engine website of the present invention locates a search target as a limited target, that is, an entity through control of an information source, so that the search target is accurate and specific, and can accurately and quickly search for information required by network users. Therefore, a method is provided for solving the problem of a large amount of useless information, that is, "spam", which appears in the search process of a search engine website at present.
  • the method for constructing and using the search engine website of the present invention can save the time of website users, improve the work efficiency of the search engine, and save the computer network resources of the information service provider.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

L'invention porte sur un procédé de création et d'utilisation d'un moteur de recherche sur un site Internet. Ledit procédé fait appel à une structure faisant intervenir une liste de classifications ainsi qu'un programme de recherche robotisé, il utilise en outre des pages statiques et des pages dynamiques afin de créer le site, dans le but de proposer aux utilisateurs et abonnés un service de recherche d'informations. Les informations qui sont mises à disposition des utilisateurs sur le site Internet par le biais du moteur de recherche proviennent de fournisseurs de services (ICP) et se réfèrent à des substances limitées. Lors de chaque opération de recherche, le surfeur qui se trouve sur ledit site Internet effectue ses recherches sous le contrôle du fournisseur de services qui propose le moteur de recherche. Il est en conséquence en mesure de rechercher rapidement et avec précision des informations souhaitées en surfant sur la toile, le temps de recherche requis est réduit, l'efficacité de la recherche est améliorée et les ressources informatiques nécessaires à la recherche sur la toile sont également revues à la baisse.
PCT/CN2000/000343 2000-10-20 2000-10-20 Procede de creation et d'utilisation d'un moteur de recherche sur un site internet Ceased WO2002033585A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2000/000343 WO2002033585A1 (fr) 2000-10-20 2000-10-20 Procede de creation et d'utilisation d'un moteur de recherche sur un site internet
AU2000278999A AU2000278999A1 (en) 2000-10-20 2000-10-20 Building-up and employing method for search network station

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2000/000343 WO2002033585A1 (fr) 2000-10-20 2000-10-20 Procede de creation et d'utilisation d'un moteur de recherche sur un site internet

Publications (1)

Publication Number Publication Date
WO2002033585A1 true WO2002033585A1 (fr) 2002-04-25

Family

ID=4574727

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2000/000343 Ceased WO2002033585A1 (fr) 2000-10-20 2000-10-20 Procede de creation et d'utilisation d'un moteur de recherche sur un site internet

Country Status (2)

Country Link
AU (1) AU2000278999A1 (fr)
WO (1) WO2002033585A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110555159A (zh) * 2018-03-30 2019-12-10 北大方正集团有限公司 网页检索方法、装置、设备及存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0774722A2 (fr) * 1995-11-17 1997-05-21 Microsoft Corporation Système de recouvrement d'informations
EP0829811A1 (fr) * 1996-09-11 1998-03-18 Nippon Telegraph And Telephone Corporation Procédé et système pour le recouvrement d'informations
CN1235447A (zh) * 1998-05-11 1999-11-17 龙卷风科技股份有限公司 万维网站的网页全文检索系统
CN1245937A (zh) * 1998-08-26 2000-03-01 英业达股份有限公司 同时进行多个搜寻引擎检索的方法

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0774722A2 (fr) * 1995-11-17 1997-05-21 Microsoft Corporation Système de recouvrement d'informations
EP0829811A1 (fr) * 1996-09-11 1998-03-18 Nippon Telegraph And Telephone Corporation Procédé et système pour le recouvrement d'informations
CN1235447A (zh) * 1998-05-11 1999-11-17 龙卷风科技股份有限公司 万维网站的网页全文检索系统
CN1245937A (zh) * 1998-08-26 2000-03-01 英业达股份有限公司 同时进行多个搜寻引擎检索的方法

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110555159A (zh) * 2018-03-30 2019-12-10 北大方正集团有限公司 网页检索方法、装置、设备及存储介质

Also Published As

Publication number Publication date
AU2000278999A1 (en) 2002-04-29

Similar Documents

Publication Publication Date Title
US8185545B2 (en) Task/domain segmentation in applying feedback to command control
US8255541B2 (en) Method and apparatus for utilizing user feedback to improve signifier mapping
US6112202A (en) Method and system for identifying authoritative information resources in an environment with content-based links between information resources
US6006217A (en) Technique for providing enhanced relevance information for documents retrieved in a multi database search
US7987165B2 (en) Indexing system and method
US6947924B2 (en) Group based search engine generating search results ranking based on at least one nomination previously made by member of the user group where nomination system is independent from visitation system
Yuwono et al. WISE: a world wide web resource database system
KR100719009B1 (ko) 데이터베이스 검색 시스템에서 관련 검색을 식별하기 위한장치
US6665837B1 (en) Method for identifying related pages in a hyperlinked database
US20050050023A1 (en) Method, device and software for querying and presenting search results
US7047246B2 (en) Search and index hosting system
US8321400B2 (en) Method, device and software for querying and presenting search results
US20020147880A1 (en) Systems and methods for performing crawl searches and index searches
US20110238662A1 (en) Method and system for searching a wide area network
US20030088553A1 (en) Method for providing relevant search results based on an initial online search query
US20100106701A1 (en) Electronic document retrieval system
WO2011102765A1 (fr) Procédé et dispositif de recherche de réseau
US7490082B2 (en) System and method for searching internet domains
WO2002033585A1 (fr) Procede de creation et d'utilisation d'un moteur de recherche sur un site internet
CN101133415A (zh) 使用页面集而提供信息搜索服务的服务器、方法和系统
JPH11259486A (ja) 閲覧用ホームページ作成方法及び閲覧用ホームページ作成装置
JP3933617B2 (ja) 共有情報検索方法、共有情報検索プログラム、および情報共有システム
Ozsoyoglu et al. Web information resource discovery: Past, present, and future
CA2537270A1 (fr) Procede, dispositif et logiciel de demande et de presentation de resultats de recherche
Balke A roadmap to personalized information systems by cognitive expansion of queries

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP