WO2000050972A2 - Network information collection tool - Google Patents

Network information collection tool

Info

Publication number
WO2000050972A2
WO2000050972A2 PCT/US2000/005008 US0005008W WO0050972A2 WO 2000050972 A2 WO2000050972 A2 WO 2000050972A2 US 0005008 W US0005008 W US 0005008W WO 0050972 A2 WO0050972 A2 WO 0050972A2
Authority
WO
WIPO (PCT)
Prior art keywords
information
selection
received
web
page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2000/005008
Other languages
English (en)
Other versions
WO2000050972A3 (fr)
WO2000050972A9 (fr)
Inventor
Cecilia Lenk
James E. Gouldstone
James G. Reidy
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WEBIVORE KNOWLEDGE SYSTEMS LLC
Original Assignee
WEBIVORE KNOWLEDGE SYSTEMS LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WEBIVORE KNOWLEDGE SYSTEMS LLC
Priority to AU40045/00A (AU4004500A)
Publication of WO2000050972A2
Publication of WO2000050972A3
Publication of WO2000050972A9
Anticipated expiration
Legal status: Ceased

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/954 Navigation, e.g. using categorised browsing

Definitions

  • This invention relates to information collection, more particularly to techniques for collecting information from different networked computers.
  • the Internet's World-Wide-Web offers users easy access to a tremendous amount of information on a wide range of subjects.
  • Search engines such as Yahoo™ can provide lengthy lists of sites related to almost any topic of interest.
  • formulating meaningful queries can prove difficult for both new and experienced search engine users.
  • visiting the tremendous number of web-sites typically retrieved by a search engine can make Internet-based research time-consuming and frustrating.
  • Search engines also frequently list Internet sites that may be too advanced for users investigating a topic for the first time. For example, a junior high school student investigating evolution can easily find themselves viewing a web-page that presents an involved Ph.D. thesis instead of a page that presents material in an introductory manner. Beyond finding appropriate information, collecting information from the web requires careful organization of stored web content for later use.
  • a method of collecting information received from an Internet or Intranet URL includes receiving a user selection from a web-page corresponding to the URL and analyzing information included in the web-page to determine citation information for the user selection. The method further includes storing and displaying the selection and the determined citation information.
  • a method of collecting information received from a networked computer includes receiving a user selection from displayed information received from a networked computer and analyzing the information received from the networked computer to determine at least one attribute of the user selection. The method also includes storing the selection and the determined attribute(s).
  • the attribute may be the authorship of the received information.
  • the analyzing may include examining tags (e.g., HTML <META> tags) included in the received information.
  • the received information may be a web-page from an Internet or Intranet web-site.
  • the user selection may be dragged-and-dropped, cut-and-pasted, or transferred to a storage medium (e.g., a user's hard disk).
  • the user selection may include text, graphics, sounds, animation, video, etc.
  • the method may further include displaying the stored selection and the determined attributes, for example, by dynamically generating HTML pages that include the stored selection and determined attributes.
  • the method may include navigating through a hierarchically organized grouping of topics to request the information received from the networked computer.
  • a file may include information that describes the hierarchically organized grouping of topics.
  • a tool for collecting information received from a networked computer includes means for receiving a user selection from displayed information received from a networked computer, means for analyzing the information received from the networked computer to determine at least one attribute of the user selection, and means for storing the selection and the determined attribute(s).
  • the tool may also include means for maintaining the tool in the foreground during display of the received information.
  • Advantages may include one or more of the following.
  • the methods, programs, and tools ease network-based research by speeding information location and collection and easing organization of the collected material. By automatically collecting and storing information needed for citations from tags and other information included in a web-page, users can clip information and provide citations without having to perform their own research to determine authorship, publication date, etc. By providing a persistent window in the foreground to collect user clippings, a user can quickly and easily access clipping functions.
  • FIG. 1 is a screenshot of a tool for collecting web-page clippings.
  • FIG. 2 is a listing of a file for producing a directory of web-pages hierarchically organized by topic.
  • FIGS. 3A and 3B are screenshots illustrating dragging-and-dropping of a file onto the tool.
  • FIG. 4 is a screenshot of a user-selected clipping.
  • FIG. 5 is a screenshot of notes that a user can enter.
  • FIG. 6 is a screenshot of a screen including saved clippings and corresponding citation information.
  • FIGS. 7A-7C are screenshots of a different user interface for the tool.
  • FIG. 8 is a flowchart of a process for clipping information from different web-pages.
  • FIG. 9 is a block diagram of a computer.
  • the inventors have designed a tool that works with a user's browser to ease Internet/Intranet-based research.
  • a user can quickly find web-pages that include information pertinent to the user, collect information from the web-pages, and organize the collected material. Additionally, the tool can automatically collect and store information needed for citations to Internet material.
  • a browser screen 100 displays the graphics and text 102 of a URL (Uniform Resource Locator) 104.
  • a URL 104 identifies a resource (e.g., a file of HTML (HyperText Markup Language) instructions and data) at a remote network site. HTML instructions describe an arrangement of text, graphics, animation, video, sound, and other data provided by the resource.
  • HTML files are delivered to the user's computer.
  • a browser (e.g., Netscape™ Navigator™, Microsoft® Internet Explorer™, or Mosaic™) constructs a display based on the received information. As shown, the browser is an independent application; however, browser functions can be incorporated into other applications. Additionally, many browsers can process formats other than HTML (e.g., XML).
  • FIG. 1 also shows a clipping tool 200 superimposed over the browser screen 100.
  • the clipping tool 200 provides different functions for finding research material on the Internet and saving selections from the material (i.e., "clippings") for later use. These functions include a "Select" function that enables a user to quickly find a web-page of interest from a group of pre-selected sites; a "Collect" function that enables a user to save a clipping from a web-page; an "Edit" function that enables a user to enter notes about the material; and a "List" function that enables a user to manage and view clipped material.
  • the tool 200 presents these functions in a window that includes a tabbed GUI (graphical user interface) control that permits quick and intuitive access to the different functions.
  • the clipping tool 200 remains completely visible regardless of whether the tool 200 or the browser 100 is active (i.e., is currently the target of user input). Thus, a user always has easy access to tool 200 functions without using operating system commands to bring the tool 200 to the foreground.
  • the tool 200 and browser 100 can communicate in a variety of ways.
  • a Microsoft™ Internet Explorer™ browser object can be instantiated by programming languages such as Microsoft™ Visual Basic™.
  • the tool 200 can control the behavior of the browser 100 using public methods and properties of the instantiated browser object.
  • the tool 200 is a Java applet that includes instructions executed by a browser's Java interpreter. A wide variety of other implementations are possible.
  • the "Select" function presents a directory 202 of different subject areas.
  • the directory 202 organizes these subjects using a hierarchy of directory nodes. At the "bottom" of the hierarchy are nodes that refer to specific URLs.
  • the system can provide a preview (e.g., a thumbnail or description) (not shown) of the corresponding URL.
  • when a user selects (e.g., clicks a mouse on) a URL node 204d, the tool 200 causes the browser 100 to download information from the URL associated with the node.
  • the user has selected the "Metazoa: Animals, Animals, Animals" node 204d.
  • the tool 200 can build the directory 202 from a file 208 that identifies the different nodes in the directory. For nodes corresponding to specific web-pages, the file 208 can include both the URL 208 and any other information that might be needed for a citation to the web-page such as authorship 206 information.
  • Different files can include different directory information describing different hierarchies of nodes.
  • one directory file may include directory information suitable for high school students while another may include information for professional researchers. This enables researchers of different ages and having different research requirements to access suitable material.
  • a user can specify which directory 208 the tool 200 displays by dragging-and-dropping a directory file 208 onto the tool 200.
  • the tool 200 interprets the file 208 contents to construct a directory.
  • the tool 200 can present a GUI control that enables a user to navigate through different system folders and documents in search of a particular directory file.
  • As shown in FIG. 4, after a user accesses a URL, either by using the directory shown in FIG. 1 or by using more traditional browser techniques (e.g., directly entering a URL or selecting a link), the tool 200 allows the user to create or add to a list of web-page clippings.
  • the tool's 200 "Collect" function presents a control 210 that accepts text, graphics, and other information selected from a web-page.
  • a user has selected a paragraph 212 discussing animal morphology.
  • a user can "drag" (e.g., moving the mouse while depressing a mouse button) the selection 212 from the web-page and drop the selection (e.g., releasing the mouse button) onto the tool 200.
  • the user can copy the selection for later pasting into the tool.
  • the tool 200 stores the selected information.
  • the tool 200 can also analyze the source of the selection (e.g., the web-page originally received from a networked computer) for attributes of the selected information.
  • the tool 200 can also examine text included in the web-page to extract information. For example, the tool 200 can look for text that includes words such as "author" and analyze the text that follows to see if it corresponds to
  • the tool 200 can query the browser 100 for information such as the URL currently being displayed.
  • the tool 200 can also use system functions to determine other information such as the time a URL was loaded into memory. Additionally, if the user reached the URL via the directory file, the tool 200 can retrieve authorship information stored in the directory file (see FIG. 2).
  • the tool 200 automatically initiates its "Edit" function to enable a user to enter notes 214 describing the clipping. These notes 214 enable a user to preserve thoughts about the clipping such as how the clipping might be incorporated into a paper and ideas for subsequent research.
  • the tool 200 also generates a default title 212 for the clipping (e.g., the first few words of selected text). As shown, the tool 200 can also display a thumbnail 213 of any selected graphics.
  • the tool's 200 "List" function lists different collected clippings 220. Selecting the tool's 200 "View" button 221 causes the tool 200 to dynamically prepare HTML instructions for a display 223 that includes the web-page clippings 212 and the collected citation information 224-238.
  • citation information can include when the material was clipped 226, the type of material 228 (e.g., text or graphics), the name of the page 230, and its URL 232.
  • the information can further include the site name 234, authors 236, and site URL 238.
  • the display 223 further includes any notes the user may have entered for the clipping.
  • the URLs 232, 238 can be displayed as
  • any number of different user interfaces can provide functions described above.
  • As shown in FIG. 7A, the tool 200 "Search" function uses pull-down menus that enable a user to navigate through a directory 204 to view the content of different network sites. As shown in FIGS. 7B and 7C, the tool's 200 "Collect" function provides a thumbnail 213 image of selected content and an area to enter notes 214.
  • a user can navigate to a web-page either using
  • the tool can analyze the web-page's contents (e.g., HTML instructions and text) to determine citation information 808. Additionally, the tool can use information stored in the
  • the selected clipping and citation information are saved 810. Thereafter, the tool can dynamically produce an HTML instruction file that includes the clippings.
  • a computer platform 900 includes a monitor 902, a keyboard 904, a pointing device 906 such as a mouse, and a digital computer 910.
  • the digital computer 910 includes memory 912, a processor 914, a mass storage device 916, and other customary components such as a memory bus and peripheral bus (not shown).
  • the platform 900 further includes a network connection 908.
  • Mass storage device 916 can store instructions for an operating system (e.g., Microsoft™ Windows™), a browser, and the tool 200. The instructions may be transferred to memory 912 and processor 914 in the course of operation. Of course, instructions for the tool 200 may be stored on a variety of mass storage devices such as a floppy disk, CD-ROM, or PROM (not shown).
  • the techniques described here are not limited to any particular hardware or software configuration; they may find applicability in any computing or processing environment that may be used for collecting information from networked computers.
  • the techniques may be implemented in hardware or software, or a combination of the two.
  • the techniques are implemented in computer programs executing on programmable computers that each include a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and one or more output devices.
  • Program code is applied to data entered using the input device to perform the functions described and to generate output information.
  • the output information is applied to one or more output devices.
  • Each program is preferably implemented in a high level procedural or object oriented programming language to communicate with a computer system; however, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language.
  • Each such computer program is preferably stored on a storage medium or device (e.g., CD-ROM, hard disk or magnetic diskette) that is readable by a general or special purpose programmable computer for configuring and operating the computer when the storage medium or device is read by the computer to perform the procedures described in this document.
  • the system may also be considered to be implemented as a computer-readable storage medium, configured with a computer program, where the storage medium so configured causes a computer to operate in a specific and predefined manner.
  • Other embodiments are within the scope of the following claims. What is claimed is:
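
The citation analysis described above (examining HTML <META> tags, then looking for text that includes words such as "author") can be illustrated with a minimal Python sketch. It is not the tool's actual implementation: the names `PageScanner` and `determine_citation`, the `Author: ...` text pattern, and the returned dictionary keys are all assumptions made for illustration.

```python
import re
from html.parser import HTMLParser


class PageScanner(HTMLParser):
    """Gathers META name/content pairs, the page title, and visible text."""

    def __init__(self):
        super().__init__()
        self.meta = {}
        self.title = ""
        self.text = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <META NAME=...> works.
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("name") and attrs.get("content"):
            self.meta[attrs["name"].lower()] = attrs["content"].strip()
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif data.strip():
            self.text.append(data.strip())


def determine_citation(page_html, url):
    """Return citation attributes (hypothetical keys) for a page."""
    scanner = PageScanner()
    scanner.feed(page_html)
    scanner.close()
    author = scanner.meta.get("author")
    if author is None:
        # Assumed fallback: scan visible text for a phrase like "Author: Jane Doe".
        for chunk in scanner.text:
            match = re.search(r"\bauthor\s*:\s*(.+)", chunk, re.IGNORECASE)
            if match:
                author = match.group(1).strip()
                break
    return {"url": url, "page_title": scanner.title.strip(), "author": author}
```

Calling `determine_citation(page_html, url)` with the HTML of the page a clipping came from yields a dictionary that can be stored alongside the selection. When the page has neither an author META tag nor a recognizable byline, `author` is `None`, and a tool could then fall back on authorship stored in the directory file.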
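
The "List" and "View" behavior described above (storing clippings with their citation information and dynamically preparing HTML for display) might look like the following sketch. The `Clipping` fields, the five-word default title, and the page layout are assumptions chosen for illustration; the description only says that the default title can be the first few words of the selected text and that the display includes the clippings, citation information, and notes.

```python
import html
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class Clipping:
    selection: str      # the clipped text
    page_title: str     # name of the page the clipping came from
    url: str            # URL of that page
    author: str = ""    # empty when authorship could not be determined
    notes: str = ""     # user-entered notes about the clipping
    clipped_at: datetime = field(default_factory=datetime.now)

    @property
    def title(self):
        """Default title: the first few (here, five) words of the selection."""
        return " ".join(self.selection.split()[:5])


def render_clippings(clippings):
    """Build an HTML page listing each clipping with its citation information."""
    parts = ["<html><body><h1>Clippings</h1>"]
    for clip in clippings:
        # Escape everything that came from a web-page or from the user.
        parts.append(f"<h2>{html.escape(clip.title)}</h2>")
        parts.append(f"<blockquote>{html.escape(clip.selection)}</blockquote>")
        parts.append("<ul>")
        parts.append(f"<li>Clipped: {clip.clipped_at:%Y-%m-%d %H:%M}</li>")
        parts.append(f"<li>Page: {html.escape(clip.page_title)}</li>")
        parts.append(f"<li>URL: {html.escape(clip.url)}</li>")
        if clip.author:
            parts.append(f"<li>Author: {html.escape(clip.author)}</li>")
        parts.append("</ul>")
        if clip.notes:
            parts.append(f"<p>Notes: {html.escape(clip.notes)}</p>")
    parts.append("</body></html>")
    return "\n".join(parts)
```

Generating the page on demand from stored records, rather than saving finished HTML, keeps the stored clippings independent of how they are displayed.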

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Remote Sensing (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Information Transfer Between Computers (AREA)
  • User Interface Of Digital Computer (AREA)
  • Debugging And Monitoring (AREA)
  • Computer And Data Communications (AREA)

Abstract

A method of collecting information received from a networked computer includes receiving a user selection from displayed information received from a networked computer, analyzing the information received from the networked computer to determine at least one attribute of the user selection, and storing the selection and the determined attribute(s).
PCT/US2000/005008 1999-02-26 2000-02-25 Network information collection tool Ceased WO2000050972A2 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU40045/00A AU4004500A (en) 1999-02-26 2000-02-25 Network information collection tool

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US25939599A 1999-02-26 1999-02-26
US09/259,395 1999-02-26

Publications (3)

Publication Number Publication Date
WO2000050972A2 (fr) 2000-08-31
WO2000050972A3 (fr) 2000-12-28
WO2000050972A9 (fr) 2001-04-19

Family

ID=22984767

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/005008 Ceased WO2000050972A2 (fr) 1999-02-26 2000-02-25 Network information collection tool

Country Status (2)

Country Link
AU (1) AU4004500A (fr)
WO (1) WO2000050972A2 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6366923B1 (en) * 1998-03-23 2002-04-02 Webivore Research, Llc Gathering selected information from the world wide web
WO2002021305A3 (fr) * 2000-09-05 2003-02-27 Nicola Carena Edgardo Di System and method for accessing and organizing information available on a network
WO2003079133A3 (fr) * 2002-03-15 2004-02-05 Koninkl Philips Electronics Nv Previewing documents on a computer system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2210581C (fr) * 1995-01-23 2002-03-26 British Telecommunications Public Limited Company Methods and/or systems for accessing information
US5931907A (en) * 1996-01-23 1999-08-03 British Telecommunications Public Limited Company Software agent for comparing locally accessible keywords with meta-information and having pointers associated with distributed information
US5862325A (en) * 1996-02-29 1999-01-19 Intermind Corporation Computer-based communication system and method using metadata defining a control structure

Also Published As

Publication number Publication date
WO2000050972A3 (fr) 2000-12-28
AU4004500A (en) 2000-09-14
WO2000050972A9 (fr) 2001-04-19

Similar Documents

Publication Publication Date Title
US6310630B1 (en) Data processing system and method for internet browser history generation
US6832350B1 (en) Organizing and categorizing hypertext document bookmarks by mutual affinity based on predetermined affinity criteria
US6597377B1 (en) Web links objects
US10706091B2 (en) User driven computerized selection, categorization, and layout of live content components
US6366923B1 (en) Gathering selected information from the world wide web
US20030184582A1 (en) Browser plug-ins
US7188319B2 (en) Displaying graphical information and user selected properties on a computer interface
US7631255B2 (en) Interface and method for exploring a collection of data
US7519573B2 (en) System and method for clipping, repurposing, and augmenting document content
US6667751B1 (en) Linear web browser history viewer
US5978828A (en) URL bookmark update notification of page content or location changes
US6100890A (en) Automatic bookmarks
CA2333372C (fr) Multi-target links for navigating between hypertext documents and the like
US7962843B2 (en) Browser session overview
US7315848B2 (en) Web snippets capture, storage and retrieval system and method
KR100355853B1 (ko) Method and system for asynchronous printing of web documents
US7406664B1 (en) System for integrating HTML Web site views into application file dialogs
KR100337810B1 (ko) Specialized search website on the Internet and search method thereof
US20060155728A1 (en) Browser application and search engine integration
US7546530B1 (en) Method and apparatus for mapping a site on a wide area network
WO2007134894A1 (fr) Method for inserting bookmarks in an Internet browser
EP1974318A2 (fr) System, method, and software device for assisting Internet navigation
US20030080986A1 (en) System and method for accessing and utilizing remote bookmark lists
US8826112B2 (en) Navigating table data with mouse actions
US8181116B1 (en) Method and apparatus for hyperlink list navigation

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

AK Designated states

Kind code of ref document: C2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 1/11-11/11, DRAWINGS, REPLACED BY NEW PAGES 1/11-11/11

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (R.69(1) EPC) SENT ON 061202

122 Ep: pct application non-entry in european phase