[go: up one dir, main page]

WO2002017162A3 - Capture, storage and retrieval of markup elements - Google Patents

Capture, storage and retrieval of markup elements Download PDF

Info

Publication number
WO2002017162A3
WO2002017162A3 PCT/GB2001/003782 GB0103782W WO0217162A3 WO 2002017162 A3 WO2002017162 A3 WO 2002017162A3 GB 0103782 W GB0103782 W GB 0103782W WO 0217162 A3 WO0217162 A3 WO 0217162A3
Authority
WO
WIPO (PCT)
Prior art keywords
elements
stored
storage
cards
mark
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/GB2001/003782
Other languages
French (fr)
Other versions
WO2002017162A2 (en
Inventor
Geraint Wyn Edwards
Christopher Leslie Needham
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
COPYN Ltd
Original Assignee
COPYN Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0021074A external-priority patent/GB2366497A/en
Priority claimed from GB0021078A external-priority patent/GB2366498A/en
Priority claimed from GB0021081A external-priority patent/GB2366499A/en
Application filed by COPYN Ltd filed Critical COPYN Ltd
Priority to AU2001282317A priority Critical patent/AU2001282317A1/en
Publication of WO2002017162A2 publication Critical patent/WO2002017162A2/en
Anticipated expiration legal-status Critical
Publication of WO2002017162A3 publication Critical patent/WO2002017162A3/en
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9562Bookmark management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

Portions of mark-up language pages may be stored in an on-line repository. The user selects a portion of a page for storage using a pointer device and an extension to a browser context menu. If the mark-up code for the selected portion corresponds to a predefined meaningful element, the DOM node to which it refers is identified and the node tree traversed to look for meaningful collections of elements, the raw HTML is then extracted and sent to a new window where it can be selected and stored in a remote database. The database is configured to enable a scrapbook like presentation of displayed elements with elements displayed as cards. Cards may be stored in a number of leaves and card parameters, and leaf configurations may be customised by a user. Access rights can be granted to allow elements in a given repository to be viewed by others.
PCT/GB2001/003782 2000-08-25 2001-08-22 Capture, storage and retrieval of markup elements Ceased WO2002017162A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001282317A AU2001282317A1 (en) 2000-08-25 2001-08-22 Capture, storage and retrieval of markup elements

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
GB0021074A GB2366497A (en) 2000-08-25 2000-08-25 Database for storage and retrieval of bookmarks of portions of web-pages
GB0021078.1 2000-08-25
GB0021074.0 2000-08-25
GB0021081.5 2000-08-25
GB0021078A GB2366498A (en) 2000-08-25 2000-08-25 Method of bookmarking a section of a web-page and storing said bookmarks
GB0021081A GB2366499A (en) 2000-08-25 2000-08-25 A method of storing a portion of a web-page

Publications (2)

Publication Number Publication Date
WO2002017162A2 WO2002017162A2 (en) 2002-02-28
WO2002017162A3 true WO2002017162A3 (en) 2004-04-08

Family

ID=27255862

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2001/003782 Ceased WO2002017162A2 (en) 2000-08-25 2001-08-22 Capture, storage and retrieval of markup elements

Country Status (2)

Country Link
AU (1) AU2001282317A1 (en)
WO (1) WO2002017162A2 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7356762B2 (en) 2002-07-08 2008-04-08 Asm International Nv Method for the automatic generation of an interactive electronic equipment documentation package
US8370423B2 (en) 2006-06-16 2013-02-05 Microsoft Corporation Data synchronization and sharing relationships
US8595635B2 (en) 2007-01-25 2013-11-26 Salesforce.Com, Inc. System, method and apparatus for selecting content from web sources and posting content to web logs
US8429551B2 (en) * 2007-02-15 2013-04-23 Microsoft Corporation Application-based copy and paste operations
US8918717B2 (en) 2007-05-07 2014-12-23 International Business Machines Corporation Method and sytem for providing collaborative tag sets to assist in the use and navigation of a folksonomy
EP2323084A1 (en) * 2009-10-23 2011-05-18 Alcatel Lucent Artifact management method
US20120041922A1 (en) * 2010-08-15 2012-02-16 Sap Portals Israel Ltd Shareable content container
US9430583B1 (en) * 2011-06-10 2016-08-30 Salesforce.Com, Inc. Extracting a portion of a document, such as a web page
KR20130048926A (en) * 2011-11-03 2013-05-13 삼성전자주식회사 Method for scrap of digital magazine edited in layers and apparatus therof
KR101928915B1 (en) * 2012-02-24 2019-03-12 삼성전자 주식회사 Apparatus and method for processing a data of mobile terminal
US9753926B2 (en) 2012-04-30 2017-09-05 Salesforce.Com, Inc. Extracting a portion of a document, such as a web page
KR102041453B1 (en) * 2018-12-07 2019-11-27 삼성전자 주식회사 Apparatus and method for processing a data of mobile terminal
CN115129960A (en) * 2022-07-04 2022-09-30 北京百度网讯科技有限公司 Data capture method and device, electronic equipment and storage medium
CN120747974B (en) * 2025-08-29 2025-12-16 之江实验室 A method, system, device, and storage medium for translating front-end dynamic screenshots.

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6021416A (en) * 1997-11-25 2000-02-01 International Business Machines Corporation Dynamic source code capture for a selected region of a display

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6021416A (en) * 1997-11-25 2000-02-01 International Business Machines Corporation Dynamic source code capture for a selected region of a display

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ABEL T: "Microsoft Office 2000: Create Dynamic Digital Dashboards Using Office, OLAP, and DHTML", MSDN MAGAZINE, 1 July 2000 (2000-07-01), pages 1 - 7, XP002251439, Retrieved from the Internet <URL:http://msdn.microsoft.com/msdnmag/issues/0700/Dashboard> [retrieved on 20030813] *
LIU LING ET AL: "XWRAP: An XML-enabled wrapper construction system for Web information sources", DATA ENGINEERING, 2000. PROCEEDINGS. 16TH INTERNATIONAL CONFERENCE ON SAN DIEGO, CA, USA 29 FEB.-3 MARCH 2000, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 29 February 2000 (2000-02-29), pages 611 - 621, XP002246421, ISBN: 0-7695-0506-6 *
SAHUGUET A, AZAVANT F: "WysiWyg Web Wrapper Factory (W4F)", INTERNET ARTICLE, 1999, pages 1 - 22, XP002251438, Retrieved from the Internet <URL:http://citeseer.nj.nec.com/95215.html> [retrieved on 20030812] *
WOOD L: "Programming the Web: the W3C DOM specification", IEEE INTERNET COMPUTING, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 3, no. 1, January 1999 (1999-01-01), pages 48 - 54, XP002163911, ISSN: 1089-7801 *

Also Published As

Publication number Publication date
AU2001282317A1 (en) 2002-03-04
WO2002017162A2 (en) 2002-02-28

Similar Documents

Publication Publication Date Title
WO2002017162A3 (en) Capture, storage and retrieval of markup elements
US6449636B1 (en) System and method for creating a dynamic data file from collected and filtered web pages
US10650185B2 (en) Accessible processing method of webpage contents and accessible webpage device
CN101582075B (en) Web information extraction system
Boden Escaping from the Chinese room
Perry et al. Handbook of air pollution analysis.
US20050091203A1 (en) Method and apparatus for improving the readability of an automatically machine-generated summary
CA2508500A1 (en) An architecture for ink annotations on web documents
CA2209541A1 (en) System for and method for providing multimedia bookmarks for hypertext markup language files
US20100083095A1 (en) Method for Extracting Data from Web Pages
US20080071739A1 (en) Using anchor text to provide context
RU2003119519A (en) TEXT PROCESSING DOCUMENT STORED IN A SINGLE XML FILE THAT MAY PERFORM AN APPLICATION UNDERSTANDING THE XML LANGUAGE
KR20090006464A (en) Device for providing customized contents, method and recording medium
Gottron Evaluating content extraction on HTML documents
CN101702160A (en) Method for acquiring internet subject information and device thereof
JP2007188352A (en) Page reranking device, page reranking program
CN106202312A (en) A kind of interest point search method for mobile Internet and system
JP5858479B2 (en) Terminal device and program
CN103577444A (en) Browser control method and system
US20190138657A1 (en) Information processing device and information terminal
Abdul-Rahman et al. Automatic Pagination of HTML Documents in a Web Browser
EP1255207A3 (en) Method and apparatus for automatically searching hypertext structure
JP2012146065A (en) Sentence retrieval device
JP2005339379A (en) Information display system and information display method
KR20010097705A (en) Method of imformation display for web clip

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP