WO2000049517A2 - Multi-document summarization system and method - Google Patents
Multi-document summarization system and method Download PDFInfo
- Publication number
- WO2000049517A2 WO2000049517A2 PCT/US2000/004118 US0004118W WO0049517A2 WO 2000049517 A2 WO2000049517 A2 WO 2000049517A2 US 0004118 W US0004118 W US 0004118W WO 0049517 A2 WO0049517 A2 WO 0049517A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- phrases
- nodes
- phrase
- temporal
- documents
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/34—Browsing; Visualisation therefor
- G06F16/345—Summarisation for human users
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
Definitions
- a present method for generating a summary of related documents in a collection includes extracting phrases from the documents which have common focus elements. Phrase intersection analysis is performed on the extracted phrases to generate a phrase intersection table. Temporal processing can be performed on the phrases in the phrase intersection table to remove ambiguous temporal references and to sort the phrases in a temporal sequence. Sentence generation is performed using the phrases in the phrase intersection table to generate the multi document summary.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Abstract
Description
Claims
Priority Applications (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA2363017A CA2363017C (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
| HK02106992.3A HK1045391A1 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
| AU40026/00A AU775978B2 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
| US09/913,745 US7366711B1 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
| IL14495100A IL144951A0 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
| EP00919318A EP1190343A4 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
| IL144951A IL144951A (en) | 1999-02-19 | 2001-08-16 | Multi-document summarization system and method |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12065999P | 1999-02-19 | 1999-02-19 | |
| US60/120,659 | 1999-02-19 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2000049517A2 true WO2000049517A2 (en) | 2000-08-24 |
| WO2000049517A3 WO2000049517A3 (en) | 2000-11-30 |
Family
ID=22391735
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2000/004118 Ceased WO2000049517A2 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
Country Status (6)
| Country | Link |
|---|---|
| EP (1) | EP1190343A4 (en) |
| AU (1) | AU775978B2 (en) |
| CA (1) | CA2363017C (en) |
| HK (1) | HK1045391A1 (en) |
| IL (2) | IL144951A0 (en) |
| WO (1) | WO2000049517A2 (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002035376A3 (en) * | 2000-10-27 | 2003-08-28 | Science Applic Int Corp | Ontology-based parser for natural language processing |
| WO2008155225A1 (en) * | 2007-06-20 | 2008-12-24 | Amadeus S.A.S. | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
| US7496561B2 (en) | 2001-01-18 | 2009-02-24 | Science Applications International Corporation | Method and system of ranking and clustering for document indexing and retrieval |
| US11374888B2 (en) | 2015-09-25 | 2022-06-28 | Microsoft Technology Licensing, Llc | User-defined notification templates |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4965763A (en) * | 1987-03-03 | 1990-10-23 | International Business Machines Corporation | Computer method for automatic extraction of commonly specified information from business correspondence |
| JP2783558B2 (en) * | 1988-09-30 | 1998-08-06 | 株式会社東芝 | Summary generation method and summary generation device |
| JPH0418673A (en) * | 1990-05-11 | 1992-01-22 | Hitachi Ltd | Method and device for extracting text information |
| US5638543A (en) * | 1993-06-03 | 1997-06-10 | Xerox Corporation | Method and apparatus for automatic document summarization |
| US5384703A (en) * | 1993-07-02 | 1995-01-24 | Xerox Corporation | Method and apparatus for summarizing documents according to theme |
| US5689716A (en) * | 1995-04-14 | 1997-11-18 | Xerox Corporation | Automatic method of generating thematic summaries |
| US5778397A (en) * | 1995-06-28 | 1998-07-07 | Xerox Corporation | Automatic method of generating feature probabilities for automatic extracting summarization |
| US5838323A (en) * | 1995-09-29 | 1998-11-17 | Apple Computer, Inc. | Document summary computer system user interface |
| US5848191A (en) * | 1995-12-14 | 1998-12-08 | Xerox Corporation | Automatic method of generating thematic summaries from a document image without performing character recognition |
| US5924108A (en) * | 1996-03-29 | 1999-07-13 | Microsoft Corporation | Document summarizer for word processors |
-
2000
- 2000-02-18 WO PCT/US2000/004118 patent/WO2000049517A2/en not_active Ceased
- 2000-02-18 AU AU40026/00A patent/AU775978B2/en not_active Ceased
- 2000-02-18 CA CA2363017A patent/CA2363017C/en not_active Expired - Fee Related
- 2000-02-18 IL IL14495100A patent/IL144951A0/en active IP Right Grant
- 2000-02-18 EP EP00919318A patent/EP1190343A4/en not_active Ceased
- 2000-02-18 HK HK02106992.3A patent/HK1045391A1/en unknown
-
2001
- 2001-08-16 IL IL144951A patent/IL144951A/en not_active IP Right Cessation
Cited By (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002035376A3 (en) * | 2000-10-27 | 2003-08-28 | Science Applic Int Corp | Ontology-based parser for natural language processing |
| US7027974B1 (en) | 2000-10-27 | 2006-04-11 | Science Applications International Corporation | Ontology-based parser for natural language processing |
| US7496561B2 (en) | 2001-01-18 | 2009-02-24 | Science Applications International Corporation | Method and system of ranking and clustering for document indexing and retrieval |
| WO2008155225A1 (en) * | 2007-06-20 | 2008-12-24 | Amadeus S.A.S. | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
| CN101765857A (en) * | 2007-06-20 | 2010-06-30 | 阿玛得斯两合公司 | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
| JP2010530580A (en) * | 2007-06-20 | 2010-09-09 | アマデウス エス.エイ.エス | System and method for integrated display of travel advice collected from multiple trusted sources |
| US7818117B2 (en) | 2007-06-20 | 2010-10-19 | Amadeus S.A.S. | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
| CN101765857B (en) * | 2007-06-20 | 2013-06-19 | 阿玛得斯两合公司 | System and method for integrating and displaying travel advice collected from multiple reliable sources |
| US11374888B2 (en) | 2015-09-25 | 2022-06-28 | Microsoft Technology Licensing, Llc | User-defined notification templates |
Also Published As
| Publication number | Publication date |
|---|---|
| EP1190343A4 (en) | 2006-08-09 |
| HK1045391A1 (en) | 2002-11-22 |
| CA2363017C (en) | 2011-04-19 |
| IL144951A (en) | 2006-08-01 |
| IL144951A0 (en) | 2002-06-30 |
| AU775978B2 (en) | 2004-08-19 |
| EP1190343A2 (en) | 2002-03-27 |
| AU4002600A (en) | 2000-09-04 |
| CA2363017A1 (en) | 2000-08-24 |
| WO2000049517A3 (en) | 2000-11-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US7366711B1 (en) | Multi-document summarization system and method | |
| US7412385B2 (en) | System for identifying paraphrases using machine translation | |
| US8260817B2 (en) | Semantic matching using predicate-argument structure | |
| Harabagiu et al. | Topic themes for multi-document summarization | |
| US20020078090A1 (en) | Ontological concept-based, user-centric text summarization | |
| EP0886226A1 (en) | Linguistic search system | |
| US20020046018A1 (en) | Discourse parsing and summarization | |
| Jungermann | Information extraction with rapidminer | |
| WO2001096980A2 (en) | Method and system for text analysis | |
| Smadja | From n-grams to collocations: An evaluation of Xtract | |
| Moschitti et al. | Open Domain Information Extraction via Automatic Semantic Labeling. | |
| Yeasmin et al. | Study of abstractive text summarization techniques | |
| CN113779961A (en) | Method for extracting conventional sentence pattern of natural language text and electronic device | |
| AU775978B2 (en) | Multi-document summarization system and method | |
| Alias et al. | A Malay text corpus analysis for sentence compression using pattern-growth method | |
| Rahat et al. | A recursive algorithm for open information extraction from Persian texts | |
| Zeni et al. | Annotating legal documents with GaiusT 2.0 | |
| Chebanyuk | Multilingual Question-Driven Approach and Software System to Obtaining Information From Texts. | |
| Al-sarrayrih et al. | Clustering arabic documents using frequent itemset-based hierarchical clustering with an N-grams | |
| Piotrowski | NLP-supported full-text retrieval | |
| Muthusamy | Processing the Textual Information Using Open Natural Language Processing | |
| Ou et al. | Multi‐document summarization of news articles using an event‐based framework | |
| Lukose et al. | Extracting financial information from text documents | |
| Gibb | Knowledge-based indexing | |
| Benafia et al. | From Linguistic to Conceptual: A Framework Based on a Pipeline for Building Ontologies from Texts. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 144951 Country of ref document: IL |
|
| ENP | Entry into the national phase |
Ref document number: 2363017 Country of ref document: CA Ref document number: 2363017 Country of ref document: CA Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: IN/PCT/2001/00737/DE Country of ref document: IN |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2000919318 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 09913745 Country of ref document: US |
|
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| WWP | Wipo information: published in national office |
Ref document number: 2000919318 Country of ref document: EP |