WO2009073032A1 - Systems and methods for intelligent paperless document management - Google Patents
Systems and methods for intelligent paperless document management
- Publication number
- WO2009073032A1 (application PCT/US2007/086673)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- documents
- document
- instant invention
- user
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/26—Techniques for post-processing, e.g. correcting the recognition result
- G06V30/262—Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
- G06V30/268—Lexical context
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the method described further comprises the step of organizing documents using at least one business rule manager.
- said business rule manager uses at least one workflow rule.
- Figure 33 shows that a user can create a new cabinet using loan katalyst.
- Figure 37 shows that a user can create a new role using loan katalyst.
- Figure 43 shows that picture files can be easily uploaded and viewed using loan katalyst.
- Figure 45 shows that a user can create a custom delivery package using loan katalyst.
- the term "field" refers to the region of a document where specific items of information might be found.
- the individual's name is a "fact" and may also be referred to herein as a "text snippet" when the fact is extracted from a field.
- fields are converted into facts by extracting the information and "scrubbing" the text output to create a value that can be utilized and/or consumed by a computer in the operation of embodiments of the instant invention.
- Feature Vector refers to a manner of mapping documents wherein the relationship of keywords to fields or keywords to other keywords is mapped both as to physical distance and direction.
- the Location Diagram is a relative position map of key phrases, each represented in a unique way by its vector of relative distances.
- the structured files are represented in flexible structure maps called grid files.
- the purposes of the instant invention include conducting a business and making business decisions using automated acquisition and analysis of information from a Dox Package.
- This invention thus, in part, provides:
- if word 'W1' is defined as having a very high chance of occurrence in document 'D1' (e.g., the word 'interest' ('W1') in a mortgage note ('D1')), then, according to the uses and principles of the instant invention, the word 'W1' has high affinity towards document 'D1'.
- This affinity may be determined using Bayesian analysis and is represented as a probability or a conditional probability.
- Other Feature Vectors such as font size and type may also be considered in determining the affinity of a page to the reference page of document being examined. There is no limit to the number of Feature Vectors that might be considered for affinity analysis.
- Each document and/or page within the Dox Package is mapped to a class according to the taxonomy.
- the method/apparatus of the instant invention classifies these documents and collates sets of pages for industry standard taxonomy like MISMO, or any given taxonomy.
- a further feature in some embodiments of the instant invention is that the method/apparatus of the invention is also capable of generating its own taxonomy based on the document features it observes. The overall method assigns the most logical document structure based on the taxonomy and the most appropriate position within each document for each page.
- the instant invention, in preferred embodiments, is the only comprehensive collation solution which can collate pages, documents, or sets of documents identified by revision numbers, for business decision making purposes;
- the first step in administering an Online Client Site can be to administer the Folder Types that may be setup using the Online Client Site.
- Step 731 can be an extension of Folder Types administration into the Online Client Site.
- Client Sites that have subscribed to a single Folder Type through a domain, or have subscribed to the capability to create Custom Folder Types can be offered Online Administration of these Folder Types.
- Authorized Client Site Administrators can be provided with the ability to customize Folder Types to enhance the searchability and usability of Folder instances of those Folder Types.
- Folder Types have properties that may be turned on and off to enhance the security model of the organization.
- Users with Public Inboxes can eliminate having sensitive documents sent solely via unsecured email to their email clients, or deploying or subscribing to third party fax-to-email servers, and can instead take advantage of Fax/SFTP/Secured Email/HTTPS/Secured Upload directly to their Inbox, and have the capability to file those documents and data securely in a repository.
- Administration of this very complex deployment of Public and Private Inboxes can be easy enough for business users to deploy without requiring specific technical knowledge.
- a Queue may have one or more automated Alerts defined.
- a Queue may have one or more automated or manual processing rules assigned.
- a Queue or its work tasks may have time limits for completion, after which an assigned automated processing rule is triggered.
- a Queue may be accessible by Authorized Users assigned to the Queue.
- a Queue may be stand alone, part of a series, or allow parallel processing.
- Site-to-Site delivery can allow any Client Site on the IPDM Network to deliver documents and/or data to any other Client Site on the IPDM Network, regardless of the Domain.
- Administration of Site-to-Site Delivery is depicted in step 741.
- Authorized Client Site Administrators may create, edit, disable, or delete a Site-to-Site delivery protocol between Client Sites.
- a Site-to-Site Delivery protocol may be enabled for any external Client Site.
- a Site-to-Site Delivery protocol can require two-party authentication between Client Sites.
- a Site-to-Site Delivery Protocol may be authorized by one Client Site to accept Inbound Deliveries from another Client Site through a designated Inbox.
- a fax coversheet containing details describing the destinations of the accompanying documents can be used to facilitate automatic indexing and/or routing of the documents to the desired folder (see examples below and Figure 52).
- an e-mail containing folder ID numbers in the subject line can be used to facilitate automatic indexing and/or routing of attached documents to the desired folder.
- Such an intelligent inbox can accept incoming documents by various means, such as fax or e-mail, in various formats, such as PDF, TIFF, or GIF files.
- Step 808 also provides the use of at least one data extraction engine, which can pull data points from the pages of a document.
- Data can be extracted with high precision from native pdf files.
- data extraction can be carried out from all Fannie Mae SMART Docs, because many lenders and investors continue to produce electronic loan documents in PDF format.
- Data extraction (or data capture) services are available to isolate key fields, enabling anti-fraud and other analytics at high speed for both post-closing and pre-funding applications.
- the data extraction engine of step 808 can minimize manual data re-entry, which is time-consuming and error-prone. If all extracted data are consistent with one another, the extracted data can be stored in a searchable online electronic repository 810 in at least one specified format. If there is any inconsistency among the extracted data, step 809 flags the process for human intervention. In some embodiments, the human intervention can be sorting, modifying, or deleting at least one document or file.
- Step 812 provides the use of at least one online client site administration portal.
- An exemplary administration portal is illustrated in Figure 7.
- Step 814 can also provide at least one Business Rule Manager based on at least one Workflow Rule.
- Step 815 provides the use of online collaborative folders, where each folder comprises different metadata.
- the system can require the input of an e-signature within a certain time limit for security reasons.
- the documents will be delivered into the recipient's inbox, where the delivered documents can be processed and indexed depending on the recipient's choice. Of course, there can be more than one recipient regardless of the means of delivery.
- Document pages that are still not mapped to a taxonomy class due to bad image quality, a new variation of a document, or for other reasons that do not result in immediate identification or classification are flagged as Unknown.
- Document pages that fall below the confidence threshold value, which may be preset or varied by the user, even after the verifier, are sent to the exception handling client (Classification client), i.e., via escalation (218).
- human collaborators can verify the class, assign a class, or note that the document cannot be identified. If a human collaborator verifies or changes the class, this information is sent to a feedback box for incremental learning.
- the method/apparatus of the instant invention may find and note as identical two identical mortgage notes in a single taxonomy class.
- This collation process of the instant invention is differentiated from classification technologies known in the art by its ability to distinguish closely related documents.
- An example of this is that the method/apparatus of the instant invention can pick two mortgage notes out of a Dox Package, correctly paginate them, and identify and log them as separate, but otherwise identical, documents. Pages or documents are then segregated into a logical group determination, and the pages are mapped to a predetermined business-specific or user-determined taxonomy.
- the collation process is based on incremental learning and various artificial intelligence ("AI")-based techniques, which may include one or more of the following, such as:
- the Image and Location Diagram based locator can locate the information on forms irrespective of poor quality of images/OCR output.
- the instant invention, in preferred embodiments, features collation of Knowledge Objects to create Business Objects.
- the instant invention, in preferred embodiments, features efficient decision making based on Business Objects.
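
The definitions above describe word-to-document affinity as a conditional probability obtained by Bayesian analysis, and a confidence threshold below which pages are flagged as Unknown and escalated. The following is a minimal sketch of that idea, not the patented implementation: it assumes a simple naive Bayes model with add-one smoothing, and the names `train`, `affinity`, `classify`, and the default threshold of 0.8 are hypothetical choices made only for illustration.

```python
import math
from collections import Counter


def train(labeled_pages):
    """Count words per taxonomy class from (class_name, text) pairs."""
    word_counts = {}
    page_counts = Counter()
    for cls, text in labeled_pages:
        page_counts[cls] += 1
        word_counts.setdefault(cls, Counter()).update(text.lower().split())
    return word_counts, page_counts


def affinity(word, cls, word_counts, vocab_size):
    """Estimate P(word | class) with add-one smoothing (an assumption)."""
    counts = word_counts[cls]
    return (counts[word] + 1) / (sum(counts.values()) + vocab_size)


def classify(text, word_counts, page_counts, threshold=0.8):
    """Return (class, confidence); low-confidence pages come back as 'Unknown'."""
    vocab = set().union(*word_counts.values())
    total_pages = sum(page_counts.values())
    words = text.lower().split()
    log_scores = {}
    for cls in word_counts:
        # Log prior of the class plus the log affinity of every word on the page.
        score = math.log(page_counts[cls] / total_pages)
        for w in words:
            score += math.log(affinity(w, cls, word_counts, len(vocab)))
        log_scores[cls] = score
    # Normalize the log scores into posterior probabilities.
    top = max(log_scores.values())
    weights = {c: math.exp(s - top) for c, s in log_scores.items()}
    best = max(weights, key=weights.get)
    confidence = weights[best] / sum(weights.values())
    if confidence < threshold:
        # Hypothetical stand-in for escalation to the classification client.
        return "Unknown", confidence
    return best, confidence


# Tiny illustrative training set (invented text, not from the patent).
training = [
    ("mortgage_note", "interest rate principal borrower promises to pay interest"),
    ("appraisal", "property value appraiser comparable sales"),
]
word_counts, page_counts = train(training)
label, confidence = classify("interest rate principal", word_counts, page_counts)
```

In this sketch a page returned as `Unknown` would be handed to a human collaborator, whose decision could be appended to the training pairs before calling `train` again, which loosely mirrors the incremental learning feedback described above.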
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Entrepreneurship & Innovation (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Data Mining & Analysis (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Quality & Reliability (AREA)
- Operations Research (AREA)
- Marketing (AREA)
- Economics (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
Abstract
The systems and methods of the invention provide intelligent, web-based paperless document management. Users can collect, store, and share documents held at various locations. The invention also relates to systems and methods whose data extraction capabilities require little data re-entry. The systems and methods described herein can deliver documents over the Internet to many people, without barcodes or separator sheets for faxing or sending them.
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CA2745712A CA2745712C (en) | 2007-12-06 | 2007-12-06 | Systems and methods for intelligent paperless document management |
| CA2957327A CA2957327A1 (en) | 2007-12-06 | 2007-12-06 | Systems and methods for intelligent paperless document management |
| PCT/US2007/086673 WO2009073032A1 (en) | 2007-12-06 | 2007-12-06 | Systems and methods for intelligent paperless document management |
| EP07865320A EP2227750A4 (en) | 2007-12-06 | 2007-12-06 | Systems and methods for intelligent paperless document management |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2007/086673 WO2009073032A1 (en) | 2007-12-06 | 2007-12-06 | Systems and methods for intelligent paperless document management |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2009073032A1 (en) | 2009-06-11 |
Family
ID=40718013
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2007/086673 (Ceased) WO2009073032A1 (en) | 2007-12-06 | 2007-12-06 | Systems and methods for intelligent paperless document management |
Country Status (3)
| Country | Link |
|---|---|
| EP (1) | EP2227750A4 (fr) |
| CA (2) | CA2745712C (fr) |
| WO (1) | WO2009073032A1 (fr) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130332369A1 (en) * | 2012-06-12 | 2013-12-12 | International Business Machines Corporation | Leveraging analytics to propose context sensitive workflows for case management solutions |
| WO2016073612A1 (en) * | 2014-11-04 | 2016-05-12 | Beacon Communications, Llc | Consumer service portal |
| US10110769B2 (en) | 2014-11-04 | 2018-10-23 | Tata Consultancy Services Ltd. | Computer implemented system and method for managing a stack containing a plurality of documents |
| CN110909196A (en) * | 2019-10-28 | 2020-03-24 | 北京光年无限科技有限公司 | Processing method and device for identifying switching between inner pages and cover during picture-book reading |
| US11238539B1 (en) * | 2019-04-03 | 2022-02-01 | Progressive Casualty Insurance Company | Intelligent routing control |
| US20230105207A1 (en) * | 2021-10-06 | 2023-04-06 | Bank Of America Corporation | System and methods for intelligent entity-wide data protection |
| US20240086739A1 (en) * | 2021-06-29 | 2024-03-14 | Instabase Inc. | Systems and methods to identify document transitions between adjacent documents within document bundles |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2015008300A2 (en) * | 2013-07-19 | 2015-01-22 | Parag Kulkarni | Instance-specific, device-specific, duration-specific, view-specific, time-stamp-specific, network-specific file/content sharing system |
| CA3073199A1 (en) * | 2017-08-18 | 2019-02-21 | ISMS Solutions, LLC | Computerized learning system for agreement analysis |
| US11049042B2 (en) * | 2018-11-05 | 2021-06-29 | Convr Inc. | Systems and methods for extracting specific data from documents using machine learning |
| US11270213B2 (en) | 2018-11-05 | 2022-03-08 | Convr Inc. | Systems and methods for extracting specific data from documents using machine learning |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10114821B2 (en) * | 2000-12-27 | 2018-10-30 | Tractmanager, Inc. | Method and system to access to electronic business documents |
| US7197703B1 (en) * | 2001-04-09 | 2007-03-27 | Critical Technologies, Inc. | System and methodology for the storage and manipulation of documents |
| US7146367B2 (en) * | 2002-05-14 | 2006-12-05 | Advectis, Inc. | Document management system and method |
| US7373365B2 (en) * | 2004-04-13 | 2008-05-13 | Satyam Computer Services, Ltd. | System and method for automatic indexing and archiving of paper documents |
- 2007-12-06: EP application EP07865320A (EP2227750A4), status: Ceased
- 2007-12-06: CA application CA2745712A (CA2745712C), status: Active
- 2007-12-06: WO application PCT/US2007/086673 (WO2009073032A1), status: Ceased
- 2007-12-06: CA application CA2957327A (CA2957327A1), status: Abandoned
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040199460A1 (en) * | 2000-05-31 | 2004-10-07 | Kenneth Barash | Speech recognition system for interactively gathering and storing verbal information to generate documents |
| US20020143704A1 (en) * | 2001-03-27 | 2002-10-03 | Nassiri Nicholas N. | Signature verifcation using a third party authenticator via a paperless electronic document platform |
| US20050108001A1 (en) * | 2001-11-15 | 2005-05-19 | Aarskog Brit H. | Method and apparatus for textual exploration discovery |
| US20050080721A1 (en) * | 2003-10-09 | 2005-04-14 | Kearney Victor Paul | Automated financial transaction due diligence systems and methods |
| US20050209955A1 (en) * | 2004-03-16 | 2005-09-22 | Underwood Timothy J | Apparatus and method for document processing |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP2227750A4 * |
Cited By (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130332369A1 (en) * | 2012-06-12 | 2013-12-12 | International Business Machines Corporation | Leveraging analytics to propose context sensitive workflows for case management solutions |
| US20130332403A1 (en) * | 2012-06-12 | 2013-12-12 | International Business Machines Corporation | Leveraging analytics to propose context sensitive workflows for case management solutions |
| WO2016073612A1 (en) * | 2014-11-04 | 2016-05-12 | Beacon Communications, Llc | Consumer service portal |
| US10110769B2 (en) | 2014-11-04 | 2018-10-23 | Tata Consultancy Services Ltd. | Computer implemented system and method for managing a stack containing a plurality of documents |
| US11238539B1 (en) * | 2019-04-03 | 2022-02-01 | Progressive Casualty Insurance Company | Intelligent routing control |
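| CN110909196A (en) * | 2019-10-28 | 2020-03-24 | 北京光年无限科技有限公司 | Processing method and device for identifying switching between inner pages and cover during picture-book reading |
| CN110909196B (en) * | 2019-10-28 | 2022-07-01 | 北京光年无限科技有限公司 | Processing method and device for identifying switching between inner pages and cover during picture-book reading |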
| US20240086739A1 (en) * | 2021-06-29 | 2024-03-14 | Instabase Inc. | Systems and methods to identify document transitions between adjacent documents within document bundles |
| US20230105207A1 (en) * | 2021-10-06 | 2023-04-06 | Bank Of America Corporation | System and methods for intelligent entity-wide data protection |
| US12277236B2 (en) * | 2021-10-06 | 2025-04-15 | Bank Of America Corporation | System and methods for intelligent entity-wide data protection |
Also Published As
| Publication number | Publication date |
|---|---|
| EP2227750A1 (fr) | 2010-09-15 |
| CA2745712C (fr) | 2017-03-21 |
| CA2745712A1 (fr) | 2009-06-11 |
| CA2957327A1 (fr) | 2009-06-11 |
| EP2227750A4 (fr) | 2012-06-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8176004B2 (en) | Systems and methods for intelligent paperless document management | |
| CA2745712C (fr) | Systemes et procedes pour gestion documentaire electronique et intelligente | |
| US7747495B2 (en) | Business method using the automated processing of paper and unstructured electronic documents | |
| US10783367B2 (en) | System and method for data extraction and searching | |
| US7415471B1 (en) | Methods and systems for automated data collection and analysis for use in association with asset securitization | |
| US10096064B2 (en) | Method and system for source document data entry and form association | |
| US8289541B2 (en) | Document handling | |
| AU2022301331A1 (en) | Ai-augmented auditing platform including techniques for applying a composable assurance integrity framework | |
| US9507758B2 (en) | Collaborative matter management and analysis | |
| JP6307745B2 (ja) | 会計処理システム | |
| US20020111953A1 (en) | Docketing system | |
| US20070192275A1 (en) | Automatic document exchange with archiving capability | |
| WO2013123182A1 (fr) | Systèmes et procédés mis en œuvre par ordinateur pour effectuer une révision de contrat | |
| US12067039B1 (en) | Systems and methods for providing user interfaces for configuration of a flow for extracting information from documents via a large language model | |
| CN118761735B (zh) | 一种基于深度学习模型的电子合同管理方法及系统 | |
| US20040117404A1 (en) | System for utilizing audible, visual and textual data with alternative combinable multimedia forms of presenting information for real-time interactive use by multiple users in differnet remote environments | |
| US20050210068A1 (en) | Title examination systems and methods | |
| CA2914591C (fr) | Systemes et procedes pour gestion documentaire electronique et intelligente | |
| US20240411982A1 (en) | Data extraction, verification, and field population | |
| US20220138259A1 (en) | Automated document intake system | |
| US20250103919A1 (en) | Systems and methods for applying rules via artificial intelligence for document processing | |
| Saund | Scientific challenges underlying production document processing | |
| US20050209872A1 (en) | Title quality scoring systems and methods | |
| Derrig et al. | Effective Document Review Techniques in Eclipse and Relativity | |
| JP4024267B2 (ja) | 取引先要項システム |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 07865320; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | WWE | WIPO information: entry into national phase | Ref document number: 2007865320; Country of ref document: EP |
| | WWE | WIPO information: entry into national phase | Ref document number: 2745712; Country of ref document: CA |