US7088873B2 - Bit-mapped image multi-stage analysis method - Google Patents
Bit-mapped image multi-stage analysis method Download PDFInfo
- Publication number
- US7088873B2 US7088873B2 US10/386,544 US38654403A US7088873B2 US 7088873 B2 US7088873 B2 US 7088873B2 US 38654403 A US38654403 A US 38654403A US 7088873 B2 US7088873 B2 US 7088873B2
- Authority
- US
- United States
- Prior art keywords
- analysis
- bit
- stage
- distortion
- recited
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related, expires
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/26—Techniques for post-processing, e.g. correcting the recognition result
- G06V30/262—Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the present invention relates generally to a field of bit-mapped image character recognition and, more particularly, to a method of parsing and analysis pre-process assisting character and text recognition of printed text from bit-mapped binary image or other binary or raster images inputted from scanning device or like or obtained by other way.
- Segmentation and parsing methods are known in the art. Typically, such methods divide an image into parcels containing homogeneous objects and use a plurality of special computing procedures, each depending on a plurality of parameters to analyze an object.
- the most known methods initially divide the object into a plurality of regions, each region further dividing into smaller objects—paragraphs, lines, words, characters, non-text objects etc. After that all prior methods make various pre-recognition analysis and corrections of image to improve its quality for further text recognition. Corrections may include removing distortion of various types—distortion, skew, inversion text matter, cursive character matter, undesired dots—extra or missing etc.
- Another known method supposes a single stage of image pre-recognition analysis mainly as an ordinary version, and an addition of one or more profound stages of analysis in a case if errors occurred at stage 1 (U.S. Pat. No. 5,717,794, Feb. 10, 1998).
- the incorrectness criteria in said method is the difference between the length of character string, most likely comprising a line and a resulting line, as a result of the first analysis session. But for all that no supplemental data is to be collected and used but only a repeating session with the same set of the analysis means.
- One more method of the close subject deals with detecting distorted regions on the analysed image and removing the distortion in said regions by special means. (U.S. Pat. No. 5,590,224 Dec. 31, 1996). No supplemental data is collected and utilized. This considerably reduces the accuracy of the analysis.
- An object of present invention consists in hastening the raster (bit-mapped) image analysis procedure without any loss in quality and accuracy.
- the present invention has for its object to suggest in detail the method, which can efficiently correct the errors detecting at the step of layout analysis.
- a method of bit-mapped image analysis that precedes character recognition procedures, comprises
- the present invention also presumes further outputting the results to a subsequent process, which is typically a character recognition process.
- FIGURE is a block diagram showing the list of steps to be made on analysis of bit-mapped image according to the present invention.
- a method of the present invention consists in a preliminary dividing of all plurality of analysis means that are present at one's disposal into two or more groups, differing in accuracy and reliability of results, with the further performing the multi-stage analysis.
- the starting-point of the present invention is the presumption coming from engineering practice, that the more precise is the analysis means or a computing method or a procedure, the more computing resources it require for operation. And in contrary, the simpler is the analysis means or computing procedure, the less accurate are the results obtained.
- the two or more stage analysis is performed then.
- the first stage comprises parsing the image into objects, detecting and preliminarily analyzing objects that comprises a distortion, preliminarily recognizing objects that comprises a distortion, detecting among them objects, that require more precise analysis means to overcome the distortion influence, collecting a supplemental data about objects.
- objects are regions, groups of lines, paragraphs, lines, word tokens, words and groups of characters, except as otherwise provided.
- the distortion as assumed in the present invention is any factor hindering a character and the whole text recognition process, except as otherwise provided.
- the distortion may appear as undesired dots in the image (extra or missing), cursive character or text matter, inverted text matter, a skew of the image, etc.
- the analysis means are taken from the group of the most incorrect means. Those analysis means according to the presumption require less computing resources, not so complicated and, therefore, quicker. Also a collection of all available supplemental data is provided on this stage.
- the second stage that is the profound analysis, comprises repeated parsing of image or a part of it, if required, detecting and analyzing objects with the distortion, repeated recognition of objects with the distortion, that were not recognized at the previous stage, taking into account all supplemental data obtained at the previous stage, detecting among them objects, that require more sophisticated analysis means to overcome the influence of distortion, collecting an newly appeared additional data about objects.
- the more precise analysis means are used. They are taken from the subsequent group—more precise one.
- the last step of the profound analysis stage is making a decision about the necessity of repeating the profound analysis stage, taking into account all supplemental data collected at the previous stages, or finishing the analysis process.
- the decision is made mainly depending on recognition reliability level settled and a number of groups of analysis means rest.
- the iteration process will be finished in one of the following cases:
- the FIGURE shows the order in which analyzing steps are to be performed according to the present invention.
- Blocks 1 through 3 shows the set of analysis means preliminarily divided into several groups, differing in accuracy.
- the set of the most inaccurate analysis means is marked as block 1 , the set of most accurate—as block 3 .
- an inputted bit-mapped image data ( 4 ) along with the set ( 1 ) of the most inaccurate but quick means is sent to primarily analysis. All data is being parsed into objects ( 5 ). Objects that contain any kind of distortion ( 6 ) are detected and placed to special list. Objects containing distortion are further subjected to a preliminary character recognition ( 7 ). After the preliminary character recognition, some objects ( 8 ) will be marked as requiring a more precise method of analysis in order to overcome the distortion.
- the supplemental data is collected ( 9 ).
- bit-mapped image data along with the collected supplemental data ( 9 ) and the set ( 2 ) of the more accurate means, then in the set ( 1 ), is sent to profound analysis. All data is being examined for requiring new parsing. It is parsed again whole or partly if required ( 5 ). Objects that contain distortion ( 6 ), not overpassed at the previous stage, are marked and recorded. Marked objects are then subjected the character recognition process ( 7 ) taking into account all supplemental data ( 9 ) obtained at the previous stage. After character recognition some of the objects ( 8 ) are marked as requiring a more precise method of analysis in order to overcome the distortion.
- a newly appeared supplemental data is being collected ( 9 ) during all the analyzing process.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Character Discrimination (AREA)
- Character Input (AREA)
- Image Analysis (AREA)
Abstract
Description
-
- preliminarily dividing the whole set of analysis means at the disposal into a number of groups differing in accuracy;
- at least two stage analysis, comprising
- i) a primary analysis stage, said primary analysis further comprising at least
- parsing the image into regions and objects,
- detecting among said regions and objects those with a distortion embarrassing character recognition,
- preliminary recognition of said objects with the distortion,
- detecting objects that require more sophisticated analysis methods,
- collection of supplemental data;
- said primary analysis stage using the analysis means from group of most inaccurate means;
- ii) at least one profound analysis stage, comprising
- repeated recognition of said objects with the distortion that were not recognized at the previous stage taking into account the supplemental data obtained at the previous stage,
- detecting said objects and regions, that require more sophisticated analysis methods,
- collection of newly appeared supplemental data,
- making a decision about the necessity of the profound analysis stage repeating, taking into account all said supplemental data collected at previous and current stages, or finishing the analysis process;
- said each subsequent profound analysis stage using analysis means from the group of more accurate means.
- i) a primary analysis stage, said primary analysis further comprising at least
-
- the character recognition reliability factors of all objects meet or exceed the predetermined level, or
- if more precise methods of analysis are no longer available.
-
- characters in all objects containing distortion are correctly recognized with the reliability factor that meets or exceeds the predetermined reliability value;
- the set of most precise analyzing means is applied, but failed to recognize all the objects containing distortion. More precise means of analysis are no longer available.
Claims (16)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
RU2002133897A | 2002-12-17 | ||
RU2002133897/09A RU2234734C1 (en) | 2002-12-17 | 2002-12-17 | Method for multi-stage analysis of information of bitmap image |
Publications (2)
Publication Number | Publication Date |
---|---|
US20040114802A1 US20040114802A1 (en) | 2004-06-17 |
US7088873B2 true US7088873B2 (en) | 2006-08-08 |
Family
ID=32501895
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/386,544 Expired - Fee Related US7088873B2 (en) | 2002-12-17 | 2003-03-13 | Bit-mapped image multi-stage analysis method |
Country Status (2)
Country | Link |
---|---|
US (1) | US7088873B2 (en) |
RU (1) | RU2234734C1 (en) |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080091713A1 (en) * | 2006-10-16 | 2008-04-17 | Candelore Brant L | Capture of television metadata via OCR |
US20080092045A1 (en) * | 2006-10-16 | 2008-04-17 | Candelore Brant L | Trial selection of STB remote control codes |
US20080098426A1 (en) * | 2006-10-23 | 2008-04-24 | Candelore Brant L | Decoding multiple remote control code sets |
US20080097984A1 (en) * | 2006-10-23 | 2008-04-24 | Candelore Brant L | OCR input to search engine |
US20080098433A1 (en) * | 2006-10-23 | 2008-04-24 | Hardacker Robert L | User managed internet links from TV |
US20080098432A1 (en) * | 2006-10-23 | 2008-04-24 | Hardacker Robert L | Metadata from image recognition |
US20080098357A1 (en) * | 2006-10-23 | 2008-04-24 | Candelore Brant L | Phantom information commands |
US20080199150A1 (en) * | 2007-02-14 | 2008-08-21 | Candelore Brant L | Transfer of metadata using video frames |
US20080244637A1 (en) * | 2007-03-28 | 2008-10-02 | Sony Corporation | Obtaining metadata program information during channel changes |
US20080273114A1 (en) * | 2007-05-04 | 2008-11-06 | Hardacker Robert L | STB channel reader |
US20100031173A1 (en) * | 2006-07-03 | 2010-02-04 | Ondis Pty Ltd. | Business Card Scanner |
US20100123735A1 (en) * | 2008-11-17 | 2010-05-20 | Robert Blanchard | TV screen text capture |
US20100192178A1 (en) * | 2009-01-26 | 2010-07-29 | Candelore Brant L | Capture of stylized TV table data via OCR |
US7814524B2 (en) | 2007-02-14 | 2010-10-12 | Sony Corporation | Capture of configuration and service provider data via OCR |
US8320674B2 (en) | 2008-09-03 | 2012-11-27 | Sony Corporation | Text localization for image and video OCR |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2295154C1 (en) * | 2005-06-16 | 2007-03-10 | "Аби Софтвер Лтд." | Method for recognizing text information from graphic file with usage of dictionaries and additional data |
US8249344B2 (en) | 2005-07-01 | 2012-08-21 | Microsoft Corporation | Grammatical parsing of document visual structures |
RU2309456C2 (en) * | 2005-12-08 | 2007-10-27 | "Аби Софтвер Лтд." | Method for recognizing text information in vector-raster image |
US8571262B2 (en) * | 2006-01-25 | 2013-10-29 | Abbyy Development Llc | Methods of object search and recognition |
RU2006101908A (en) * | 2006-01-25 | 2010-04-27 | Аби Софтвер Лтд. (Cy) | STRUCTURAL DESCRIPTION OF THE DOCUMENT, METHOD FOR DESCRIPTION OF THE STRUCTURE OF GRAPHIC OBJECTS AND METHODS OF THEIR RECOGNITION (OPTIONS) |
US8509563B2 (en) | 2006-02-02 | 2013-08-13 | Microsoft Corporation | Generation of documents from images |
RU2311680C1 (en) * | 2006-05-19 | 2007-11-27 | Московский государственный университет инженерной экологии | Method for automated division of pixels of raster images onto brightness level groups on basis of repeat values thereof |
RU2390843C2 (en) * | 2008-04-29 | 2010-05-27 | Александр Иванович Иванов | Character recognition method |
EP2275972B1 (en) | 2009-07-06 | 2018-11-28 | AO Kaspersky Lab | System and method for identifying text-based spam in images |
US8023697B1 (en) | 2011-03-29 | 2011-09-20 | Kaspersky Lab Zao | System and method for identifying spam in rasterized images |
RU2634192C1 (en) * | 2016-06-24 | 2017-10-24 | Общество с ограниченной ответственностью "Аби Девелопмент" | Data input from series of images applicable to template document |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4802231A (en) * | 1987-11-24 | 1989-01-31 | Elliot Davis | Pattern recognition error reduction system |
US5257323A (en) * | 1991-05-29 | 1993-10-26 | Canon Kabushiki Kaisha | Selection agent for a symbol determination system with multiple character recognition processors |
US5519786A (en) * | 1994-08-09 | 1996-05-21 | Trw Inc. | Method and apparatus for implementing a weighted voting scheme for multiple optical character recognition systems |
US5590224A (en) * | 1992-10-19 | 1996-12-31 | Fast; Bruce B. | OCR image preprocessing method for image enhancement of scanned documents by correction of registration |
US5717794A (en) * | 1993-03-17 | 1998-02-10 | Hitachi, Ltd. | Document recognition method and system |
US5881172A (en) * | 1996-12-09 | 1999-03-09 | Mitek Systems, Inc. | Hierarchical character recognition system |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5172422A (en) * | 1991-05-13 | 1992-12-15 | Eastman Kodak Company | Fast character segmentation of skewed text lines for optical character recognition |
RU2037203C1 (en) * | 1993-07-16 | 1995-06-09 | Ольга Олеговна Веровенко | Method for object identification |
EP0677818B1 (en) * | 1994-04-15 | 2000-05-10 | Canon Kabushiki Kaisha | Image pre-processor for character recognition system |
US6249605B1 (en) * | 1998-09-14 | 2001-06-19 | International Business Machines Corporation | Key character extraction and lexicon reduction for cursive text recognition |
-
2002
- 2002-12-17 RU RU2002133897/09A patent/RU2234734C1/en active IP Right Revival
-
2003
- 2003-03-13 US US10/386,544 patent/US7088873B2/en not_active Expired - Fee Related
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4802231A (en) * | 1987-11-24 | 1989-01-31 | Elliot Davis | Pattern recognition error reduction system |
US5257323A (en) * | 1991-05-29 | 1993-10-26 | Canon Kabushiki Kaisha | Selection agent for a symbol determination system with multiple character recognition processors |
US5590224A (en) * | 1992-10-19 | 1996-12-31 | Fast; Bruce B. | OCR image preprocessing method for image enhancement of scanned documents by correction of registration |
US5594815A (en) * | 1992-10-19 | 1997-01-14 | Fast; Bruce B. | OCR image preprocessing method for image enhancement of scanned documents |
US5717794A (en) * | 1993-03-17 | 1998-02-10 | Hitachi, Ltd. | Document recognition method and system |
US5519786A (en) * | 1994-08-09 | 1996-05-21 | Trw Inc. | Method and apparatus for implementing a weighted voting scheme for multiple optical character recognition systems |
US5881172A (en) * | 1996-12-09 | 1999-03-09 | Mitek Systems, Inc. | Hierarchical character recognition system |
Cited By (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100031173A1 (en) * | 2006-07-03 | 2010-02-04 | Ondis Pty Ltd. | Business Card Scanner |
US20080092045A1 (en) * | 2006-10-16 | 2008-04-17 | Candelore Brant L | Trial selection of STB remote control codes |
US20080091713A1 (en) * | 2006-10-16 | 2008-04-17 | Candelore Brant L | Capture of television metadata via OCR |
US7966552B2 (en) | 2006-10-16 | 2011-06-21 | Sony Corporation | Trial selection of STB remote control codes |
US8077263B2 (en) | 2006-10-23 | 2011-12-13 | Sony Corporation | Decoding multiple remote control code sets |
US20080097984A1 (en) * | 2006-10-23 | 2008-04-24 | Candelore Brant L | OCR input to search engine |
US20080098357A1 (en) * | 2006-10-23 | 2008-04-24 | Candelore Brant L | Phantom information commands |
US8629942B2 (en) | 2006-10-23 | 2014-01-14 | Sony Corporation | Decoding multiple remote control code sets |
US8296808B2 (en) | 2006-10-23 | 2012-10-23 | Sony Corporation | Metadata from image recognition |
US20080098426A1 (en) * | 2006-10-23 | 2008-04-24 | Candelore Brant L | Decoding multiple remote control code sets |
US20080098433A1 (en) * | 2006-10-23 | 2008-04-24 | Hardacker Robert L | User managed internet links from TV |
US7689613B2 (en) | 2006-10-23 | 2010-03-30 | Sony Corporation | OCR input to search engine |
US20080098432A1 (en) * | 2006-10-23 | 2008-04-24 | Hardacker Robert L | Metadata from image recognition |
US20080199150A1 (en) * | 2007-02-14 | 2008-08-21 | Candelore Brant L | Transfer of metadata using video frames |
US7814524B2 (en) | 2007-02-14 | 2010-10-12 | Sony Corporation | Capture of configuration and service provider data via OCR |
US9241134B2 (en) | 2007-02-14 | 2016-01-19 | Sony Corporation | Transfer of metadata using video frames |
US7991271B2 (en) | 2007-02-14 | 2011-08-02 | Sony Corporation | Transfer of metadata using video frames |
US9124922B2 (en) | 2007-02-14 | 2015-09-01 | Sony Corporation | Capture of stylized TV table data via OCR |
US20080244637A1 (en) * | 2007-03-28 | 2008-10-02 | Sony Corporation | Obtaining metadata program information during channel changes |
US8438589B2 (en) | 2007-03-28 | 2013-05-07 | Sony Corporation | Obtaining metadata program information during channel changes |
US8621498B2 (en) | 2007-03-28 | 2013-12-31 | Sony Corporation | Obtaining metadata program information during channel changes |
US20080273114A1 (en) * | 2007-05-04 | 2008-11-06 | Hardacker Robert L | STB channel reader |
US8320674B2 (en) | 2008-09-03 | 2012-11-27 | Sony Corporation | Text localization for image and video OCR |
US8035656B2 (en) | 2008-11-17 | 2011-10-11 | Sony Corporation | TV screen text capture |
US20100123735A1 (en) * | 2008-11-17 | 2010-05-20 | Robert Blanchard | TV screen text capture |
US20100192178A1 (en) * | 2009-01-26 | 2010-07-29 | Candelore Brant L | Capture of stylized TV table data via OCR |
US8763038B2 (en) | 2009-01-26 | 2014-06-24 | Sony Corporation | Capture of stylized TV table data via OCR |
Also Published As
Publication number | Publication date |
---|---|
US20040114802A1 (en) | 2004-06-17 |
RU2234734C1 (en) | 2004-08-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7088873B2 (en) | Bit-mapped image multi-stage analysis method | |
JP2822189B2 (en) | Character recognition apparatus and method | |
US7881561B2 (en) | Method of pre-analysis of a machine-readable form image | |
US7889926B2 (en) | Image dictionary creating apparatus, coding apparatus, image dictionary creating method | |
US20070133029A1 (en) | Method of recognizing text information from a vector/raster image | |
US5561720A (en) | Method for extracting individual characters from raster images of a read-in handwritten or typed character sequence having a free pitch | |
US7694216B2 (en) | Automatic assignment of field labels | |
US7095894B2 (en) | Method and computer program product for recognizing italicized text | |
US20040114803A1 (en) | Method of stricken-out character recognition in handwritten text | |
JP3730073B2 (en) | Template creation method, apparatus, and recording medium recording template creation program | |
JPH07230525A (en) | Ruled line recognition method and table processing method | |
JP2001022883A (en) | Character recognition system and recording medium for realizing functions of the character recognition system | |
JP3090070B2 (en) | Form identification method and device | |
Liang et al. | Efficient algorithms for segmentation and recognition of printed characters in document processing | |
JP3074691B2 (en) | Character recognition device | |
JPH09167206A (en) | Space detection method for Japanese-English mixed document, pitch format determination method, space detection method for constant pitch alphanumeric character string, and space detection method for proportional pitch alphanumeric character string | |
JP2972516B2 (en) | Print pattern inspection device for inkjet printer | |
JP4089807B2 (en) | Bar code recognition method, apparatus, and program | |
JP2746345B2 (en) | Post-processing method for character recognition | |
JPH01277989A (en) | Character string pattern reader | |
JPS6272085A (en) | Character recognizing device | |
JP4320868B2 (en) | Character reader | |
JP2890788B2 (en) | Document recognition device | |
JP2963474B2 (en) | Similar character identification method | |
JPH02230484A (en) | character recognition device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: ABBYY SOFTWARE LTD., CYPRUS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ANISIMOVICH, K.;TERESHCHENKO, V.;RYBKIN, V.;AND OTHERS;REEL/FRAME:015013/0639 Effective date: 20040215 |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
AS | Assignment |
Owner name: ABBYY DEVELOPMENT LLC, RUSSIAN FEDERATION Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ABBYY SOFTWARE LTD.;REEL/FRAME:031085/0834 Effective date: 20130823 |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
FEPP | Fee payment procedure |
Free format text: PAT HOLDER NO LONGER CLAIMS SMALL ENTITY STATUS, ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: STOL); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
CC | Certificate of correction | ||
FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.) |
|
LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20180808 |
|
AS | Assignment |
Owner name: ABBYY PRODUCTION LLC, RUSSIAN FEDERATION Free format text: MERGER;ASSIGNOR:ABBYY DEVELOPMENT LLC;REEL/FRAME:047997/0652 Effective date: 20171208 |