CA2500441A1 - Scannable form, system and method for image alignment and identification - Google Patents
Scannable form, system and method for image alignment and identification Download PDFInfo
- Publication number
- CA2500441A1 CA2500441A1 CA002500441A CA2500441A CA2500441A1 CA 2500441 A1 CA2500441 A1 CA 2500441A1 CA 002500441 A CA002500441 A CA 002500441A CA 2500441 A CA2500441 A CA 2500441A CA 2500441 A1 CA2500441 A1 CA 2500441A1
- Authority
- CA
- Canada
- Prior art keywords
- symbols
- symbol
- digitally encoded
- alignment
- template
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/24—Aligning, centring, orientation detection or correction of the image
- G06V10/245—Aligning, centring, orientation detection or correction of the image by locating a pattern; Special marks for positioning
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Character Input (AREA)
Abstract
A scannable form, and a method and system of use which incorporates digitally encoded symbols which have identifiable centers that act as control points to permit a digital image to be aligned with a template containing the same control points. The digital encoding permits identification of the type of form and the spatial arrangement of the symbols permits alignment of the form with the template. Preferably, the digital symbols are geometrically symmetrical and the identifiable centers are centroids. Recognition of any three centroids is sufficient to allow the system to calculate transformation parameters to realign the image. Due to the geometrical symmetry, error correction is possible if any of the symbols are incorrectly identified.
Description
1 "SCANNABLE FORM, SYSTEM AND
2 METHOD FOR IMAGE ALIGNMENT AND IDENTIFICATION"
3
4 FIELD OF THE INVENTION
The invention relates to the identification and alignment of scanned 6 images. More particularly, the images contain graphic symbols encoding 7 information related to the form and provide indicators for alignment with a known 8 template for analyzing the data thereon.
BACKGROUND OF THE INVENTION
11 Many answer sheet marking and form systems employ pre-printed 12 forms having timing systems which are read and interpreted by specialized 13 scanners.
14 Other recognition and marking systems use conventional digital scanners. One such system is described in: US Patent 5,936,225 to Arning 16 which teaches a system for identifying a form in an image by comparing vertical 17 and horizontal histograms of the image within the image file to vertical and 18 horizontal histograms of prototype form images in a prototype library. The 19 comparison is achieved using a processor and a form recognition engine.
Arning teaches that if a form is skewed or contains unwanted borders and the like, there 21 may possibly be a deskewing or cropping of the image. Arcing however does not 22 teach nor contemplate a means for achieving a realignment of the form.
Should 23 a form be unrecognizable, a secondary recognition phase is focused on 24 specified areas of the form. Arrdng does not contemplate large rotations of the images nor does he contemplate erroneous image capture.
1 In another system taught in US Patent 6,695,216 to Apperson, the 2 system utilizes a mark read scanner for reading forms having graphic switches 3 printed near the lead edge of the form. Each graphic switch or quad switch has 4 four distinct settings which can represent four different characteristics or positions and which is used to identity the family of forms from which the form is 6 derived. The switches utilize black and white designations to allow for binary or 7 quadruple interpretation of marks. The markings on the form also include timing 8 marks which are disposed along one edge of the form and which are associated 9 with an equal number of rows of "bubbles" or marking areas. A skew detector block is provided at or near an opposite edge of the form. If the leader block, 11 timing marks and skew detector block are not detected within a specified period 12 of time, it is assumed that the form was skewed or improperly positioned in the 13 scanner and it is rejected as erroneous and must be re-scanned.
14 Ideally, what is required is a form that can be scanned using a conventional digital scanner and the image data analyzed to determine the 16 template against which the form is to be compared. Further, it is ideal that the 17 analysis determine whether the image is skewed and then realign the data for 18 comparison with the template without the need to reacquire the image.
2 Figure 1 illustrates an array of sample graphical symbols of one 3 embodiment of the invention, each symbol having a size of 4x4 that can encode 4 values from 0 to 64. Symbols are surrounded by a black rectangle that has the same line width as a square in the symbol;
6 Figure 2 illustrates a sample sheet that can be scanned as an 7 image that employs an embodiment of the invention uses at least two sets of 8 three symbols each, each symbol selected from the array of symbols set forth in 9 Fig. 1; and Figure 3 illustrates sample graphical symbols of a size 5x5 that can 11 encode values from 0 to 626.
2 The scannable form, methodology and system of use set forth 3 herein is a novel approach to image alignment and identification. Digitally 4 encoded symbols having centers which define control points are arranged spatially on a form. Following capture of a digital image of the form, the control 6 points are compared to pre-determined control points on a known template of the 7 form. The digital encoding provides information regarding the type of form and 8 the known template while the spatial arrangement permits alignment of the 9 image to the known template. The integrated alignment and identification scheme combines sheet alignment and identification in a single and compact 11 framework.
12 At least three symbols are spatially arranged on the form and 13 recognition of at least three control points triggers calculation of transformation 14 parameters to align the control points and the image with the pre-determined control points on the template to permit sheet position, rotation and shearing 16 correction.
17 In one broad aspect of the invention a scannable form having at 18 least one answer response area comprises a generally rectangular sheet; and at 19 least three digitally encoded symbols, each encoded symbol having an identifiable center for forming a control point, the symbols being printed on the 21 sheet in a spatial arrangement relative to the at least one answer response area, 22 wherein when the form is scanned as an image, the control points in the image 23 are compared to pre-determined control points in a known template for 24 determining an alignment of the image with the known template.
1 In another broad aspect of the invention, a method for image 2 alignment and recognition of a scannable form comprises: providing a plurality of 3 digitally encoded symbols, each symbol encoding a unique number or character, 4 and having an identifiable center for forming a control point; positioning at least three of the plurality of digitally encoded symbols on the scannable form in a 6 spatial arrangement; generating a digital image of the scannable form including 7 the spatial arrangement; identifying a known template having pre-determined 8 control points for comparison of the scannable form thereto; and determining the 9 spatial arrangement of the control points in the digital image of at least three of the at least three digitally encoded symbols for determining alignment of the 11 digital image compared to pre-determined control points of the known template.
12 A system for preparing, administering and marking an examination 13 comprising: creating an examination having a plurality of user-defined questions;
14 selecting an answer sheet template from a plurality of customizable templates and customizing the template to contain markings representative of the answers 16 to the plurality of user-defined questions; printing a plurality of examination 17 response sheets using the selected answer sheet template and having at least 18 three digitally encoded symbols printed thereon, each symbol encoding a unique 19 number or character for identification of the template, and each symbol having a center wherein the at least three digitally encoded symbols form at least three 21 control points, the at least three control points being positioned on the plurality of 22 response sheets in a spatial arrangement to permit recognition of an alignment 23 of each of the printed answer sheets; administering the examination, each exam 24 answer sheet being marked at predetermined locations to indicate an answer to each of the plurality of user-defined questions; scanning each of the plurality of
The invention relates to the identification and alignment of scanned 6 images. More particularly, the images contain graphic symbols encoding 7 information related to the form and provide indicators for alignment with a known 8 template for analyzing the data thereon.
BACKGROUND OF THE INVENTION
11 Many answer sheet marking and form systems employ pre-printed 12 forms having timing systems which are read and interpreted by specialized 13 scanners.
14 Other recognition and marking systems use conventional digital scanners. One such system is described in: US Patent 5,936,225 to Arning 16 which teaches a system for identifying a form in an image by comparing vertical 17 and horizontal histograms of the image within the image file to vertical and 18 horizontal histograms of prototype form images in a prototype library. The 19 comparison is achieved using a processor and a form recognition engine.
Arning teaches that if a form is skewed or contains unwanted borders and the like, there 21 may possibly be a deskewing or cropping of the image. Arcing however does not 22 teach nor contemplate a means for achieving a realignment of the form.
Should 23 a form be unrecognizable, a secondary recognition phase is focused on 24 specified areas of the form. Arrdng does not contemplate large rotations of the images nor does he contemplate erroneous image capture.
1 In another system taught in US Patent 6,695,216 to Apperson, the 2 system utilizes a mark read scanner for reading forms having graphic switches 3 printed near the lead edge of the form. Each graphic switch or quad switch has 4 four distinct settings which can represent four different characteristics or positions and which is used to identity the family of forms from which the form is 6 derived. The switches utilize black and white designations to allow for binary or 7 quadruple interpretation of marks. The markings on the form also include timing 8 marks which are disposed along one edge of the form and which are associated 9 with an equal number of rows of "bubbles" or marking areas. A skew detector block is provided at or near an opposite edge of the form. If the leader block, 11 timing marks and skew detector block are not detected within a specified period 12 of time, it is assumed that the form was skewed or improperly positioned in the 13 scanner and it is rejected as erroneous and must be re-scanned.
14 Ideally, what is required is a form that can be scanned using a conventional digital scanner and the image data analyzed to determine the 16 template against which the form is to be compared. Further, it is ideal that the 17 analysis determine whether the image is skewed and then realign the data for 18 comparison with the template without the need to reacquire the image.
2 Figure 1 illustrates an array of sample graphical symbols of one 3 embodiment of the invention, each symbol having a size of 4x4 that can encode 4 values from 0 to 64. Symbols are surrounded by a black rectangle that has the same line width as a square in the symbol;
6 Figure 2 illustrates a sample sheet that can be scanned as an 7 image that employs an embodiment of the invention uses at least two sets of 8 three symbols each, each symbol selected from the array of symbols set forth in 9 Fig. 1; and Figure 3 illustrates sample graphical symbols of a size 5x5 that can 11 encode values from 0 to 626.
2 The scannable form, methodology and system of use set forth 3 herein is a novel approach to image alignment and identification. Digitally 4 encoded symbols having centers which define control points are arranged spatially on a form. Following capture of a digital image of the form, the control 6 points are compared to pre-determined control points on a known template of the 7 form. The digital encoding provides information regarding the type of form and 8 the known template while the spatial arrangement permits alignment of the 9 image to the known template. The integrated alignment and identification scheme combines sheet alignment and identification in a single and compact 11 framework.
12 At least three symbols are spatially arranged on the form and 13 recognition of at least three control points triggers calculation of transformation 14 parameters to align the control points and the image with the pre-determined control points on the template to permit sheet position, rotation and shearing 16 correction.
17 In one broad aspect of the invention a scannable form having at 18 least one answer response area comprises a generally rectangular sheet; and at 19 least three digitally encoded symbols, each encoded symbol having an identifiable center for forming a control point, the symbols being printed on the 21 sheet in a spatial arrangement relative to the at least one answer response area, 22 wherein when the form is scanned as an image, the control points in the image 23 are compared to pre-determined control points in a known template for 24 determining an alignment of the image with the known template.
1 In another broad aspect of the invention, a method for image 2 alignment and recognition of a scannable form comprises: providing a plurality of 3 digitally encoded symbols, each symbol encoding a unique number or character, 4 and having an identifiable center for forming a control point; positioning at least three of the plurality of digitally encoded symbols on the scannable form in a 6 spatial arrangement; generating a digital image of the scannable form including 7 the spatial arrangement; identifying a known template having pre-determined 8 control points for comparison of the scannable form thereto; and determining the 9 spatial arrangement of the control points in the digital image of at least three of the at least three digitally encoded symbols for determining alignment of the 11 digital image compared to pre-determined control points of the known template.
12 A system for preparing, administering and marking an examination 13 comprising: creating an examination having a plurality of user-defined questions;
14 selecting an answer sheet template from a plurality of customizable templates and customizing the template to contain markings representative of the answers 16 to the plurality of user-defined questions; printing a plurality of examination 17 response sheets using the selected answer sheet template and having at least 18 three digitally encoded symbols printed thereon, each symbol encoding a unique 19 number or character for identification of the template, and each symbol having a center wherein the at least three digitally encoded symbols form at least three 21 control points, the at least three control points being positioned on the plurality of 22 response sheets in a spatial arrangement to permit recognition of an alignment 23 of each of the printed answer sheets; administering the examination, each exam 24 answer sheet being marked at predetermined locations to indicate an answer to each of the plurality of user-defined questions; scanning each of the plurality of
5 1 answer sheets for forming a scanned digital image; processing the scanned 2 digital images for determining a location of the at least three control points for 3 aligning each answer sheet with the known template; and if aligned, comparing 4 the scanned digital image to the template for matching markings thereon for scoring the answer sheet.
6 Preferably, the symbols are geometrically symmetrical and have a
7 fixed center of mass or centroid. The geometrical symmetry permits error $ correction should a symbol be incorrectly decoded. Adjacent symbols have a first 9 hamming distance and non-adjacent symbols have a second fixed hamming distance. Determining the position of the symbol relative to the other symbols on 11 the image permits the system to predict the identity of the misidentified symbol.
12 Further, when operated in a batch mode, the system can utilize data from 13 previous scanned forms to assist in the prediction.
14 In one embodiment, the methodology is applied to the interpretation of scanned forms or examination marking sheets. Identification 16 and alignment can be applied however to any image incorporating symbols as 17 described in embodiments of the present invention arranged in a spatial 18 arrangement in the image.
1 DETAILED DESCRIPTION OF THE PRF_FERRED EMBODIMENT
2 For convenience, embodiments of the invention are described 3 herein in one possible context of images obtained from scanned forms, such as 4 examination bubble-type answer forms on which students have marked test answers. Once the type of form is recognized, the form image is aligned for 6 comparison to a known and corresponding template and response data fields 7 are located therein using embodiments of the invention, known methodologies
12 Further, when operated in a batch mode, the system can utilize data from 13 previous scanned forms to assist in the prediction.
14 In one embodiment, the methodology is applied to the interpretation of scanned forms or examination marking sheets. Identification 16 and alignment can be applied however to any image incorporating symbols as 17 described in embodiments of the present invention arranged in a spatial 18 arrangement in the image.
1 DETAILED DESCRIPTION OF THE PRF_FERRED EMBODIMENT
2 For convenience, embodiments of the invention are described 3 herein in one possible context of images obtained from scanned forms, such as 4 examination bubble-type answer forms on which students have marked test answers. Once the type of form is recognized, the form image is aligned for 6 comparison to a known and corresponding template and response data fields 7 are located therein using embodiments of the invention, known methodologies
8 are available for extracting markings representative of data input placed onto the
9 form, such as answers on a test sheet.
12 As shown in Figs. 1 and 3, a plurality of symbols are provided 13 which assist in determining the alignment of the image and realigning the image 14 to a known template having pre-determined control points, if required and may be used to identify the form type and known template to which the form is to be 16 compared. Optionally, a user may provide the system with the identity of the 17 known template manually, such as in a case where symbols encoding template 18 information are not recognized properly. In this case the symbols on the form are 19 used solely for alignment purposes.
Each symbol contains unique encoded data in a matrix of data 21 modules, each data module representing a data bit. The data modules are either 22 light, preferably white, or are dark, preferably black. A border, formed about 23 each symbol's matrix, has a line width equivalent to the width of a data module.
24 As shown in Fig. 1, symbols may utilize a 4X4 matrix which can encode 65 1 unique values. As shown in Fig. 3, symbols may utilize a 5X5 matrix which can 2 encode 627 unique values.
3 Each white and black square in a symbol represents a bit, 0 or 1, 4 respectively, permitting complete digital encoding. For example, the first symbol in Fig. 1 has a representation, in bits, of 0111 1111 1111 1110. More 6 information on various forms of symbols can be found in the standards defined 7 by ANSI/AIM BC11, International Symbology Specification called "Data Matrix".
8 Data Matrix is a two-dimensional matrix symbology containing the dark and light 9 square data modules. In conventional use, Data Matrix is designed with a fixed level of error correction capability.
11 Each of the symbols in Figs 1 and 3 has an identifiable center 12 which acts as a control point to aid in determining the alignment of the scanned 13 image. Preferably, the symbols are geometrically symmetrical having a fixed 14 center of mass, or centroid. As a result of the geometric symmetry, regardless the alignment of the form when scanned, the unique encoded data is 16 recognizable and is not corrupted as a result of sheet misalignment. The 17 symbols are thus invariant in theory and stable in practice with respect to any 18 affine transformation required to correct for the misalignment, such as shift, 19 rotation and shearing, and to bring the image into alignment for comparison to the template.
21 An affine transformation is a geometrical transformation that can be 22 precisely modeled as:
23 ~'_~'. (1) 24 In the matrix form, it looks like the following x' a b c x 1 y' = d a f y (2) 2 A is a 3x3 nonsingular transformation matrix and has only 6 free 3 parameters. X represents the coordinates of a point before transformation and Y
4 represents the transformed coordinates of the same point.
The symmetry of the symbols greatly improves signal to noise ratio 6 (SNR) assuming noise is random, as it is highly unlikely that a random noise 7 would contaminate a symbol in a symmetric way. Digital encoding makes the 8 symbols highly resistant to random noises which are prevalent in image 9 scanning.
A set of three 4X4 symbols can encode up to 274,625 (653) 11 templates. 1n a commercial situation wherein a provider supplies particular 12 templates to an end user, the provider can reserve a block of symbols for the 13 sheets or templates created only by the provider, leaving a plethora of unique 14 symbols for other uses by the end user.
17 As shown in Fig. 2, each of the plurality of symbols printed on the 18 form serves as a control point in determining the alignment of the image and in 19 calculating transformation parameters utilized in re-alignment of the image for comparison to the template, if required. Embodiments of the invention provide a 21 space-efficient system easily placed on any image or form.
1 The positions of the symbols on the form aid in minimizing errors 2 caused by non-uniform image formation, which is a common error resulting from 3 scanning speed variation.
4 After a form has been identified, the centers or centroids of the symbols are calculated. Each symbol or control point detected is matched with a 6 symbol or control point on the known template. A minimum of three or more 7 matching symbols or control points is required to calculate the transformation 8 parameters (a, b, c, d, e, f as seen in Formula (1), matrix (2)), which are 9 optimally estimated using a least square criterion, n 1o min ~ ape - P~)2 (3) 11 where pe is the estimated aligned position, 12 p, is the detected position from the detected template.
13 The pe's are calculated using formula (2) as noted above, and 14 therefore criterion (3) can be expanded out in terms of transformation parameters a, b, c, d, e, f. This is an often used and very effect way to best 16 estimate transformation parameters. Once transformation parameters are 17 calculated, a direct geometrical transformation modeled by equation (1) is 18 pertormed to transform a sheet image to align exactly with the detected 19 template.
Embodiments of the methodology of the invention use at least 21 three symbols in an image for providing the minimum data necessary for affine 22 transformation for alignment with the known template. Typically therefore, a first 1 set of three symbols is selected from the plurality of symbols and is used to 2 encode the form.
3 Preferably, the form further comprises a second, redundant set of 4 symbols, to provide robustness to the system, each set being equally useful and effective in providing information for alignment of the image. The second 6 redundant set of symbols comprises at least two symbols and in combination 7 with the first set must provide a cumulative number equaling three symbols.
The 8 second redundant set of symbols preferably comprises symbols which 9 correspond with the first set of symbols, for example, each of the three symbols of the first set having a companion symbol, arranged in pairs. Preferably, the 11 second redundant set of symbols comprises substantially identical symbols to 12 the first set of symbols and the matching symbols from each pair of symbols is 13 placed at roughly the same position on the form, allowing the system to detect 14 which symbol may be missing in the case of a scanning error.
Use of two fully redundant sets of symbols permits alignment 16 regardless of an enormously high error rate. Errors are non-recoverable only if 17 both symbols in a pair of symbols are missed or erroneously detected, which is 18 highly unlikely considering that given the spatial arrangement symbols within a 19 pair of symbols are positioned geometrically far apart. Although the symbols are redundant in terms of encoding, they are not redundant at all in terms of helping 21 sheet alignment.
22 The system is designed to detect those symbols which may be 23 missing as a result of burst errors typically caused by disruptions in the collection 24 of data from the form during scanning. Burst errors might also be caused by such events as scratches or additional markings on the form and the like.
1 In the case where a symbol's encoded value is interpreted 2 incorrectly, that is the symbol is interpreted to be another symbol, sheet 3 alignment is still possible using the position of the symbol as the symmetry of the 4 symbol is corrupted in a symmetrical manner. In the case of unsuccessful initial decoding of the symbol, the system would guess or predict what is most likely to 6 be the correct interpretation. The symbols are designed and arranged in a 7 spatial array to ensure that adjacent symbols, such as symbols representing 8 consecutive encoded values as shown in Fig. 1, have a first fixed hamming 9 distance of 2 and non-adjacent symbols have a second hamming distance of at least 4. The hamming distance between two symbols is simply the number of 11 bits that are different, for example 0110 and 1010 have a hamming distance of 2.
12 The system selects the symbol that has the least hamming distance with the 13 symbol decoding and does further image analysis to determine what is the most 14 likely value.
Further, when processing of forms is operated in a batch mode and 16 in case of error, the system takes advantage of the data on the previously 17 recognized forms to predict that what follows is very likely to be the same as 18 what has already been processed. That is, the program analyzing the image can 19 predict the erroneously detected or non-detected symbol from a subset of the plurality of symbols from the neighboring forms, instead of from the whole 21 symbol set, to compute the hamming distance with the symbol decoding. In this 22 case, if the neighboring symbols do not provide a symbol having the appropriate 23 hamming distance, the whole symbol set is used. Use of hamming distance aids 24 in efficient operation and allows the program to efficiently guess what might be the correct values.
2 Having reference again to Fig. 2, a form according to an 3 embodiment of the invention is shown, illustrating the use of a set of three 4 geometrically symmetric symbols and a redundant set of three identical geometrically symmetric symbols. The first set of symbols are identified as A, B
6 and C. The redundant set of symbols is identified as having corresponding 7 symbols A', B' and C'. Symbols A and A' are a pair, as are B and B' and C
and 8 C'. The hamming distances, comparing adjacent and non-adjacent symbols is 9 represented in Table A.
TABLE A
Ad'acent 1000 0000 0001 = 2 Ad'acent 0100 0000 0010 = 2 Non-ad'acent1100 0000 0011 = 4 12 In the embodiment shown in Fig. 2, a primary or first set of three 13 unique 2D symbols is provided in an upper portion of the form. Two of the 14 unique symbols A, B are spaced horizontally along the top margin and one C
is placed in the left margin. A second backup or redundant set of identical symbols 16 is provided in a lower portion of the form, forming pairs of symbols when 17 combined with the first set of symbols. Two of the redundant symbols A', B' are 18 positioned in the bottom margin and the third redundant symbol C' is positioned 19 in the left margin, the identical symbols of each pair being positioned at roughly the same horizontal position on the form, referenced from the left or right margin.
1 The top two symbols A,B roughly represent the middle and the 2 right end of the sheet, horizontally. The two left symbols C, C' roughly represent 3 one third to the middle of the sheet, vertically. The bottom two symbols A', B' 4 represent the middle and the right end of sheet, horizontally, at the bottom. For the embodiment illustrated, the spatial arrangement of symbols is more robust 6 than other spatial arrangements, such as positioning symbols in all four corners, 7 because the positions of the symbols closely represent the geometrical 8 distribution of the answer bubbles, as shown on the form in Fig. 2, and allows a 9 program to use a high level algorithm, such as the least sum of squared distances shown in Formula (2) to better estimate the transformation parameters 11 for sheet alignment.
12 Symbols from both the first and the second redundant set have 13 fixed individual, as well as set encoding rules In the example shown in Fig. 2, 14 A,B, and C represent symbols 12, 11 and 10, respectively, according to Fig.
1, where the first symbol is 0. Together ABC represents a value of 51,425 (12x652 16 + 11 x65 + 10). Any three individual symbols, including symbols from the first and 17 second redundant sets can be employed in estimating the transformation 18 parameters.
19 Each of the 2D symbols represents a digitally encoded value using 2 or more high contrast designations. Once the form is identified, all the relevant 21 information associated with the form is dynamically loaded into the system from 22 a database. The relevant information can be, but is not limited to, the relative 23 positions of the symbols, answer bubble positions, bubble group rules, and 24 output representation format.
1 The redundancy or backup sets of symbols are optional and may 2 be eliminated in situations where a high identification rate is not required. In this 3 case, the first and only set of symbols should consist of at least three symbols to 4 permit full affine transformation alignment.
An exact image of every form or exam sheet can be created using 6 any digital scanner and the image can be easily stored in electronic format.
This 7 eliminates the need for paper copies of exams and storage associated therewith.
8 A typical implementation in a school examination scenario includes: creating the 9 exam choosing from a set of customizable templates, printing the exam having symbols, arranged on the form according to embodiments of the invention, 11 thereon, the exam being economically printed on plain paper with any laser 12 printer, administering the exam allowing for the use of virtually any pencil or pen 13 for marking, scanning the exams with a digital scanner storing them as images, 14 processing the scanned images using the current invention for accurately identifying, aligning the exams and therefore correctly recording the exam 16 answers, and scoring the exams and generate reports for students and teachers.
12 As shown in Figs. 1 and 3, a plurality of symbols are provided 13 which assist in determining the alignment of the image and realigning the image 14 to a known template having pre-determined control points, if required and may be used to identify the form type and known template to which the form is to be 16 compared. Optionally, a user may provide the system with the identity of the 17 known template manually, such as in a case where symbols encoding template 18 information are not recognized properly. In this case the symbols on the form are 19 used solely for alignment purposes.
Each symbol contains unique encoded data in a matrix of data 21 modules, each data module representing a data bit. The data modules are either 22 light, preferably white, or are dark, preferably black. A border, formed about 23 each symbol's matrix, has a line width equivalent to the width of a data module.
24 As shown in Fig. 1, symbols may utilize a 4X4 matrix which can encode 65 1 unique values. As shown in Fig. 3, symbols may utilize a 5X5 matrix which can 2 encode 627 unique values.
3 Each white and black square in a symbol represents a bit, 0 or 1, 4 respectively, permitting complete digital encoding. For example, the first symbol in Fig. 1 has a representation, in bits, of 0111 1111 1111 1110. More 6 information on various forms of symbols can be found in the standards defined 7 by ANSI/AIM BC11, International Symbology Specification called "Data Matrix".
8 Data Matrix is a two-dimensional matrix symbology containing the dark and light 9 square data modules. In conventional use, Data Matrix is designed with a fixed level of error correction capability.
11 Each of the symbols in Figs 1 and 3 has an identifiable center 12 which acts as a control point to aid in determining the alignment of the scanned 13 image. Preferably, the symbols are geometrically symmetrical having a fixed 14 center of mass, or centroid. As a result of the geometric symmetry, regardless the alignment of the form when scanned, the unique encoded data is 16 recognizable and is not corrupted as a result of sheet misalignment. The 17 symbols are thus invariant in theory and stable in practice with respect to any 18 affine transformation required to correct for the misalignment, such as shift, 19 rotation and shearing, and to bring the image into alignment for comparison to the template.
21 An affine transformation is a geometrical transformation that can be 22 precisely modeled as:
23 ~'_~'. (1) 24 In the matrix form, it looks like the following x' a b c x 1 y' = d a f y (2) 2 A is a 3x3 nonsingular transformation matrix and has only 6 free 3 parameters. X represents the coordinates of a point before transformation and Y
4 represents the transformed coordinates of the same point.
The symmetry of the symbols greatly improves signal to noise ratio 6 (SNR) assuming noise is random, as it is highly unlikely that a random noise 7 would contaminate a symbol in a symmetric way. Digital encoding makes the 8 symbols highly resistant to random noises which are prevalent in image 9 scanning.
A set of three 4X4 symbols can encode up to 274,625 (653) 11 templates. 1n a commercial situation wherein a provider supplies particular 12 templates to an end user, the provider can reserve a block of symbols for the 13 sheets or templates created only by the provider, leaving a plethora of unique 14 symbols for other uses by the end user.
17 As shown in Fig. 2, each of the plurality of symbols printed on the 18 form serves as a control point in determining the alignment of the image and in 19 calculating transformation parameters utilized in re-alignment of the image for comparison to the template, if required. Embodiments of the invention provide a 21 space-efficient system easily placed on any image or form.
1 The positions of the symbols on the form aid in minimizing errors 2 caused by non-uniform image formation, which is a common error resulting from 3 scanning speed variation.
4 After a form has been identified, the centers or centroids of the symbols are calculated. Each symbol or control point detected is matched with a 6 symbol or control point on the known template. A minimum of three or more 7 matching symbols or control points is required to calculate the transformation 8 parameters (a, b, c, d, e, f as seen in Formula (1), matrix (2)), which are 9 optimally estimated using a least square criterion, n 1o min ~ ape - P~)2 (3) 11 where pe is the estimated aligned position, 12 p, is the detected position from the detected template.
13 The pe's are calculated using formula (2) as noted above, and 14 therefore criterion (3) can be expanded out in terms of transformation parameters a, b, c, d, e, f. This is an often used and very effect way to best 16 estimate transformation parameters. Once transformation parameters are 17 calculated, a direct geometrical transformation modeled by equation (1) is 18 pertormed to transform a sheet image to align exactly with the detected 19 template.
Embodiments of the methodology of the invention use at least 21 three symbols in an image for providing the minimum data necessary for affine 22 transformation for alignment with the known template. Typically therefore, a first 1 set of three symbols is selected from the plurality of symbols and is used to 2 encode the form.
3 Preferably, the form further comprises a second, redundant set of 4 symbols, to provide robustness to the system, each set being equally useful and effective in providing information for alignment of the image. The second 6 redundant set of symbols comprises at least two symbols and in combination 7 with the first set must provide a cumulative number equaling three symbols.
The 8 second redundant set of symbols preferably comprises symbols which 9 correspond with the first set of symbols, for example, each of the three symbols of the first set having a companion symbol, arranged in pairs. Preferably, the 11 second redundant set of symbols comprises substantially identical symbols to 12 the first set of symbols and the matching symbols from each pair of symbols is 13 placed at roughly the same position on the form, allowing the system to detect 14 which symbol may be missing in the case of a scanning error.
Use of two fully redundant sets of symbols permits alignment 16 regardless of an enormously high error rate. Errors are non-recoverable only if 17 both symbols in a pair of symbols are missed or erroneously detected, which is 18 highly unlikely considering that given the spatial arrangement symbols within a 19 pair of symbols are positioned geometrically far apart. Although the symbols are redundant in terms of encoding, they are not redundant at all in terms of helping 21 sheet alignment.
22 The system is designed to detect those symbols which may be 23 missing as a result of burst errors typically caused by disruptions in the collection 24 of data from the form during scanning. Burst errors might also be caused by such events as scratches or additional markings on the form and the like.
1 In the case where a symbol's encoded value is interpreted 2 incorrectly, that is the symbol is interpreted to be another symbol, sheet 3 alignment is still possible using the position of the symbol as the symmetry of the 4 symbol is corrupted in a symmetrical manner. In the case of unsuccessful initial decoding of the symbol, the system would guess or predict what is most likely to 6 be the correct interpretation. The symbols are designed and arranged in a 7 spatial array to ensure that adjacent symbols, such as symbols representing 8 consecutive encoded values as shown in Fig. 1, have a first fixed hamming 9 distance of 2 and non-adjacent symbols have a second hamming distance of at least 4. The hamming distance between two symbols is simply the number of 11 bits that are different, for example 0110 and 1010 have a hamming distance of 2.
12 The system selects the symbol that has the least hamming distance with the 13 symbol decoding and does further image analysis to determine what is the most 14 likely value.
Further, when processing of forms is operated in a batch mode and 16 in case of error, the system takes advantage of the data on the previously 17 recognized forms to predict that what follows is very likely to be the same as 18 what has already been processed. That is, the program analyzing the image can 19 predict the erroneously detected or non-detected symbol from a subset of the plurality of symbols from the neighboring forms, instead of from the whole 21 symbol set, to compute the hamming distance with the symbol decoding. In this 22 case, if the neighboring symbols do not provide a symbol having the appropriate 23 hamming distance, the whole symbol set is used. Use of hamming distance aids 24 in efficient operation and allows the program to efficiently guess what might be the correct values.
2 Having reference again to Fig. 2, a form according to an 3 embodiment of the invention is shown, illustrating the use of a set of three 4 geometrically symmetric symbols and a redundant set of three identical geometrically symmetric symbols. The first set of symbols are identified as A, B
6 and C. The redundant set of symbols is identified as having corresponding 7 symbols A', B' and C'. Symbols A and A' are a pair, as are B and B' and C
and 8 C'. The hamming distances, comparing adjacent and non-adjacent symbols is 9 represented in Table A.
TABLE A
Ad'acent 1000 0000 0001 = 2 Ad'acent 0100 0000 0010 = 2 Non-ad'acent1100 0000 0011 = 4 12 In the embodiment shown in Fig. 2, a primary or first set of three 13 unique 2D symbols is provided in an upper portion of the form. Two of the 14 unique symbols A, B are spaced horizontally along the top margin and one C
is placed in the left margin. A second backup or redundant set of identical symbols 16 is provided in a lower portion of the form, forming pairs of symbols when 17 combined with the first set of symbols. Two of the redundant symbols A', B' are 18 positioned in the bottom margin and the third redundant symbol C' is positioned 19 in the left margin, the identical symbols of each pair being positioned at roughly the same horizontal position on the form, referenced from the left or right margin.
1 The top two symbols A,B roughly represent the middle and the 2 right end of the sheet, horizontally. The two left symbols C, C' roughly represent 3 one third to the middle of the sheet, vertically. The bottom two symbols A', B' 4 represent the middle and the right end of sheet, horizontally, at the bottom. For the embodiment illustrated, the spatial arrangement of symbols is more robust 6 than other spatial arrangements, such as positioning symbols in all four corners, 7 because the positions of the symbols closely represent the geometrical 8 distribution of the answer bubbles, as shown on the form in Fig. 2, and allows a 9 program to use a high level algorithm, such as the least sum of squared distances shown in Formula (2) to better estimate the transformation parameters 11 for sheet alignment.
12 Symbols from both the first and the second redundant set have 13 fixed individual, as well as set encoding rules In the example shown in Fig. 2, 14 A,B, and C represent symbols 12, 11 and 10, respectively, according to Fig.
1, where the first symbol is 0. Together ABC represents a value of 51,425 (12x652 16 + 11 x65 + 10). Any three individual symbols, including symbols from the first and 17 second redundant sets can be employed in estimating the transformation 18 parameters.
19 Each of the 2D symbols represents a digitally encoded value using 2 or more high contrast designations. Once the form is identified, all the relevant 21 information associated with the form is dynamically loaded into the system from 22 a database. The relevant information can be, but is not limited to, the relative 23 positions of the symbols, answer bubble positions, bubble group rules, and 24 output representation format.
1 The redundancy or backup sets of symbols are optional and may 2 be eliminated in situations where a high identification rate is not required. In this 3 case, the first and only set of symbols should consist of at least three symbols to 4 permit full affine transformation alignment.
An exact image of every form or exam sheet can be created using 6 any digital scanner and the image can be easily stored in electronic format.
This 7 eliminates the need for paper copies of exams and storage associated therewith.
8 A typical implementation in a school examination scenario includes: creating the 9 exam choosing from a set of customizable templates, printing the exam having symbols, arranged on the form according to embodiments of the invention, 11 thereon, the exam being economically printed on plain paper with any laser 12 printer, administering the exam allowing for the use of virtually any pencil or pen 13 for marking, scanning the exams with a digital scanner storing them as images, 14 processing the scanned images using the current invention for accurately identifying, aligning the exams and therefore correctly recording the exam 16 answers, and scoring the exams and generate reports for students and teachers.
Claims (35)
EXCLUSIVE PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS
FOLLOWS:
1. A scannable form having at least one answer response area comprising:
a generally rectangular sheet; and at least three digitally encoded symbols, each encoded symbol having an identifiable center for forming a control point, the symbols being printed on the sheet in a spatial arrangement relative to the at least one answer response area, wherein when the form is scanned as an image, the control points in the image are compared to pre-determined control points in a known template for determining an alignment of the image with the known template.
a generally rectangular sheet; and at least three digitally encoded symbols, each encoded symbol having an identifiable center for forming a control point, the symbols being printed on the sheet in a spatial arrangement relative to the at least one answer response area, wherein when the form is scanned as an image, the control points in the image are compared to pre-determined control points in a known template for determining an alignment of the image with the known template.
2. The scannable form as described in claim 1 wherein recognition of three or more of the identifiable centers of the at least three digitally encoded symbols triggers calculation of transformation parameters for alignment of the image with the known template.
3. The scannable form as described in claim 1 or 2 wherein the symbols are geometrically symmetrical and have a fixed centroid for forming the control point.
4. The scannable form as described in claim 1, 2 or 3 wherein the at least three digitally encoded symbols are a first set of the plurality of digitally encoded symbols.
5. The scannable form as described in claim 4 further comprising at least one additional redundant set of symbols.
6. The scannable form as described in claim 5 wherein the cumulative number of symbols in the first set and the at least one additional redundant set is three symbols.
7. The scannable form as described in claim 6 wherein the at least one redundant set of symbols comprises substantially identical symbols to the first set of symbols and the substantially identical symbols are arranged in pairs, each symbol of the pair being positioned at substantially the same horizontal position on the sheet.
8. A method for image alignment and recognition of a scannable form comprising:
providing a plurality of digitally encoded symbols, each symbol encoding a unique number or character, and having an identifiable center for forming a control point;
positioning at least three of the plurality of digitally encoded symbols on the scannable form in a spatial arrangement;
generating a digital image of the scannable form including the spatial arrangement;
identifying a known template having pre-determined control points for comparison of the scannable form thereto; and determining the spatial arrangement of the control points in the digital image of at least three of the at least three digitally encoded symbols for determining alignment of the digital image compared to pre-determined control points of the known template.
providing a plurality of digitally encoded symbols, each symbol encoding a unique number or character, and having an identifiable center for forming a control point;
positioning at least three of the plurality of digitally encoded symbols on the scannable form in a spatial arrangement;
generating a digital image of the scannable form including the spatial arrangement;
identifying a known template having pre-determined control points for comparison of the scannable form thereto; and determining the spatial arrangement of the control points in the digital image of at least three of the at least three digitally encoded symbols for determining alignment of the digital image compared to pre-determined control points of the known template.
9. The method as described in claim 8 wherein the digitally encoded symbols are decoded to identify the known template for comparison thereto.
10. The method as described in claim 8 wherein the template is identified manually by a user for comparison thereto.
11. The method as described in claim 8, 9 10, or 11 wherein the digitally encoded symbols are geometrically symmetrical and have a fixed centroid for forming the control point.
12. The method as described in claim 8, 9 10, or 11 wherein the at least three digitally encoded symbols are a first set of the plurality of digitally encoded symbols.
13. The method as described in claim 12 further comprising at least one additional redundant set of symbols.
14. The method as described in claim 13 wherein the cumulative number of symbols in the first set and the at least one additional redundant set is three symbols.
15. The method as described in claim 13 or 14 wherein the at least one redundant set of symbols comprises substantially identical symbols to the first set of symbols and the substantially identical symbols are arranged in pairs, each symbol of the pair being positioned at substantially the same horizontal position on the sheet.
16. The method as described in any one of claims 8 - 15 wherein each of the encoded symbols has a control point for recognition of the alignment of the digital image.
17. The method as described in any one of claims 8 - 16 wherein recognition of three or more of the at least three digitally encoded symbols triggers calculation of transformation parameters for alignment of the digital imate with the known template.
18. The method as described in any one of claims 8 - 16 wherein each symbol is a two-dimensional matrix having light and dark data modules, each module representing a data bit.
19. The method as described in claim 18 wherein the two-dimensional matrix is a 4X4 matrix.
20. The method as described in claim 18 wherein the two-dimensional matrix is a 5x5 matrix.
21. The method as described in claim 18, 19 or 20 wherein each two-dimensional matrix is contained in a border having a line width substantially equivalent to a width of an individual module within the symbol.
22. The method as described in any one of claims 8 - 21 wherein when at least a portion of a symbol encoding is erroneous and wherein the spatial arrangement comprises:
providing adjacent symbols, all adjacent symbols having a first fixed hamming distance and providing non-adjacent symbols, all on-adjacent symbols having a second hamming distance;
further comprising:
determining whether the at least a portion of the identified symbol is adjacent another symbol and should have the first fixed hamming distance or is not adjacent another symbol and should have the second fixed hamming distance; and predicting the identity of the erroneously encoded symbol by selecting a symbol from the plurality of symbols as being the symbol having the first or second fixed hamming distance.
providing adjacent symbols, all adjacent symbols having a first fixed hamming distance and providing non-adjacent symbols, all on-adjacent symbols having a second hamming distance;
further comprising:
determining whether the at least a portion of the identified symbol is adjacent another symbol and should have the first fixed hamming distance or is not adjacent another symbol and should have the second fixed hamming distance; and predicting the identity of the erroneously encoded symbol by selecting a symbol from the plurality of symbols as being the symbol having the first or second fixed hamming distance.
23. The method as described in claim 22 wherein when the sheets are recognized as a batch in a batch mode, further comprising:
predicting the identity of the erroneously encoded symbol by selecting a symbol from the symbols identified for neighbors in the batch having the least hamming distance.
predicting the identity of the erroneously encoded symbol by selecting a symbol from the symbols identified for neighbors in the batch having the least hamming distance.
24. The method as described in claim 23 further wherein a symbol having the fixed hamming distance is not found in the symbols identified for neighbours in the batch further comprising:
predicting the identity of the erroneously encoded symbol by selecting a symbol from the plurality of symbols as being the symbol having the least hamming distance.
predicting the identity of the erroneously encoded symbol by selecting a symbol from the plurality of symbols as being the symbol having the least hamming distance.
25. The method as described in claim 15 wherein when at least one symbol of the first set of symbols or the redundant set of symbols is not recognized further comprising:
predicting the identity of the unrecognized symbol from the symbol of the pair being positioned at substantially the same horizontal position on the sheet.
predicting the identity of the unrecognized symbol from the symbol of the pair being positioned at substantially the same horizontal position on the sheet.
26. A method for image alignment and recognition comprising:
providing a plurality of digitally encoded symbols, each symbol encoding a unique number or character, and being geometrically symmetrical and having a fixed centroid for forming a control point;
positioning at least three of the plurality of digitally encoded symbols on the scannable form in a spatial arrangement;
generating a digital image of the scannable form including the spatial arrangement;
identifying a known template having pre-determined control points for comparison of the scannable form thereto; and determining the spatial arrangement of the centroids of at least three of the at least three digitally encoded symbols for determining alignment of the digital image compared to the known template.
providing a plurality of digitally encoded symbols, each symbol encoding a unique number or character, and being geometrically symmetrical and having a fixed centroid for forming a control point;
positioning at least three of the plurality of digitally encoded symbols on the scannable form in a spatial arrangement;
generating a digital image of the scannable form including the spatial arrangement;
identifying a known template having pre-determined control points for comparison of the scannable form thereto; and determining the spatial arrangement of the centroids of at least three of the at least three digitally encoded symbols for determining alignment of the digital image compared to the known template.
27. The method as described in claim 26 wherein the at least three digitally encoded symbols are a first set of the plurality of digitally encoded symbols.
28. The method as described in claim 27 further comprising at least one additional redundant set of symbols.
29. The method as described in claim 28 wherein the cumulative number of symbols in the first set and the at least one additional redundant set is three symbols.
30. The method as described in claim 28 or 29 wherein the at least one redundant set of symbols comprises substantially identical symbols to the first set of symbols and the substantially identical symbols are arranged in pairs, each symbol of the pair being positioned at substantially the same horizontal position on the sheet.
31. The method as described in any one of claims 26 - 30 wherein each of the encoded symbols has a control point for recognition of the alignment of the digital image.
32. The method as described in any one of claims 26 - 31 wherein recognition of three or more of the at least three digitally encoded symbols triggers calculation of transformation parameters for alignment of the digital image with the known template.
33. A system for preparing, administering and marking an examination comprising:
creating an examination having a plurality of user-defined questions;
selecting an answer sheet template from a plurality of customizable templates and customizing the template to contain markings representative of the answers to the plurality of user-defined questions;
printing a plurality of examination response sheets using the selected answer sheet template and having at least three digitally encoded symbols printed thereon, each symbol encoding a unique number or character for identification of the template, and each symbol having a center wherein the at least three digitally encoded symbols form at least three control points, the at least three control points being positioned on the plurality of response sheets in a spatial arrangement to permit recognition of an alignment of each of the printed answer sheets;
administering the examination, each exam answer sheet being marked at predetermined locations to indicate an answer to each of the plurality of user-defined questions;
scanning each of the plurality of answer sheets for forming a scanned digital image;
processing the scanned digital images for determining a location of the at least three control points for aligning each answer sheet with the known template; and if aligned, comparing the scanned digital image to the template for matching markings thereon for scoring the answer sheet.
creating an examination having a plurality of user-defined questions;
selecting an answer sheet template from a plurality of customizable templates and customizing the template to contain markings representative of the answers to the plurality of user-defined questions;
printing a plurality of examination response sheets using the selected answer sheet template and having at least three digitally encoded symbols printed thereon, each symbol encoding a unique number or character for identification of the template, and each symbol having a center wherein the at least three digitally encoded symbols form at least three control points, the at least three control points being positioned on the plurality of response sheets in a spatial arrangement to permit recognition of an alignment of each of the printed answer sheets;
administering the examination, each exam answer sheet being marked at predetermined locations to indicate an answer to each of the plurality of user-defined questions;
scanning each of the plurality of answer sheets for forming a scanned digital image;
processing the scanned digital images for determining a location of the at least three control points for aligning each answer sheet with the known template; and if aligned, comparing the scanned digital image to the template for matching markings thereon for scoring the answer sheet.
34. The system as described in claim 33 wherein each of the symbols is geometrically symmetrical and has a fixed centroid for forming the control point.
35. The system as described in claim 34 wherein recognition of three or more of the at least three centroids of the digitally encoded symbols triggers calculation of transformation parameters for alignment of the sheet with the known template.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002500441A CA2500441A1 (en) | 2004-03-12 | 2005-03-11 | Scannable form, system and method for image alignment and identification |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2,460,888 | 2004-03-12 | ||
CA002460888A CA2460888A1 (en) | 2004-03-12 | 2004-03-12 | Method for image alignment and identification |
CA002500441A CA2500441A1 (en) | 2004-03-12 | 2005-03-11 | Scannable form, system and method for image alignment and identification |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2500441A1 true CA2500441A1 (en) | 2005-09-12 |
Family
ID=35005586
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002500441A Abandoned CA2500441A1 (en) | 2004-03-12 | 2005-03-11 | Scannable form, system and method for image alignment and identification |
Country Status (1)
Country | Link |
---|---|
CA (1) | CA2500441A1 (en) |
-
2005
- 2005-03-11 CA CA002500441A patent/CA2500441A1/en not_active Abandoned
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6742708B2 (en) | Fiducial mark patterns for graphical bar codes | |
US6895116B2 (en) | Automatically extracting graphical bar codes | |
US7999657B2 (en) | Image registration method for image comparison and document authentication | |
US6418244B2 (en) | Border-less clock free two-dimensional barcode and method for printing and reading the same | |
US6565003B1 (en) | Method for locating and reading a two-dimensional barcode | |
US10713528B2 (en) | System for determining alignment of a user-marked document and method thereof | |
US7305131B2 (en) | Extracting graphical bar codes from an input image | |
KR101648756B1 (en) | Examination paper recognition and scoring system | |
MXPA06001533A (en) | Machine readable data. | |
EP1436766B1 (en) | Graphically demodulating bar codes without foreknowledge of the original unmodulated base image | |
US20070246542A1 (en) | Document element repair | |
JP2001307014A (en) | 2D code extraction method | |
JP4594952B2 (en) | Character recognition device and character recognition method | |
EP1936535B1 (en) | Multiple barcode detection | |
US7227997B2 (en) | Image recognition apparatus, image recognition method, and image recognition program | |
US20050201639A1 (en) | Scannable form, system and method for image alignment and identification | |
US8310729B2 (en) | Device capable of adjusting two-dimensional code | |
CA2500441A1 (en) | Scannable form, system and method for image alignment and identification | |
JP2007299098A (en) | QR code reader | |
EP1237115B1 (en) | Automatic table location in documents | |
JPH0664367A (en) | Mark sheet |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Discontinued |