Baró et al., 2019 - Google Patents
Towards a generic unsupervised method for transcription of encoded manuscriptsBaró et al., 2019
View PDF- Document ID
- 8313424091033996085
- Author
- Baró A
- Chen J
- Fornés A
- Megyesi B
- Publication year
- Publication venue
- Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage
External Links
Snippet
Historical ciphers, a special type of manuscripts, contain encrypted information, important for the interpretation of our history. The first step towards decipherment is to transcribe the images, either manually or by automatic image processing techniques. Despite the …
- 230000035897 transcription 0 title abstract description 33
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G06K9/6807—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries
- G06K9/6842—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries according to the linguistic properties, e.g. English, German
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/2054—Selective acquisition/locating/processing of specific regions, e.g. highlighted text, fiducial marks, predetermined fields, document type identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/22—Image acquisition using hand-held instruments
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00852—Recognising whole cursive words
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/03—Detection or correction of errors, e.g. by rescanning the pattern
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/38—Quantising the analogue image signal, e.g. histogram thresholding for discrimination between background and foreground patterns
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Baviskar et al. | Efficient automated processing of the unstructured documents using artificial intelligence: A systematic literature review and future directions | |
Baró et al. | Towards a generic unsupervised method for transcription of encoded manuscripts | |
Kantharaj et al. | Opencqa: Open-ended question answering with charts | |
CN107004140B (en) | Text recognition method and computer program product | |
Xue et al. | A better way to attend: Attention with trees for video question answering | |
US11868313B1 (en) | Apparatus and method for generating an article | |
Dölek et al. | A deep learning model for Ottoman OCR | |
Lai et al. | Semeval 2022 task 12: Symlink-linking mathematical symbols to their descriptions | |
Toselli et al. | Transcribing a 17th-century botanical manuscript: Longitudinal evaluation of document layout detection and interactive transcription | |
Nguyen et al. | OCR error correction for unconstrained Vietnamese handwritten text | |
Almanea | Automatic methods and neural networks in Arabic texts diacritization: a comprehensive survey | |
Alrasheed et al. | Evaluation of deep learning techniques for content extraction in spanish colonial notary records | |
Kasem et al. | Advancements and challenges in arabic optical character recognition: A comprehensive survey | |
Yasin et al. | Transformer-based neural machine translation for post-OCR error correction in cursive text | |
Xia et al. | A sequence-to-sequence approach with mixed pointers to topic segmentation and segment labeling | |
Khan et al. | OCR approaches for humanities: Applications of artificial intelligence/machine learning on transcription and transliteration of historical documents | |
Bender et al. | Learning fine-grained image representations for mathematical expression recognition | |
Sundaram et al. | Bigram language models and reevaluation strategy for improved recognition of online handwritten Tamil words | |
Tasdemir et al. | Automatic transcription of Ottoman documents using deep learning | |
Sinclair et al. | Handwriting recognition for Scottish Gaelic | |
CN116611428B (en) | Non-autoregressive decoding Vietnam text regularization method based on editing alignment algorithm | |
Andrés et al. | Search for hyphenated words in probabilistic indices: a machine learning approach | |
Almanea | Deep learning in written arabic linguistic studies: A comprehensive survey | |
Villegas et al. | Exploiting existing modern transcripts for historical handwritten text recognition | |
Villanova-Aparisi et al. | Reading Order Independent Metrics for Information Extraction in Handwritten Documents |