Teaching a weaker classifier: Named entity recognition on upper case text
Chieu et al., 2002
- Document ID
- 10568593874225941772
- Author
- Chieu H
- Ng H
- Publication year
- 2002
- Publication venue
- Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics
Snippet
This paper describes how a machine learning named entity recognizer (NER) on upper case text can be improved by using a mixed case NER and some unlabeled text. The mixed case NER can be used to tag some unlabeled mixed case text, which is then used as additional …
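The idea in the snippet can be illustrated with a minimal Python sketch. It assumes that the automatically tagged text is converted to upper case and kept, with its labels, as extra training examples for the upper case NER; the truncated snippet does not spell this step out. The function names, the `ENT`/`O` label set, and the toy capitalization rule standing in for a trained mixed case NER are all hypothetical and are not taken from the paper.

```python
from typing import Callable, List, Tuple

Token = str
Label = str
Tagged = List[Tuple[Token, Label]]


def toy_mixed_case_tagger(tokens: List[Token]) -> Tagged:
    """Hypothetical stand-in for a trained mixed case NER.

    Marks any capitalized token that is not sentence-initial as an entity.
    A real system would use a learned model instead of this rule.
    """
    tagged = []
    for i, tok in enumerate(tokens):
        is_entity = i > 0 and tok[:1].isupper()
        tagged.append((tok, "ENT" if is_entity else "O"))
    return tagged


def make_upper_case_training_data(
    unlabeled: List[List[Token]],
    tagger: Callable[[List[Token]], Tagged],
) -> List[Tagged]:
    """Tag mixed case sentences, then upper-case the tokens and keep the labels.

    Assumption: the result is used as additional training data for an
    upper case NER, which never sees the original capitalization.
    """
    data = []
    for sentence in unlabeled:
        data.append([(tok.upper(), label) for tok, label in tagger(sentence)])
    return data


sentences = [["Shares", "of", "Acme", "rose", "in", "London"]]
extra_training_data = make_upper_case_training_data(sentences, toy_mixed_case_tagger)
print(extra_training_data)
```

The labels come from a view of the text that still has case information, while the training examples handed to the weaker model do not, which is the sense in which the stronger classifier "teaches" the weaker one.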
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/2775—Phrasal analysis, e.g. finite state techniques, chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2863—Processing of non-latin text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Chieu et al. | Named entity recognition with a maximum entropy approach | |
Chieu et al. | A maximum entropy approach to information extraction from semi-structured and free text | |
Reynar et al. | A maximum entropy approach to identifying sentence boundaries | |
Ekbal et al. | Named entity recognition in Bengali: A multi-engine approach | |
Murthy et al. | Language identification from small text samples | |
Amarappa et al. | Named entity recognition and classification in kannada language | |
Fernando et al. | Comprehensive part-of-speech tag set and svm based pos tagger for sinhala | |
Meelen et al. | Optimisation of the largest annotated Tibetan corpus combining rule-based, memory-based, and deep-learning methods | |
Scherrer et al. | New developments in tagging pre-modern orthodox Slavic texts | |
Halvani et al. | Natural language watermarking for german texts | |
Priyadarshi et al. | A study on the performance of recurrent neural network based models in Maithili part of speech tagging | |
Chieu et al. | Teaching a weaker classifier: Named entity recognition on upper case text | |
Mohamed et al. | Arabic-SOS: segmentation, stemming, and orthography standardization for classical and pre-modern standard Arabic | |
Nagy | Teaching a computer to read | |
Boisen et al. | Annotating Resources for Information Extraction. | |
Chua et al. | Learning pattern rules for Chinese named entity extraction | |
Algahtani | Arabic named entity recognition: A corpus-based study | |
Mollá et al. | Named entity recognition in question answering of speech data | |
Dao et al. | Evaluating the effect of letter case on named entity recognition performance | |
Xia et al. | Accurate Pinyin-English codeswitched language identification | |
Bosch et al. | Memory-based morphological analysis and part-of-speech tagging of Arabic | |
Bergsma et al. | Glen, Glenda or Glendale: Unsupervised and semi-supervised learning of English noun gender | |
Lyu et al. | Converse attention knowledge transfer for low-resource named entity recognition | |
Mohammed et al. | Translating Ambiguous Arabic Words Using Text Mining | |
Zeldes | A characterwise windowed approach to Hebrew morphological segmentation | |