[go: up one dir, main page]

Chieu et al., 2002 - Google Patents

Teaching a weaker classifier: Named entity recognition on upper case text

Chieu et al., 2002

View PDF
Document ID
10568593874225941772
Author
Chieu H
Ng H
Publication year
Publication venue
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics

External Links

Snippet

This paper describes how a machinelearning named entity recognizer (NER) on upper case text can be improved by using a mixed case NER and some unlabeled text. The mixed case NER can be used to tag some unlabeled mixed case text, which are then used as additional …
Continue reading at aclanthology.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • G06F17/277Lexical analysis, e.g. tokenisation, collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • G06F17/2775Phrasal analysis, e.g. finite state techniques, chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/22Manipulating or registering by use of codes, e.g. in sequence of text characters
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2705Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2809Data driven translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2863Processing of non-latin text
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting

Similar Documents

Publication Publication Date Title
Chieu et al. Named entity recognition with a maximum entropy approach
Chieu et al. A maximum entropy approach to information extraction from semi-structured and free text
Reynar et al. A maximum entropy approach to identifying sentence boundaries
Ekbal et al. Named entity recognition in Bengali: A multi-engine approach
Murthy et al. Language identification from small text samples
Amarappa et al. Named entity recognition and classification in kannada language
Fernando et al. Comprehensive part-of-speech tag set and svm based pos tagger for sinhala
Meelen et al. Optimisation of the largest annotated Tibetan corpus combining rule-based, memory-based, and deep-learning methods
Scherrer et al. New developments in tagging pre-modern orthodox Slavic texts
Halvani et al. Natural language watermarking for german texts
Priyadarshi et al. A study on the performance of recurrent neural network based models in Maithili part of speech tagging
Chieu et al. Teaching a weaker classifier: Named entity recognition on upper case text
Mohamed et al. Arabic-SOS: segmentation, stemming, and orthography standardization for classical and pre-modern standard Arabic
Nagy Teaching a computer to read
Boisen et al. Annotating Resources for Information Extraction.
Chua et al. Learning pattern rules for Chinese named entity extraction
Algahtani Arabic named entity recognition: A corpus-based study
Mollá et al. Named entity recognition in question answering of speech data
Dao et al. Evaluating the effect of letter case on named entity recognition performance
Xia et al. Accurate Pinyin-English codeswitched language identification
Bosch et al. Memory-based morphological analysis and part-of-speech tagging of Arabic
Bergsma et al. Glen, Glenda or Glendale: Unsupervised and semi-supervised learning of English noun gender
Lyu et al. Converse attention knowledge transfer for low-resource named entity recognition
Mohammed et al. Translating Ambiguous Arabic Words Using Text Mining
Zeldes A characterwise windowed approach to Hebrew morphological segmentation