[go: up one dir, main page]

Yarowsky, 2000 - Google Patents

Hierarchical decision lists for word sense disambiguation

Yarowsky, 2000

View PDF
Document ID
7939793316054139293
Author
Yarowsky D
Publication year
Publication venue
Computers and the Humanities

External Links

Snippet

This paper describes a supervised algorithm for word sensedisambiguation based on hierarchies of decision lists. This algorithmsupports a useful degree of conditional branching while minimizing thetraining data fragmentation typical of decision trees. Classificationsare …
Continue reading at www.coli.uni-saarland.de (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2705Parsing
    • G06F17/271Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • G06F17/277Lexical analysis, e.g. tokenisation, collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2872Rule based translation
    • G06F17/2881Natural language generation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/274Grammatical analysis; Style critique
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2795Thesaurus; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2705Parsing
    • G06F17/2715Statistical methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2809Data driven translation
    • G06F17/2827Example based machine translation; Alignment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2785Semantic analysis
    • G06F17/279Discourse representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2755Morphological analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/21Text processing
    • G06F17/22Manipulating or registering by use of codes, e.g. in sequence of text characters

Similar Documents

Publication Publication Date Title
Yarowsky Hierarchical decision lists for word sense disambiguation
Sekine et al. A corpus-based probabilistic grammar with only two non-terminals
Inkpen et al. Building and using a lexical knowledge base of near-synonym differences
Arun et al. Lexicalization in crosslinguistic probabilistic parsing: The case of French
Habash Arabic morphological representations for machine translation
Montoyo et al. Combining knowledge-and corpus-based word-sense-disambiguation methods
Mihalcea Word sense disambiguation with pattern learning and automatic feature selection
Lee et al. Syllable-pattern-based unknown-morpheme segmentation and estimation for hybrid part-of-speech tagging of Korean
Þorsteinsson et al. A wide-coverage context-free grammar for Icelandic and an accompanying parsing system
Hristea The naïve Bayes model for unsupervised word sense disambiguation: Aspects concerning feature selection
Imamura Application of translation knowledge acquired by hierarchical phrase alignment for pattern-based MT
Al-Sabbagh et al. Mining the Web for the Induction of a Dialectical Arabic Lexicon.
Naskar et al. A phrasal EBMT system for translating English to Bengali
Preotiuc-Pietro et al. Unsupervised word sense disambiguation with N-gram features
Wu et al. Parsing-based Chinese word segmentation integrating morphological and syntactic information
Flickinger The English resource grammar
Piskorski et al. Lemmatization of Polish person names
Adebara et al. Translating the unseen? yoruba-english mt in low-resource, morphologically-unmarked settings
Argaw et al. Dictionary-based Amharic-French information retrieval
Okhovvat et al. An Accurate Persian Part-of-Speech Tagger.
Hurskainen Optimizing disambiguation in Swahili
Itai et al. A corpus based morphological analyzer for unvocalized modern hebrew
Đorđević et al. Different approaches in serbian language parsing using context-free grammars
Ellman et al. Word sense disambiguation by information filtering and extraction
Isozaki An analysis of a high-performance Japanese question answering system