Hierarchical decision lists for word sense disambiguation
Yarowsky, 2000
- Document ID
- 7939793316054139293
- Author
- Yarowsky D
- Publication year
- 2000
- Publication venue
- Computers and the Humanities
Snippet
This paper describes a supervised algorithm for word sense disambiguation based on hierarchies of decision lists. This algorithm supports a useful degree of conditional branching while minimizing the training data fragmentation typical of decision trees. Classifications are …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G06F17/2881—Natural language generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2795—Thesaurus; Synonyms
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/2715—Statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
- G06F17/279—Discourse representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2755—Morphological analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yarowsky | | Hierarchical decision lists for word sense disambiguation |
Sekine et al. | | A corpus-based probabilistic grammar with only two non-terminals |
Inkpen et al. | | Building and using a lexical knowledge base of near-synonym differences |
Arun et al. | | Lexicalization in crosslinguistic probabilistic parsing: The case of French |
Habash | | Arabic morphological representations for machine translation |
Montoyo et al. | | Combining knowledge- and corpus-based word-sense-disambiguation methods |
Mihalcea | | Word sense disambiguation with pattern learning and automatic feature selection |
Lee et al. | | Syllable-pattern-based unknown-morpheme segmentation and estimation for hybrid part-of-speech tagging of Korean |
Þorsteinsson et al. | | A wide-coverage context-free grammar for Icelandic and an accompanying parsing system |
Hristea | | The naïve Bayes model for unsupervised word sense disambiguation: Aspects concerning feature selection |
Imamura | | Application of translation knowledge acquired by hierarchical phrase alignment for pattern-based MT |
Al-Sabbagh et al. | | Mining the Web for the Induction of a Dialectical Arabic Lexicon |
Naskar et al. | | A phrasal EBMT system for translating English to Bengali |
Preotiuc-Pietro et al. | | Unsupervised word sense disambiguation with N-gram features |
Wu et al. | | Parsing-based Chinese word segmentation integrating morphological and syntactic information |
Flickinger | | The English resource grammar |
Piskorski et al. | | Lemmatization of Polish person names |
Adebara et al. | | Translating the unseen? Yoruba-English MT in low-resource, morphologically-unmarked settings |
Argaw et al. | | Dictionary-based Amharic-French information retrieval |
Okhovvat et al. | | An Accurate Persian Part-of-Speech Tagger |
Hurskainen | | Optimizing disambiguation in Swahili |
Itai et al. | | A corpus based morphological analyzer for unvocalized Modern Hebrew |
Đorđević et al. | | Different approaches in Serbian language parsing using context-free grammars |
Ellman et al. | | Word sense disambiguation by information filtering and extraction |
Isozaki | | An analysis of a high-performance Japanese question answering system |