Kiani et al., 2009 - Google Patents

Developing a Persian chunker using a hybrid approach

Kiani et al., 2009

Document ID: 3993535500128490738
Author: Kiani S; Akhavan T; Shamsfard M
Publication year: 2009
Publication venue: 2009 International Multiconference on Computer Science and Information Technology

External Links

Cited by

Snippet

Text segmentation is the process of recognizing boundaries of text constituents, such as sentences, phrases and words. This paper focuses on phrase segmentation also known as chunking. This task has different problems in various natural languages depending on …

Continue reading at www.academia.edu (PDF) (other versions)

230000001537 neural 0 abstract description 22

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/271—Syntactic parsing, e.g. based on context-free grammar [CFG], unification grammars
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/2775—Phrasal analysis, e.g. finite state techniques, chunking
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/2715—Statistical methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/68—Methods or arrangements for recognition using electronic means using sequential comparisons of the image signals with a plurality of references in which the sequence of the image signals or the references is relevant, e.g. addressable memory
- G06K9/6807—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries
- G06K9/6842—Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries according to the linguistic properties, e.g. English, German

Similar Documents

Publication	Publication Date	Title
Poon et al.	2009	Unsupervised morphological segmentation with log-linear models
Elayeb	2019	Arabic word sense disambiguation: a review
Antony et al.	2011	Parts of speech tagging for Indian languages: a literature survey
Rozovskaya et al.	2014	Correcting grammatical verb errors
Das et al.	2015	Part of speech tagging in odia using support vector machine
Jayakrishnan et al.	2018	Multi-class emotion detection and annotation in Malayalam novels
Suárez et al.	2002	A maximum entropy-based word sense disambiguation system
Dien et al.	2003	POS-tagger for English-Vietnamese bilingual corpus
Tlili-Guiassa	2006	Hybrid method for tagging Arabic text
Kumar et al.	2009	Morphological analyzer for agglutinative languages using machine learning approaches
Wan et al.	2021	Enhancing metaphor detection by gloss-based interpretations
Surahio et al.	2018	Prediction system for sindhi parts of speech tags by using support vector machine
Jayasuriya et al.	2013	Learning a stochastic part of speech tagger for sinhala
Hellwig	2017	Coarse semantic classification of rare nouns using cross-lingual data and recurrent neural networks
Bruches et al.	2021	A system for information extraction from scientific texts in Russian
Jamwal et al.	2022	A Novel Hybrid Approach for the Designing and Implementation of Dogri Spell Checker
Chakraborty et al.	2024	Syntactic Category based Assamese Question Pattern Extraction using N-grams
Kiani et al.	2009	Developing a Persian chunker using a hybrid approach
Francis	2015	A comprehensive survey on parts of speech tagging approaches in dravidian languages
Hoste	2016	The mention-pair model
Bosch et al.	2007	Memory-based morphological analysis and part-of-speech tagging of Arabic
Farrah et al.	2018	An hybrid approach to improve part of speech tagging system
Bach et al.	2015	Paraphrase identification in Vietnamese documents
Sampath et al.	2023	Hybrid Tamil spell checker with combined character splitting
Kardan et al.	2014	Improving Persian POS tagging using the maximum entropy model