Froud et al., 2010 - Google Patents
Stemming and similarity measures for Arabic Documents ClusteringFroud et al., 2010
- Document ID
- 15402100354464619395
- Author
- Froud H
- Benslimane R
- Lachkar A
- Ouatik S
- Publication year
- Publication venue
- 2010 5th International Symposium on I/V Communications and Mobile Network
External Links
Snippet
Arabic Documents Clustering is an important task for obtaining good results with the traditional Information Retrieval (TR) systems especially with the rapid growth of the number of online documents present in Arabic language. Document clustering aims to automatically …
- 230000000875 corresponding 0 description 4
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/3069—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/3066—Query translation
- G06F17/30669—Translation of the query language, e.g. Chinese to English
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/30707—Clustering or classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30522—Query processing with adaptation to user needs
- G06F17/3053—Query processing with adaptation to user needs using ranking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30731—Creation of semantic tools
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30716—Browsing or visualization
- G06F17/30719—Summarization for human users
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G06F17/30864—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2795—Thesaurus; Synonyms
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Froud et al. | Arabic text summarization based on latent semantic analysis to enhance arabic documents clustering | |
Siddiqi et al. | Keyword and keyphrase extraction techniques: a literature review | |
US6189002B1 (en) | Process and system for retrieval of documents using context-relevant semantic profiles | |
Islam et al. | Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words. | |
Froud et al. | Stemming and similarity measures for Arabic Documents Clustering | |
Alexandrov et al. | An approach to clustering abstracts | |
Froud et al. | A comparative study of root-based and stem-based approaches for measuring the similarity between Arabic words for Arabic text mining applications | |
Lamba et al. | A survey on plagiarism detection techniques for indian regional languages | |
Andersson et al. | When is the time ripe for natural language processing for patent passage retrieval? | |
CN105956010A (en) | Distributed information retrieval set selection method based on distributed representation and local ordering | |
Sanchez-Gomez et al. | Sentiment-oriented query-focused text summarization addressed with a multi-objective optimization approach | |
Najadat et al. | Automatic keyphrase extractor from arabic documents | |
Ngo et al. | Wordnet-based information retrieval using common hypernyms and combined features | |
Kireyev | Semantic-based estimation of term informativeness | |
Wang et al. | A joint chinese named entity recognition and disambiguation system | |
Haribhakta et al. | Unsupervised topic detection model and its application in text categorization | |
Wang et al. | Course concept extraction in MOOC via explicit/implicit representation | |
Sahmoudi et al. | A new keyphrases extraction method based on suffix tree data structure for Arabic documents clustering | |
Bsoul et al. | Distance measures and stemming impact on Arabic document clustering | |
Artese et al. | What is this painting about? Experiments on Unsupervised Keyphrases Extraction algorithms | |
Li et al. | News-oriented automatic Chinese keyword indexing | |
Hoque et al. | Information retrieval system in bangla document ranking using latent semantic indexing | |
CN113642325A (en) | Text keyword extraction method fusing text structure information and semantic information | |
Froud et al. | Stemming for Arabic words similarity measures based on Latent Semantic Analysis model | |
Ab Samat et al. | Malay documents clustering algorithm based on singular value decomposition |