Al Hasan et al., 2022 - Google Patents
Clustering analysis of Bangla news articles with TF-IDF & CV using mini-batch K-means and K-means
- Document ID
- 10539001238034323860
- Author
- Al Hasan S
- Ruiqin W
- Hussain M
- Publication year
- 2022
- Publication venue
- 2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)
Snippet
Document clustering is the compilation of documents relating to textual content into classes or clusters. The primary objective is to group the documents that are internally logical but substantially different from each other. It is a vital method used in the retrieval of information …
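The grouping idea in the snippet can be illustrated with a minimal sketch. It is not the authors' pipeline: it uses only the Python standard library, plain K-means rather than the mini-batch variant, whitespace tokenisation, and a deterministic farthest-first initialisation, all of which are assumptions made for illustration. The function names and the toy English documents are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return L2-normalised TF-IDF vectors (as dicts) for whitespace-tokenised docs."""
    tokenised = [doc.lower().split() for doc in docs]
    n = len(tokenised)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for tokens in tokenised for term in set(tokens))
    vectors = []
    for tokens in tokenised:
        tf = Counter(tokens)
        # Smoothed IDF (an assumption; other weightings are equally valid).
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two sparse vectors."""
    return sum((a.get(t, 0.0) - b.get(t, 0.0)) ** 2 for t in set(a) | set(b))


def mean(vectors):
    """Component-wise mean of sparse vectors."""
    total = Counter()
    for v in vectors:
        for t, w in v.items():
            total[t] += w
    return {t: w / len(vectors) for t, w in total.items()}


def farthest_first(vectors, k):
    """Deterministic initialisation: start at the first vector, then repeatedly
    add the vector farthest from all centroids chosen so far."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        )
    return centroids


def kmeans(vectors, k, max_iters=20):
    """Plain (full-batch) K-means; returns one cluster label per vector."""
    centroids = farthest_first(vectors, k)
    labels = None
    for _ in range(max_iters):
        new_labels = [
            min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # keep the old centroid if a cluster empties
                centroids[j] = mean(members)
    return labels


# Hypothetical toy corpus: two sports-like and two finance-like documents.
docs = [
    "cricket match team win",
    "cricket team score match",
    "bank loan interest rate",
    "bank interest rate budget",
]
print(kmeans(tfidf_vectors(docs), k=2))  # [0, 0, 1, 1]
```

Documents that share vocabulary end up in the same cluster, while documents with disjoint vocabulary are separated. A mini-batch variant would update centroids from small random subsets of the vectors instead of the full set on each pass.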
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/30707—Clustering or classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30613—Indexing
- G06F17/30619—Indexing indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30587—Details of specialised database models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/18—Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Alhaj et al. | Improving Arabic cognitive distortion classification in Twitter using BERTopic | |
| Alshaer et al. | Feature selection method using improved CHI Square on Arabic text classifiers: analysis and application | |
| Aliwy et al. | Comparative study of five text classification algorithms with their improvements | |
| Bisandu et al. | Clustering news articles using efficient similarity measure and N-grams | |
| Romanov et al. | Application of natural language processing algorithms to the task of automatic classification of Russian scientific texts | |
| Mustafa et al. | A comprehensive evaluation of metadata-based features to classify research paper’s topics | |
| Sandhiya et al. | A review of topic modeling and its application | |
| Manimaran et al. | A survey of association rule mining in text applications | |
| Wijanto et al. | Topic Modeling for Scientific Articles: Exploring Optimal Hyperparameter Tuning in BERT. | |
| Mohemad et al. | Performance analysis in text clustering using k-means and k-medoids algorithms for Malay crime documents | |
| Mulyanto et al. | Systematic literature review of text feature extraction | |
| Benghuzzi et al. | An investigation of keywords extraction from textual documents using Word2Vec and Decision Tree | |
| Al Hasan et al. | Clustering analysis of Bangla news articles with TF-IDF & CV using mini-batch K-means and K-means | |
| KR102754741B1 (en) | Method of automatically structuring research based on keword and device thereof | |
| Kaysar et al. | Word sense disambiguation of Bengali words using FP-growth algorithm | |
| Afolabi et al. | Topic modelling for research perception: Techniques, processes and a case study | |
| Golechha et al. | Implementing topic modelling for document clustering | |
| Dawar et al. | Text categorization by content using Naïve Bayes approach | |
| CN114860936A (en) | Topic generation system and method based on hotspot list | |
| ul haq Dar et al. | Classification of job offers of the World Wide Web | |
| Ogada | N-grams for Text Classification Using Supervised Machine Learning | |
| Wen et al. | Blockchain-based reviewer selection | |
| Salman et al. | Arabic document clustering: A survey | |
| King et al. | Graggle: A Graph-based Approach to Document Clustering | |
| Rajkumar et al. | An efficient feature extraction with subset selection model using machine learning techniques for Tamil documents classification |