[go: up one dir, main page]

Al Hasan et al., 2022 - Google Patents

Clustering analysis of Bangla news articles with TF-IDF & CV using mini-batch K-means and K-means

Al Hasan et al., 2022

Document ID
10539001238034323860
Author
Al Hasan S
Ruiqin W
Hussain M
Publication year
Publication venue
2022 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom)

External Links

Snippet

Document clustering is the compilation of docu-ments relating to textual content into classes or clusters. The primary objective is to group the documents that are internally logical but substantially different from each other. It is a vital method used in the retrieval of information …
Continue reading at ieeexplore.ieee.org (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30705Clustering or classification
    • G06F17/3071Clustering or classification including class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • G06F17/30675Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30705Clustering or classification
    • G06F17/30707Clustering or classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30613Indexing
    • G06F17/30619Indexing indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386Retrieval requests
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30587Details of specialised database models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/02Knowledge representation
    • G06N5/022Knowledge engineering, knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • G06K9/6279Classification techniques relating to the number of classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F19/00Digital computing or data processing equipment or methods, specially adapted for specific applications
    • G06F19/10Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/18Digital computers in general; Data processing equipment in general in which a programme is changed according to experience gained by the computer itself during a complete run; Learning machines

Similar Documents

Publication Publication Date Title
Alhaj et al. Improving Arabic cognitive distortion classification in Twitter using BERTopic
Alshaer et al. Feature selection method using improved CHI Square on Arabic text classifiers: analysis and application
Aliwy et al. Comparative study of five text classification algorithms with their improvements
Bisandu et al. Clustering news articles using efficient similarity measure and N-grams
Romanov et al. Application of natural language processing algorithms to the task of automatic classification of Russian scientific texts
Mustafa et al. A comprehensive evaluation of metadata-based features to classify research paper’s topics
Sandhiya et al. A review of topic modeling and its application
Manimaran et al. A survey of association rule mining in text applications
Wijanto et al. Topic Modeling for Scientific Articles: Exploring Optimal Hyperparameter Tuning in BERT.
Mohemad et al. Performance analysis in text clustering using k-means and k-medoids algorithms for Malay crime documents
Mulyanto et al. Systematic literature review of text feature extraction
Benghuzzi et al. An investigation of keywords extraction from textual documents using Word2Vec and Decision Tree
Al Hasan et al. Clustering analysis of Bangla news articles with TF-IDF & CV using mini-batch K-means and K-means
KR102754741B1 (en) Method of automatically structuring research based on keword and device thereof
Kaysar et al. Word sense disambiguation of Bengali words using FP-growth algorithm
Afolabi et al. Topic modelling for research perception: Techniques, processes and a case study
Golechha et al. Implementing topic modelling for document clustering
Dawar et al. Text categorization by content using Naïve Bayes approach
CN114860936A (en) Topic generation system and method based on hotspot list
ul haq Dar et al. Classification of job offers of the World Wide Web
Ogada N-grams for Text Classification Using Supervised Machine Learning
Wen et al. Blockchain-based reviewer selection
Salman et al. Arabic document clustering: A survey
King et al. Graggle: A Graph-based Approach to Document Clustering
Rajkumar et al. An efficient feature extraction with subset selection model using machine learning techniques for Tamil documents classification