Innovative Techniques for Text Classification
Author(s):
Amey K. Shet Tilve , Don Bosco College of Engineering; Shaba Desai, Padre Conceicao College of Engineering
Keywords:
Text Classification, Information Retrieval, Naive Bayes Classifier, Vector Space Model Text Classification, Part of Speech Tagging, Natural Language Processing
Abstract:
This research focuses on Text Classification. Text classification is the task of automatically sorting a set of documents into categories from a predefined set. The domain of this research is the combination of information retrieval (IR) technology, Data mining and machine learning (ML) technology. This research will outline the fundamental traits of the technologies involved. This research uses three text classification algorithms (Naive Bayes, VSM for text classification and the new technique -Use of Stanford Tagger for text classification) to classify documents into different categories, which is trained on two different datasets (20 Newsgroups and New news dataset for five categories). In regards to the above classification strategies, Naïve Bayes is potentially good at serving as a text classification model due to its simplicity.
Other Details:
| Manuscript Id | : | IJSTEV3I6002
|
| Published in | : | Volume : 3, Issue : 6
|
| Publication Date | : | 01/01/2017
|
| Page(s) | : | 1-6
|
Download Article