Scopus Indexed Publications

Paper Details


Title
Toward an Enhanced Bengali Text Classification Using Saint and Common Form
Author
Nushrat Jahan Ria, Abu Kaisar Mohammad Masum, Mohammad Abu Yousuf, Sharun Akter Khushbu, Sheikh Abujar, Syed Akhter Hossain,
Email
sheikh.cse@diu.edu.bd
Abstract
Language processing tool has been strengthened by the measurements of text classification. Due to this concern many approaches investigate through the text documentation problem. Behalf of this conception we have focused on our bengali text written format. A long period of time bengali people are familiar with two bengali accents about saint and common form. With concern text document processing becomes easier to translate. Most well-known supervised six classifiers we have used to classify these two bengali forms of saint and common. Classifiers prediction will determine whether it is saint or common form. Collection of text documents more than 1200 mix sentences grabbed from bengali written sources. Each text needs preprocess to classify the text into a solid form of output. Before applying algorithms there has been some prerequisite ability to split the sentences, stemming, remove stop words, construct contraction. Ending with preprocess, the processed bengali text had been taken as input on machine learning classifiers that have raised very spontaneous outcomes over the accent of bengali data. Foremost output produced by NB classifier to identify the actual form about 77% on bengali text of saint and common form. Apart from that, other ML classifiers XGB, RB, DT, SVC, KNN showed nearly prediction upto 77% -64% accuracy which we have proposed in different segments of this paper.
Keywords
Text Classification , Bengali Text Classification , Machine Learning , NB classifier , XGB classifier , RB classifier , DT classifier , SVC , KNN classifier
Journal or Conference Name
11th International Conference on Computing, Communication and Networking Technologies, ICCCNT 2020
Publication Year
2020
Indexing
scopus