Scopus Indexed Publications
Paper Details
- Title
-
Toward an Enhanced Bengali Text Classification Using Saint and Common Form
- Author
-
Nushrat Jahan Ria,
Abu Kaisar Mohammad Masum,
Mohammad Abu Yousuf,
Sharun Akter Khushbu,
Sheikh Abujar,
Syed Akhter Hossain,
- Email
-
sheikh.cse@diu.edu.bd
- Abstract
-
Language processing tool has
been strengthened by the measurements of text classification. Due to
this concern many approaches investigate through the text documentation
problem. Behalf of this conception we have focused on our bengali text
written format. A long period of time bengali people are familiar with
two bengali accents about saint and common form. With concern text
document processing becomes easier to translate. Most well-known
supervised six classifiers we have used to classify these two bengali
forms of saint and common. Classifiers prediction will determine whether
it is saint or common form. Collection of text documents more than 1200
mix sentences grabbed from bengali written sources. Each text needs
preprocess to classify the text into a solid form of output. Before
applying algorithms there has been some prerequisite ability to split
the sentences, stemming, remove stop words, construct contraction.
Ending with preprocess, the processed bengali text had been taken as
input on machine learning classifiers that have raised very spontaneous
outcomes over the accent of bengali data. Foremost output produced by NB
classifier to identify the actual form about 77% on bengali text of
saint and common form. Apart from that, other ML classifiers XGB, RB,
DT, SVC, KNN showed nearly prediction upto 77% -64% accuracy which we
have proposed in different segments of this paper.
- Keywords
-
Text Classification , Bengali Text Classification , Machine Learning , NB classifier , XGB classifier , RB classifier , DT classifier , SVC , KNN classifier
- Journal or Conference Name
- 11th International Conference on Computing, Communication and Networking Technologies, ICCCNT 2020
- Publication Year
-
2020
- Indexing
-
scopus