Scopus Indexed Publications

Paper Details


Title
BanglaSenti: A Dataset of Bangla Words for Sentiment Analysis
Author
Hasmot Ali, Ahmed Al Marouf, Md. Fahad Hossain, Shaon Bhatta Shuvo,
Email
shaon.cse@diu.edu.bd
Abstract
Being the fifth most spoken language in the world, use of Bangla or Bengali language has spread its breadth into the world of social media. Huge volume of user-generated Bangla data are produced daily in various social media such as Facebook, Twitter, YouTube; online news portals and various websites. Hence, the importance of understanding the emotion and sentiment of these types of data has gain attraction to the researchers' recently. Bangla Natural Language Processing (BNLP) has emerged as a novel research domain because of these multidisciplinary scopes. In this paper, we have presented “BanglaSenti”, which is a lexicon based corpus or dataset generated solely to identify the sentiment analysis from textual data. This dataset contains 61582 Bangla words with positive, negative and neutral words. These polarities are very significant to understand the overall polarity of the sentences. Not only the corpus, but also a model simulation has been conducted in this paper to understand the usability of this dataset. The dataset is formalized as English SentiWordNet dataset so that researchers' may utilize it with the same format of codes. Though the dataset is developed for sentiment analysis, it could be utilized for emotion detection, opinion/review mining and such applications.
Keywords
Bangla Natural Langauge Processing (BNLP) , Social Media , Bangla Sentiment words , Sentiment Analysis
Journal or Conference Name
11th International Conference on Computing, Communication and Networking Technologies, ICCCNT 2020
Publication Year
2020
Indexing
scopus