Scopus Indexed Publications

Paper Details


Title
Bengali Caption Generation for Images Using Deep Learning

Author
Zahidul Islam, Sajib Saha, Subhenur Latif, Tanvirul Islam,

Email

Abstract
Automatic caption generation from images has evolved into an active research topic that requires Natural Language Processing (NLP) and Computer Vision (CV) to comprehend the image input and represent it in text. This can assist visually impaired people by generating text captions of images to understand their surroundings. In this study, we have presented a Long Short-Term Memory (LSTM) based Recurrent Neural Network (RNN) approach, which can generate natural language for an image. A dataset containing 8,000 images and a total of 37611 captions are utilized for training our model. Besides, VVG16 is employed to extract features from images. Finally, performance is evaluated, which shows an accuracy of 66% and BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores of 0.40, 0.18, 0.11, and 0.03, respectively.


Keywords
Computer vision , Deep learning , Encoder-Decoder , Image captioning , LSTM

Journal or Conference Name
Proceedings of 2022 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering, WIECON-ECE 2022

Publication Year
2022

Indexing
scopus