Hybrid deep neural network for Bangla automated image descriptor

International Journal of Advances in Intelligent Informatics

View Publication Info
 
 
Field Value
 
Title Hybrid deep neural network for Bangla automated image descriptor
 
Creator Jishan, Md Asifuzzaman
Mahmud, Khan Raqib
Azad, Abul Kalam Al
Alam, Md Shahabub
Khan, Anif Minhaz
 
Subject convolutional neural network; hybrid recurrent neural network; long short-term memory; bi-directional RNN; natural language descriptors
 
Description Automated image to text generation is a computationally challenging computer vision task which requires sufficient comprehension of both syntactic and semantic meaning of an image to generate a meaningful description. Until recent times, it has been studied to a limited scope due to the lack of visual-descriptor dataset and functional models to capture intrinsic complexities involving features of an image. In this study, a novel dataset was constructed by generating Bangla textual descriptor from visual input, called Bangla Natural Language Image to Text (BNLIT), incorporating 100 classes with annotation. A deep neural network-based image captioning model was proposed to generate image description. The model employs Convolutional Neural Network (CNN) to classify the whole dataset, while Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) capture the sequential semantic representation of text-based sentences and generate pertinent description based on the modular complexities of an image. When tested on the new dataset, the model accomplishes significant enhancement of centrality execution for image semantic recovery assignment. For the experiment of that task, we implemented a hybrid image captioning model, which achieved a remarkable result for a new self-made dataset, and that task was new for the Bangladesh perspective. In brief, the model provided benchmark precision in the characteristic Bangla syntax reconstruction and comprehensive numerical analysis of the model execution results on the dataset.
 
Publisher Universitas Ahmad Dahlan
 
Contributor
 
Date 2020-07-12
 
Type info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion

 
Format application/pdf
 
Identifier http://ijain.org/index.php/IJAIN/article/view/499
10.26555/ijain.v6i2.499
 
Source International Journal of Advances in Intelligent Informatics; Vol 6, No 2 (2020): July 2020; 109-122
2548-3161
2442-6571
 
Language eng
 
Relation http://ijain.org/index.php/IJAIN/article/view/499/ijain_v6i2_p109-122
 
Rights https://creativecommons.org/licenses/by-sa/4.0
 

Contact Us

The PKP Index is an initiative of the Public Knowledge Project.

For PKP Publishing Services please use the PKP|PS contact form.

For support with PKP software we encourage users to consult our wiki for documentation and search our support forums.

For any other correspondence feel free to contact us using the PKP contact form.

Find Us

Twitter

Copyright © 2015-2018 Simon Fraser University Library