Image and Video Caption Generation Using Machine Learning

Rushikesh Nitin Chavan; Vijay Vishnu Anpat; Bhagyashri Nanaso Chavan; Nagesh Bharat Kshirsagar; Prof. Kamble. D. R

Image and Video Caption Generation Using Machine Learning

Author(s):

Rushikesh Nitin Chavan , SB Patil college of Engineering, Indapur ; Vijay Vishnu Anpat, SB Patil college of Engineering, Indapur ; Bhagyashri Nanaso Chavan, SB Patil college of Engineering, Indapur ; Nagesh Bharat Kshirsagar, SB Patil college of Engineering, Indapur ; Prof. Kamble. D. R, SB Patil college of Engineering, Indapur

Keywords:

Deep Learning, Part of Speech, Image Captioning, Multi-Task Learning

Abstract

Image Grounded web straggler is the way toward looking through data by exercising affiliated images. The tremendous means of images are accessible on the web in that a large number of the images are contain as with named and without named caption. Our task is to induce an automatic caption for the images grounded on the image content. To produce an image caption, originally, the content of the image should be completely understood; and also, the semantic information contained in the image should be described using an expression or statement that conforms to certain grammatical rules. Therefore, it requires ways from both computer vision and natural language processing to connect the two different media forms together, which is largely gruelling. The paper targets producing mechanized eulogies by learning the contents of the image. At present images are clarified with mortal supplication, and it turns out to be nearly unbelievable task for tremendous databases. The picture information base is given as donation to a deep neural.

Other Details

Paper ID: IJSRDV11I30034
Published in: Volume : 11, Issue : 3
Publication Date: 01/06/2023
Page(s): 36-38

Article Preview

Download Article

Email To A Friend

CALL FOR PAPERS : June-2026

ADVANCED SEARCH

NEWS & UPDATES

FOR AUTHORS

FOR REVIEWERS

ARCHIVES

DOWNLOADS