Image and Video Caption Generation Using Machine Learning |
Author(s): |
| Rushikesh Nitin Chavan, SB Patil College of Engineering, Indapur; Vijay Vishnu Anpat, SB Patil College of Engineering, Indapur; Bhagyashri Nanaso Chavan, SB Patil College of Engineering, Indapur; Nagesh Bharat Kshirsagar, SB Patil College of Engineering, Indapur; Prof. Kamble. D. R, SB Patil College of Engineering, Indapur |
Keywords: |
| Deep Learning, Part of Speech, Image Captioning, Multi-Task Learning |
Abstract |
|
An image-based web crawler searches for information by using related images. A huge number of images are available on the web, many of them captioned and many uncaptioned. Our task is to automatically generate a caption for an image based on its content. To produce an image caption, the content of the image must first be fully understood; then the semantic information contained in the image must be described in a phrase or sentence that conforms to grammatical rules. The task therefore requires techniques from both computer vision and natural language processing to connect the two different media forms, which is highly challenging. The paper aims to generate captions automatically by learning the contents of the image. At present, images are annotated through human intervention, which becomes a nearly impossible task for very large databases. The image database is given as input to a deep neural network. |
Other Details |
|
Paper ID: IJSRDV11I30034 Published in: Volume : 11, Issue : 3 Publication Date: 01/06/2023 Page(s): 36-38 |
