The chrome extension that gets input images and generates the captions for them.
-
Updated
Dec 5, 2024 - JavaScript
The chrome extension that gets input images and generates the captions for them.
Developed an image captioning system using the BLIP model to generate detailed, context-aware captions. Achieved an average BLEU score of 0.72, providing rich descriptions that enhance accessibility and inclusivity.
Add a description, image, and links to the vit-gpt2 topic page so that developers can more easily learn about it.
To associate your repository with the vit-gpt2 topic, visit your repo's landing page and select "manage topics."