Leveraging the BLIP Model for Visual Question Answering: A Comparative Analysis on VQA and DAQUAR Datasets
machine-learning natural-language-processing computer-vision inference accuracy image-captioning bleu-score blip visual-question-answering wups vqav2 bert-score daquar
-
Updated
Jun 18, 2024 - Jupyter Notebook