Skip to content

Latest commit

 

History

History
18 lines (10 loc) · 527 Bytes

README.md

File metadata and controls

18 lines (10 loc) · 527 Bytes

Transformers

O. Vinyals, A. Toshev, S. Bengio and D. Erhan.
Show and Tell: A Neural Image Caption Generator, 2015 https://arxiv.org/pdf/1411.4555.pdf

Proposal: PROJECT PROPOSAL.pdf

Project goals:

  1. Try to reproduce image captioning NN model.

  2. Train the model on different datasets.

  3. Measure results on different metrics, suggested in paper (starting from BLEU score metric, and diving into what other metrics they used)