A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.
deep-learning image-captioning sequence-to-sequence cross-modal-retrieval sentence-paraphrasing common-vector-space
-
Updated
Nov 21, 2019 - Python