This paper describes the Idiap submission to WAT 2019 for the English-Hindi MultiModal Translation Task. We have used the state-of-the-art Transformer model and utilized the IITB English-Hindi parallel corpus as an additional data source.
Among the different tracks of the multimodal task, we have participated in the "Text-Only" track for the evaluation and challenge test sets. Our submission tops in its track among the competitors in terms of both automatic and manual evaluation.
Based on automatic scores, our text-only submission also outperforms systems that consider visual information in the "multimodal translation" task.