LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Tan, Mohit Bansal. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Proc...