Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
,
Yinfei Yang
,
Ye Xia
,
Yi‐Ting Chen
,
Zarana Parekh
,
Hieu Pham
,
Quoc V. Le
,
Yun-Hsuan Sung
,
Zhen Li
,
Tom Duerig
2021
arXiv (Cornell University)
1,190 citations