ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision Blog Posting uri icon

publication date

  • February 5, 2021