research ∙ 12/02/2022
Compound Tokens: Channel Fusion for Vision-Language Representation Learning
We present an effective method for fusing visual-and-language representa...
research ∙ 01/16/2021