Following the success of GPT4, there has been a surge in interest in
mul...
Transformer-like models for vision tasks have recently proven effective ...
Deployed multimodal systems can fail in ways that evaluators did not
ant...
The advent of large pre-trained models has brought about a paradigm shif...
In this paper, we contend that the objective of representation learning ...
Recently, self-supervised learning (SSL) has achieved tremendous success...
Autoencoding has achieved great empirical success as a framework for lea...
Clustering data lying close to a union of low-dimensional manifolds, wit...
This paper proposes an unsupervised method for learning a unified
repres...
Despite strong empirical performance for image classification, deep neur...
This work proposes a minimal computational model for learning a structur...
This work proposes a new computational framework for learning an explici...