We investigate the role of attention and memory in complex reasoning tas...
Vision transformers are nowadays the de-facto preference for image
class...
A fundamental component of human vision is our ability to parse complex
...
Humans continue to outperform modern AI systems in their ability to flex...
Visual understanding requires comprehending complex visual relations bet...