Aude Oliva

research

∙ 04/10/2023

Artifact magnification on deepfake videos increases human detection and subjective confidence

The development of technologies for easily and automatically falsifying ...

0 Emilie Josephs, et al. ∙

research

∙ 03/30/2023

Going Beyond Nouns With Vision Language Models Using Synthetic Data

Large-scale pre-trained Vision Language (VL) models have shown remar...

0 Paola Cascante-Bonilla, et al. ∙

research

∙ 06/01/2022

Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines

Deepfakes pose a serious threat to our digital society by fueling the sp...

0 Camilo Fosco, et al. ∙

research

∙ 08/23/2021

Dynamic Network Quantization for Efficient Video Inference

Deep convolutional networks have recently achieved great success in vide...

5 Ximeng Sun, et al. ∙

research

∙ 06/23/2021

IA-RED^2: Interpretability-Aware Redundancy Reduction for Vision Transformers

The self-attention-based model, transformer, is recently becoming the le...

0 Bowen Pan, et al. ∙

research

∙ 06/10/2021

Cross-Modal Discrete Representation Learning

Recent advances in representation learning have demonstrated an ability ...

0 Alexander H. Liu, et al. ∙

research

∙ 05/11/2021

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

Multi-modal learning, which focuses on utilizing various modalities to i...

0 Rameswar Panda, et al. ∙

research

∙ 05/10/2021

Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions

When people observe events, they are able to abstract key information an...

4 Mathew Monfort, et al. ∙

research

∙ 04/01/2021

Memorability: An image-computable measure of information utility

The pixels in an image, and the objects, scenes, and actions that they c...

0 Zoya Bylinskii, et al. ∙

research

∙ 03/19/2021

Paint by Word

We investigate the problem of zero-shot semantic image painting. Instead...

6 David Bau, et al. ∙

research

∙ 03/02/2021

All at Once Network Quantization via Collaborative Knowledge Transfer

Network quantization has rapidly become one of the most widely used meth...

0 Ximeng Sun, et al. ∙

research

∙ 02/15/2021

VA-RED^2: Video Adaptive Redundancy Reduction

Performing inference on deep learning models for videos remains a challe...

0 Bowen Pan, et al. ∙

research

∙ 02/10/2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition

Temporal modelling is the key for efficient video action recognition. Wh...

0 Yue Meng, et al. ∙

research

∙ 10/22/2020

Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition

In recent years, a number of approaches based on 2D CNNs and 3D CNNs hav...

0 Chun-Fu Chen, et al. ∙

research

∙ 09/05/2020

Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability

A key capability of an intelligent system is deciding when events from p...

4 Anelise Newman, et al. ∙

research

∙ 08/12/2020

We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos

Identifying common patterns among events is a key ability in human and m...

8 Alex Andonian, et al. ∙

research

∙ 07/31/2020

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition

Action recognition is an open and challenging problem in computer vision...

4 Yue Meng, et al. ∙

research

∙ 11/01/2019

Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding

An event happening in the world is often made of different activities an...

15 Mathew Monfort, et al. ∙

research

∙ 09/10/2019

Reasoning About Human-Object Interactions Through Dual Attention Networks

Objects are entities we act upon, where the functionality of an object i...

3 Tete Xiao, et al. ∙

research

∙ 06/24/2019

GANalyze: Toward Visual Definitions of Cognitive Image Properties

We introduce a framework that uses Generative Adversarial Networks (GANs...

9 Authors, et al. ∙

research

∙ 06/09/2019

Cross-view Semantic Segmentation for Sensing Surroundings

Sensing surroundings is ubiquitous and effortless to humans: It takes a ...

1 Bowen Pan, et al. ∙

research

∙ 05/14/2019

The Algonauts Project: A Platform for Communication between the Sciences of Biological and Artificial Intelligence

In the last decade, artificial intelligence (AI) models inspired by the ...

1 Radoslaw Martin Cichy, et al. ∙

research

∙ 07/27/2018

Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics

Widely used in news, business, and educational media, infographics are h...

2 Spandan Madan, et al. ∙

research

∙ 01/09/2018

Moments in Time Dataset: one million videos for event understanding

We present the Moments in Time Dataset, a large-scale human-annotated co...

0 Mathew Monfort, et al. ∙

research

∙ 11/15/2017

Interpreting Deep Visual Representations via Network Dissection

The success of recent deep convolutional neural networks (CNNs) depends ...

0 Bolei Zhou, et al. ∙

research

∙ 09/26/2017

Understanding Infographics through Textual and Visual Tag Prediction

We introduce the problem of visual hashtag discovery for infographics: e...

0 Zoya Bylinskii, et al. ∙

research

∙ 04/19/2017

Network Dissection: Quantifying Interpretability of Deep Visual Representations

We propose a general framework called Network Dissection for quantifying...

0 David Bau, et al. ∙

research

∙ 10/06/2016

Places: An Image Database for Deep Scene Understanding

The rise of multi-million-item dataset initiatives has enabled data-hung...

0 Bolei Zhou, et al. ∙

research

∙ 04/12/2016

What do different evaluation metrics tell us about saliency models?

How best to evaluate a saliency model's ability to predict where humans ...

0 Zoya Bylinskii, et al. ∙

research

∙ 01/12/2016

Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition

The complex multi-stage architecture of cortical visual pathways provide...

0 Radoslaw M. Cichy, et al. ∙

research

∙ 12/14/2015

Learning Deep Features for Discriminative Localization

In this work, we revisit the global average pooling layer proposed in [1...

0 Bolei Zhou, et al. ∙

research

∙ 12/22/2014

Object Detectors Emerge in Deep Scene CNNs

With the success of new computational architectures for visual processin...

0 Bolei Zhou, et al. ∙

research

∙ 10/17/2014

Learning visual biases from human imagination

Although the human visual system can recognize many concepts under chall...

0 Carl Vondrick, et al. ∙
