Alexey Gritsenko


Featured Co-authors

Cordelia Schmid
156 publications
Yi Tay
76 publications
Chen Sun
74 publications
Mohammad Norouzi
71 publications
Neil Houlsby
44 publications
Mostafa Dehghani
43 publications
Mario Lucic
40 publications
William Chan
37 publications
Tim Salimans
36 publications
Anurag Arnab
35 publications
Ben Poole
34 publications

research

∙ 07/12/2023

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

The ubiquitous and demonstrably suboptimal choice of resizing images to ...

Mostafa Dehghani, et al.

research

∙ 06/16/2023

Scaling Open-Vocabulary Object Detection

Open-vocabulary object detection has benefited greatly from pretrained v...

Matthias Minderer, et al.

research

∙ 04/24/2023

End-to-End Spatio-Temporal Action Localisation with Video Transformers

The most performant spatio-temporal action localisation models use exter...

Alexey Gritsenko, et al.

research

∙ 10/05/2022

Imagen Video: High Definition Video Generation with Diffusion Models

We present Imagen Video, a text-conditional video generation system base...

Jonathan Ho, et al.

research

∙ 07/08/2022

Beyond Transfer Learning: Co-finetuning for Action Localisation

Transfer learning is the predominant paradigm for training deep networks...

Anurag Arnab, et al.

research

∙ 04/07/2022

Video Diffusion Models

Generating temporally coherent high fidelity video is an important miles...

Jonathan Ho, et al.

research

∙ 10/18/2021

SCENIC: A JAX Library for Computer Vision Research and Beyond

Scenic is an open-source JAX library with a focus on Transformer-based m...

Mostafa Dehghani, et al.


