We release Code Llama, a family of large language models for code based ...
In this work, we develop and release Llama 2, a collection of pretrained...
We introduce LLaMA, a collection of foundation language models ranging f...
We introduce submodel co-training, a regularization method related to co...
A Vision Transformer (ViT) is a simple neural architecture amenable to s...
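To illustrate the kind of architecture this entry refers to, here is a minimal, hedged Vision Transformer sketch: patch embedding, a class token, a small transformer encoder, and a linear classification head. The layer sizes, depth, and the use of torch.nn modules are illustrative assumptions, not the configuration from the paper.

```python
# Minimal Vision Transformer sketch (illustrative sizes, not the paper's configuration).
import torch
import torch.nn as nn

class TinyViT(nn.Module):
    def __init__(self, img_size=224, patch_size=16, dim=192, depth=4, heads=3, num_classes=1000):
        super().__init__()
        num_patches = (img_size // patch_size) ** 2
        # Patch embedding: cut the image into patches and project each patch to a vector.
        self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, num_patches + 1, dim))
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=4 * dim,
            activation="gelu", batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x):
        x = self.patch_embed(x)                      # (B, dim, H/ps, W/ps)
        x = x.flatten(2).transpose(1, 2)             # (B, num_patches, dim)
        cls = self.cls_token.expand(x.size(0), -1, -1)
        x = torch.cat([cls, x], dim=1) + self.pos_embed
        x = self.encoder(x)
        return self.head(x[:, 0])                    # classify from the class token

logits = TinyViT()(torch.randn(2, 3, 224, 224))      # -> shape (2, 1000)
```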
After their initial success in natural language processing, transformer ...
We show how to augment any convolutional network with an attention-based...
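A hedged sketch of the general idea behind this entry: replace the global average pooling at the end of a convolutional network with a small attention-based aggregation layer. The torchvision ResNet-18 backbone and the specific pooling module below are illustrative assumptions, not necessarily the design proposed in the paper.

```python
# Illustrative attention-based aggregation head on top of a convolutional backbone.
import torch
import torch.nn as nn
from torchvision.models import resnet18

class AttentionPool(nn.Module):
    """Aggregates a (B, N, C) set of spatial features with a single learned query token."""
    def __init__(self, dim, num_classes):
        super().__init__()
        self.query = nn.Parameter(torch.zeros(1, 1, dim))
        self.attn = nn.MultiheadAttention(dim, num_heads=1, batch_first=True)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, feats):                          # feats: (B, N, C)
        q = self.query.expand(feats.size(0), -1, -1)   # one query per image
        pooled, _ = self.attn(q, feats, feats)         # attention-weighted sum of locations
        return self.head(pooled.squeeze(1))

backbone = resnet18(weights=None)
features = nn.Sequential(*list(backbone.children())[:-2])   # drop avgpool and fc
pool = AttentionPool(dim=512, num_classes=1000)

x = torch.randn(2, 3, 224, 224)
fmap = features(x)                                     # (B, 512, 7, 7)
logits = pool(fmap.flatten(2).transpose(1, 2))         # (B, 1000)
```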
Pre-training models on large scale datasets, like ImageNet, is a standar...
The influential Residual Networks designed by He et al. remain the gold-...
Following their success in natural language processing, transformers hav...
We present ResMLP, an architecture built entirely upon multi-layer perce...
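In the spirit of what this abstract describes, the sketch below shows one residual block built only from affine scalings and linear layers, applied alternately across patches and across channels. The block structure, dimensions, and the Affine module are illustrative assumptions rather than the exact ResMLP specification.

```python
# Sketch of an MLP-only residual block in the spirit of ResMLP (illustrative sizes).
import torch
import torch.nn as nn

class Affine(nn.Module):
    """Per-channel scale and shift, used here in place of a normalization layer."""
    def __init__(self, dim):
        super().__init__()
        self.alpha = nn.Parameter(torch.ones(dim))
        self.beta = nn.Parameter(torch.zeros(dim))

    def forward(self, x):
        return self.alpha * x + self.beta

class ResMLPBlock(nn.Module):
    def __init__(self, num_patches, dim, expansion=4):
        super().__init__()
        self.norm1 = Affine(dim)
        self.cross_patch = nn.Linear(num_patches, num_patches)   # mixes information across patches
        self.norm2 = Affine(dim)
        self.cross_channel = nn.Sequential(                      # per-patch MLP across channels
            nn.Linear(dim, expansion * dim), nn.GELU(), nn.Linear(expansion * dim, dim))

    def forward(self, x):                                         # x: (B, num_patches, dim)
        x = x + self.cross_patch(self.norm1(x).transpose(1, 2)).transpose(1, 2)
        x = x + self.cross_channel(self.norm2(x))
        return x

tokens = torch.randn(2, 196, 384)                     # e.g. 14x14 patches embedded to 384 channels
out = ResMLPBlock(num_patches=196, dim=384)(tokens)   # output has the same shape as the input
```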
In this paper, we question if self-supervised learning provides new prop...
We design a family of image classification architectures that optimize t...
Transformers have been recently adapted for large scale image classifica...
Convolutional architectures have proven extremely successful for vision ...
Recently, neural networks purely based on attention were shown to addres...
This paper tackles the problem of learning a finer representation than t...
We propose a simple architecture to address unpaired image-to-image tran...
This note complements the paper "Fixing the train-test resolution
discre...
Data-augmentation is key to the training of neural networks for image cl...