Vision-language tasks, such as VQA, SNLI-VE, and VCR, are challenging bec...
The convergence of text, visual, and audio data is a key step towards hu...
Video understanding tasks have traditionally been modeled by two separat...
Large-scale multi-modal contrastive pre-training has demonstrated great...
Human intelligence is multimodal; we integrate visual, linguistic, and a...
Cross-modal encoders for vision-language (VL) tasks are often pretrained...
In this work, we introduce Dual Attention Vision Transformers (DaViT), a...
Contrastive language-image pretraining (CLIP) links vision and language...
Contrastive language-image pretraining (CLIP) using image-text pairs has...
Automated visual understanding of our diverse and open world demands com...
We present in this paper a new architecture, named Convolutional vision...
Prior skin image datasets have not addressed patient-level information o...
Transfer learning enhances learning across tasks by leveraging previous...
The International Skin Imaging Collaboration (ISIC) is a global partners...
Melanoma is the deadliest form of skin cancer. While curable with early...