Emanuele Bugliarello

research

∙ 08/22/2023

StoryBench: A Multifaceted Benchmark for Continuous Story Visualization

Generating video stories from text prompts is a complex task. In additio...

0 Emanuele Bugliarello, et al. ∙

research

∙ 05/23/2023

Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining

Recent work in vision-and-language pretraining has investigated supervis...

2 Emanuele Bugliarello, et al. ∙

research

∙ 05/12/2023

Measuring Progress in Fine-grained Vision-and-Language Understanding

While pretraining on large-scale image-text data from the Web has facili...

2 Emanuele Bugliarello, et al. ∙

research

∙ 03/30/2023

A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision

There has been a recent explosion of computer vision models which perfor...

4 Lucas Beyer, et al. ∙

research

∙ 10/24/2022

Multilingual Multimodal Learning with Machine Translated Text

Most vision-and-language pretraining research focuses on English tasks. ...

2 Chen Qiu, et al. ∙

research

∙ 07/14/2022

Language Modelling with Pixels

Language models are defined over a finite set of inputs, which creates a...

3 Phillip Rust, et al. ∙

research

∙ 06/09/2022

Ancestor-to-Creole Transfer is Not a Walk in the Park

We aim to learn language models for Creole languages for which large vol...

3 Heather Lent, et al. ∙

research

∙ 05/24/2022

Rethinking Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization

Vision-and-language (V L) models pretrained on large-scale multimodal ...

3 Aishwarya Agrawal, et al. ∙

research

∙ 04/22/2022

Mostra: A Flexible Balancing Framework to Trade-off User, Artist and Platform Objectives for Music Sequencing

We consider the task of sequencing tracks on music streaming platforms w...

0 Emanuele Bugliarello, et al. ∙

research

∙ 03/18/2022

Challenges and Strategies in Cross-Cultural NLP

Various efforts in the Natural Language Processing (NLP) community have ...

12 Daniel Hershcovich, et al. ∙

research

∙ 09/13/2021

On Language Models for Creoles

Creole languages such as Nigerian Pidgin English and Haitian Creole are ...

23 Heather Lent, et al. ∙

research

∙ 09/09/2021

Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers

Pretrained vision-and-language BERTs aim to learn representations that c...

19 Stella Frank, et al. ∙

research

∙ 01/28/2021

The Role of Syntactic Planning in Compositional Image Captioning

Image captioning has focused on generalizing to images drawn from the sa...

14 Emanuele Bugliarello, et al. ∙

research

∙ 11/30/2020

Multimodal Pretraining Unmasked: Unifying the Vision and Language BERTs

Large-scale pretraining and task-specific fine-tuning is now the standar...

12 Emanuele Bugliarello, et al. ∙

research

∙ 05/05/2020

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

The performance of neural machine translation systems is commonly evalua...

0 Emanuele Bugliarello, et al. ∙

research

∙ 09/06/2019

Improving Neural Machine Translation with Parent-Scaled Self-Attention

Most neural machine translation (NMT) models operate on source and targe...

0 Emanuele Bugliarello, et al. ∙

research

∙ 05/30/2019

Matrix Completion in the Unit Hypercube via Structured Matrix Factorization

Several complex tasks that arise in organizations can be simplified by m...

0 Emanuele Bugliarello, et al. ∙

Emanuele Bugliarello

Featured Co-authors

Sign in with Google

Consider DeepAI Pro