Dan Oneata

research

∙ 09/11/2023

Towards generalisable and calibrated synthetic speech detection with self-supervised representations

Generalisation – the ability of a model to perform well on unseen data –...

0 Dan Oneata, et al. ∙

research

∙ 06/20/2023

Visually grounded few-shot word learning in low-resource settings

We propose a visually grounded speech model that learns new words and th...

0 Leanne Nortje, et al. ∙

research

∙ 10/24/2022

Multilingual Multimodal Learning with Machine Translated Text

Most vision-and-language pretraining research focuses on English tasks. ...

2 Chen Qiu, et al. ∙

research

∙ 10/10/2022

YFACC: A Yorùbá speech-image dataset for cross-lingual keyword localisation through visual grounding

Visually grounded speech (VGS) models are trained on images paired with ...

0 Kayode Olaleye, et al. ∙

research

∙ 06/07/2022

FlexLip: A Controllable Text-to-Lip System

The task of converting text input into video content is becoming an impo...

0 Dan Oneata, et al. ∙

research

∙ 04/27/2022

Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations

Multimodal speech recognition aims to improve the performance of automat...

0 Dan Oneata, et al. ∙

research

∙ 02/02/2022

Keyword localisation in untranscribed speech using visually grounded speech models

Keyword localisation is the task of finding where in a speech utterance ...

0 Kayode Olaleye, et al. ∙

research

∙ 05/20/2021

Speaker disentanglement in video-to-speech conversion

The task of video-to-speech aims to translate silent video of lip moveme...

0 Dan Oneata, et al. ∙

research

∙ 01/14/2021

An evaluation of word-level confidence estimation for end-to-end automatic speech recognition

Quantifying the confidence (or conversely the uncertainty) of a predicti...

0 Dan Oneata, et al. ∙

research

∙ 10/27/2019

The Quo Vadis submission at Traffic4cast 2019

We describe the submission of the Quo Vadis team to the Traffic4cast com...

0 Dan Oneata, et al. ∙

research

∙ 07/02/2019

Kite: Automatic speech recognition for unmanned aerial vehicles

This paper addresses the problem of building a speech recognition system...

0 Dan Oneata, et al. ∙

research

∙ 04/21/2015

A robust and efficient video representation for action recognition

This paper introduces a state-of-the-art video representation and applie...

0 Heng Wang, et al. ∙

Dan Oneata

Featured Co-authors

Sign in with Google

Consider DeepAI Pro