Odette Scharenborg

research

∙ 09/15/2023

The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

Previous Multimodal Information based Speech Processing (MISP) challenge...

0 Shilong Wu, et al. ∙

research

∙ 07/05/2023

Using Data Augmentations and VTLN to Reduce Bias in Dutch End-to-End Speech Recognition Systems

Speech technology has improved greatly for norm speakers, i.e., adult na...

0 Tanvina Patel, et al. ∙

research

∙ 03/11/2023

The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition

The Multi-modal Information based Speech Processing (MISP) challenge aim...

0 Zhe Wang, et al. ∙

research

∙ 06/24/2022

Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models

In this work, we analyzed and compared speech representations extracted ...

0 Hang Ji, et al. ∙

research

∙ 03/31/2022

Manipulation of oral cancer speech using neural articulatory synthesis

We present an articulatory synthesis framework for the synthesis and man...

0 Bence Mark Halpern, et al. ∙

research

∙ 03/14/2022

Modelling word learning and recognition using visually grounded speech

Background: Computational models of speech recognition often assume that...

0 Danny Merkx, et al. ∙

research

∙ 01/26/2022

Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

The high cost of data acquisition makes Automatic Speech Recognition (AS...

6 Piotr Żelasko, et al. ∙

research

∙ 01/13/2022

The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition

In this paper, we investigate several existing and a new state-of-the-ar...

0 Luke Prananta, et al. ∙

research

∙ 10/15/2021

Towards Identity Preserving Normal to Dysarthric Voice Conversion

We present a voice conversion framework that converts normal speech into...

0 Wen-Chin Huang, et al. ∙

research

∙ 07/01/2021

An Objective Evaluation Framework for Pathological Speech Synthesis

The development of pathological speech systems is currently hindered by ...

0 Bence Mark Halpern, et al. ∙

research

∙ 06/15/2021

Pathological voice adaptation with autoencoder-based voice conversion

In this paper, we propose a new approach to pathological speech synthesi...

0 Marc Illa, et al. ∙

research

∙ 04/02/2021

Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation

This paper tackles automatically discovering phone-like acoustic units (...

0 Siyuan Feng, et al. ∙

research

∙ 03/28/2021

Quantifying Bias in Automatic Speech Recognition

Automatic speech recognition (ASR) systems promise to deliver objective ...

0 Siyuan Feng, et al. ∙

research

∙ 12/17/2020

The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks

This study addresses unsupervised subword modeling, i.e., learning acous...

8 Siyuan Feng, et al. ∙

research

∙ 11/12/2020

The CUHK-TUDELFT System for The SLT 2021 Children Speech Recognition Challenge

This technical report describes our submission to the 2021 SLT Children ...

0 Si-Ioi Ng, et al. ∙

research

∙ 10/23/2020

Show and Speak: Directly Synthesize Spoken Description of Images

This paper proposes a new model, referred to as the show and speak (SAS)...

6 Xinsheng Wang, et al. ∙

research

∙ 10/22/2020

How Phonotactics Affect Multilingual and Zero-shot ASR Performance

The idea of combining multiple languages' recordings to train a single a...

0 Siyuan Feng, et al. ∙

research

∙ 07/31/2020

Evaluating Automatically Generated Phoneme Captions for Images

Image2Speech is the relatively new task of generating a spoken descripti...

3 Justin van der Hout, et al. ∙

research

∙ 07/28/2020

Detecting and analysing spontaneous oral cancer speech in the wild

Oral cancer speech is a disease which impacts more than half a million p...

0 Bence Mark Halpern, et al. ∙

research

∙ 07/25/2020

Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling

This study addresses unsupervised subword modeling, i.e., learning featu...

0 Siyuan Feng, et al. ∙

research

∙ 05/31/2020

Learning to Recognise Words using Visually Grounded Speech

We investigated word recognition in a Visually Grounded Speech model. Th...

0 Sebastiaan Scholten, et al. ∙

research

∙ 05/16/2020

That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages

Only a handful of the world's languages are abundant with the resources ...

0 Piotr Żelasko, et al. ∙

research

∙ 05/14/2020

S2IGAN: Speech-to-Image Generation via Adversarial Learning

An estimated half of the world's languages do not have a written form, m...

0 Xinsheng Wang, et al. ∙

research

∙ 03/13/2018

Investigating the Effect of Music and Lyrics on Spoken-Word Recognition

Background music in social interaction settings can hinder conversation....

0 Odette Scharenborg, et al. ∙

research

∙ 02/16/2018

Bayesian Models for Unit Discovery on a Very Low Resource Language

Developing speech technologies for low-resource languages has become a v...

0 Lucas Ondel, et al. ∙

research

∙ 02/14/2018

Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop

We summarize the accomplishments of a multi-disciplinary workshop explor...

0 Odette Scharenborg, et al. ∙

Odette Scharenborg

Featured Co-authors

Sign in with Google

Consider DeepAI Pro