Ondřej Klejch

research

∙ 06/03/2023

Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling

Acoustic word embeddings are typically created by training a pooling fun...

0 Ramon Sanabria, et al. ∙

research

∙ 05/25/2023

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition

In Speech Emotion Recognition (SER), textual data is often used alongsid...

0 Yuanchao Li, et al. ∙

research

∙ 03/31/2023

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR

English is the most widely spoken language in the world, used daily by m...

0 Ramon Sanabria, et al. ∙

research

∙ 11/29/2022

Evaluating and reducing the distance between synthetic and real speech distributions

While modern Text-to-Speech (TTS) systems can produce speech rated highl...

0 Christoph Minixhofer, et al. ∙

research

∙ 11/02/2022

Towards Zero-Shot Code-Switched Speech Recognition

In this work, we seek to build effective code-switched (CS) automatic sp...

0 Brian Yan, et al. ∙

research

∙ 12/15/2021

Mask-combine Decoding and Classification Approach for Punctuation Prediction with real-time Inference Constraints

In this work, we unify several existing decoding strategies for punctuat...

0 Christoph Minixhofer, et al. ∙

research

∙ 11/12/2021

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR

We present a method for cross-lingual training an ASR system using absol...

0 Ondřej Klejch, et al. ∙

research

∙ 08/14/2020

Adaptation Algorithms for Speech Recognition: An Overview

We present a structured overview of adaptation algorithms for neural net...

0 Peter Bell, et al. ∙

research

∙ 03/30/2020

European Language Grid: An Overview

With 24 official EU and many additional languages, multilingualism in Eu...

0 Georg Rehm, et al. ∙

research

∙ 10/23/2019

Speaker Adaptive Training using Model Agnostic Meta-Learning

Speaker adaptive training (SAT) of neural network acoustic models learns...

0 Ondřej Klejch, et al. ∙

research

∙ 09/30/2019

Acoustic Model Adaptation from Raw Waveforms with SincNet

Raw waveform acoustic modelling has recently gained interest due to neur...

0 Joachim Fainberg, et al. ∙

research

∙ 06/27/2019

Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models

Acoustic model adaptation to unseen test recordings aims to reduce the m...

0 Ondřej Klejch, et al. ∙

research

∙ 05/30/2019

Lattice-based lightly-supervised acoustic model training

In the broadcast domain there is an abundance of related text data and p...

0 Joachim Fainberg, et al. ∙

research

∙ 01/05/2019

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

Active speaker detection is an important component in video analysis alg...

6 Joseph Roth, et al. ∙

research

∙ 08/30/2018

Learning to adapt: a meta-learning approach for speaker adaptation

The performance of automatic speech recognition systems can be improved ...

0 Ondřej Klejch, et al. ∙

Ondřej Klejch

Featured Co-authors

Sign in with Google

Consider DeepAI Pro