Thomas Hueber

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Laurent Besacier
68 publications
Xavier Alameda-Pineda
54 publications
Laurent Girin
33 publications
Simon Leglaive
18 publications
Xiaoyu Bie
6 publications
Denis Beautemps
4 publications
Marc-Antoine Georges
3 publications
Jean-Luc Schwartz
3 publications
Brooke Stephenson
3 publications
Sanjana Sankar
2 publications
Olivier Perrotin
2 publications

research

∙ 06/14/2023

Investigating the dynamics of hand and lips in French Cued Speech using attention mechanisms and CTC-based decoding

Hard of hearing or profoundly deaf people make use of cued speech (CS) a...

0 Sanjana Sankar, et al. ∙

research

∙ 07/04/2022

BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model

Several recent studies have tested the use of transformer language model...

3 Brooke Stephenson, et al. ∙

research

∙ 06/17/2022

Self-supervised speech unit discovery from articulatory and acoustic features using VQ-VAE

The human perception system is often assumed to recruit motor knowledge ...

0 Marc-Antoine Georges, et al. ∙

research

∙ 04/11/2022

Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding

This paper proposes a simple and effective approach for automatic recogn...

0 Sanjana Sankar, et al. ∙

research

∙ 04/05/2022

Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation

We propose a computational model of speech production combining a pre-tr...

0 Marc-Antoine Georges, et al. ∙

research

∙ 06/11/2021

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

The Variational Autoencoder (VAE) is a powerful deep generative model th...

0 Xiaoyu Bie, et al. ∙

research

∙ 04/07/2021

Learning robust speech representation with an articulatory-regularized variational autoencoder

It is increasingly considered that human speech perception and productio...

0 Marc-Antoine Georges, et al. ∙

research

∙ 02/19/2021

Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input

The prosody of a spoken word is determined by its surrounding context. I...

0 Brooke Stephenson, et al. ∙

research

∙ 09/04/2020

What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS

In incremental text to speech synthesis (iTTS), the synthesizer produces...

0 Brooke Stephenson, et al. ∙

research

∙ 08/28/2020

Dynamical Variational Autoencoders: A Comprehensive Review

The Variational Autoencoder (VAE) is a powerful deep generative model th...

0 Laurent Girin, et al. ∙

research

∙ 06/11/2018

Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models

This study investigates the use of non-linear unsupervised dimensionalit...

0 Fanny Roche, et al. ∙

Success!

An error occurred

Thomas Hueber

Featured Co-authors

Sign in with Google

Consider DeepAI Pro