Marvin Lavechin

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Xin Wang
382 publications
Emmanuel Dupoux
61 publications
Jun Du
58 publications
Yossi Adi
58 publications
Lei Sun
54 publications
Najim Dehak
49 publications
Kong Aik Lee
42 publications
Jesús Villalba
33 publications
Grzegorz Chrupała
24 publications
Chen Yu
22 publications
Okko Räsänen
20 publications

research

∙ 06/02/2023

BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models

Self-supervised techniques for learning speech representations have been...

0 Marvin Lavechin, et al. ∙

research

∙ 02/23/2023

ProsAudit, a prosodic benchmark for self-supervised speech models

We present ProsAudit, a benchmark in English to assess structural prosod...

0 Maureen de Seyssel, et al. ∙

research

∙ 10/24/2022

Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation

Most automatic speech processing systems are sensitive to the acoustic e...

0 Marvin Lavechin, et al. ∙

research

∙ 03/30/2022

Probing phoneme, language and speaker information in unsupervised speech representations

Unsupervised models of representations based on Contrastive Predictive C...

0 Maureen de Seyssel, et al. ∙

research

∙ 07/14/2021

ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language Modelling track, 2021 edition

We present the visually-grounded language modelling track that was intro...

4 Afra Alishahi, et al. ∙

research

∙ 12/02/2019

Speaker detection in the wild: Lessons learned from JSALT 2019

This paper presents the problems and solutions addressed at the JSALT wo...

0 Paola García, et al. ∙

research

∙ 11/04/2019

pyannote.audio: neural building blocks for speaker diarization

We introduce pyannote.audio, an open-source toolkit written in Python fo...

0 Hervé Bredin, et al. ∙

Success!

An error occurred

Marvin Lavechin

Featured Co-authors

BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models

ProsAudit, a prosodic benchmark for self-supervised speech models

Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation

Probing phoneme, language and speaker information in unsupervised speech representations

ZR-2021VG: Zero-Resource Speech Challenge, Visually-Grounded Language Modelling track, 2021 edition

Speaker detection in the wild: Lessons learned from JSALT 2019

pyannote.audio: neural building blocks for speaker diarization

Sign in with Google

Consider DeepAI Pro