Self-supervised techniques for learning speech representations have been...
We present ProsAudit, a benchmark in English to assess structural prosod...
Most automatic speech processing systems are sensitive to the acoustic
e...
Unsupervised models of representations based on Contrastive Predictive C...
We present the visually-grounded language modelling track that was intro...
This paper presents the problems and solutions addressed at the JSALT
wo...
We introduce pyannote.audio, an open-source toolkit written in Python fo...