We release Code Llama, a family of large language models for code based ...
Recent work has shown that it is possible to resynthesize high-quality s...
We tackle the task of conditional music generation. We introduce MusicGe...
Speech language models (SpeechLMs) process and generate acoustic data on...
Speech to text models tend to be trained and evaluated against a single
...
In this work, we study the task of Audio Language Modeling, in which we ...
We introduce a state-of-the-art real-time, high-fidelity, audio codec
le...
Most automatic speech processing systems are sensitive to the acoustic
e...
Self-supervised representations have been extensively studied for
discri...
We introduce dGSLM, the first "textless" model able to generate audio sa...
Textless spoken language processing research aims to extend the applicab...
Speech emotion conversion is the task of modifying the perceived emotion...
Popular ASR benchmarks such as Librispeech and Switchboard are limited i...
Speech pre-training has primarily demonstrated efficacy on classificatio...
We propose using self-supervised discrete representations for the task o...
Generative spoken language modeling involves learning jointly the acoust...