Noticing the urgent need to provide tools for fast and user-friendly qua...
The BigCode community, an open-scientific collaboration working on the r...
As machine learning-enabled Text-to-Image (TTI) systems are becoming inc...
As language models grow ever larger, the need for large-scale high-quali...
We present Spacerini, a modular framework for seamless building and depl...
ROOTS is a 1.6TB multilingual text corpus developed for the training of ...
Open Artificial Intelligence (Open source AI) collaboratives offer alter...
The BigCode project is an open-scientific collaboration working on the r...
The BigScience Workshop was a value-driven initiative that spanned one a...
Neural retrieval models are often trained on (subsets of) the millions o...
In this work, we explore whether the recently demonstrated zero-shot abi...
This technical report documents our efforts in addressing the tasks set ...
Masked language models have recently been interpreted as energy-based se...