In this paper, we introduce Jointist, an instrument-aware multi-instrume...
Text-to-speech (TTS) models have achieved remarkable naturalness in rece...
Accent plays a significant role in speech communication, influencing und...
In this paper we propose a novel generative approach, DiffRoll, to tackl...
Bitcoin, with its ever-growing popularity, has demonstrated extreme pric...
In this paper we explore the possibility of maximizing the information r...
What audio embedding approach generalizes best to a wide range of downst...
We present a novel music generation framework for music infilling, with ...
The field of automatic music composition has seen great progress in rece...
Most of the current supervised automatic music transcription (AMT) model...
Intelligent systems are transforming the world, as well as our healthcar...
The field of automatic music composition has seen great progress in the ...
Recent advances in automatic music transcription (AMT) have achieved hig...
Underwater environments create a challenging channel for communications....
In this work, we propose different variants of the self-attention based ...
Most of the state-of-the-art automatic music transcription (AMT) models ...
Billions of USD are invested in new artists and songs by the music indus...
Many of the music generation systems based on neural networks are fully ...
In this paper we present a new dataset, with musical excerpts from the th...
High-level musical qualities (such as emotion) are often abstract, subje...
Generating an image from a provided descriptive text is quite a challeng...
We present a controllable neural audio synthesizer based on Gaussian Mix...
Information on liquid jet stream flow is crucial in many real world appl...
This paper thoroughly analyses the effect of different input representat...
In this paper, we adapt triplet neural networks (TNNs) to a regression t...
Converting time domain waveforms to frequency domain spectrograms is typ...
We propose a flexible framework that deals with both singer conversion a...
We present a Python library, called Midi Miner, that can calculate tonal...
We present an approach to tackle the speaker recognition problem using T...
The goal of this study is to develop and analyze multimodal models for p...
Shallow water environments create a challenging channel for communicatio...
This paper presents a novel game prototype that uses music and motion de...
In this paper, we learn disentangled representations of timbre and pitch...
Automatic speaker verification, like every other biometric system, is vu...
Record companies invest billions of dollars in new talent around the glo...
Automatic music generation systems have gained in popularity and sophist...
Digital advances have transformed the face of automatic music generation...
Separating a singing voice from its music accompaniment remains an impor...
We explore the potential of a popular distributional semantics vector sp...
We present a semantic vector space model for capturing complex polyphoni...
Proceedings of the First International Workshop on Deep Learning and Mus...