Dorien Herremans

research

∙ 02/01/2023

Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training

In this paper, we introduce Jointist, an instrument-aware multi-instrume...

0 Kin Wai Cheuk, et al. ∙

research

∙ 11/14/2022

SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech

Text-to-speech (TTS) models have achieved remarkable naturalness in rece...

0 Perry Lam, et al. ∙

research

∙ 11/07/2022

Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder

Accent plays a significant role in speech communication, influencing und...

0 Jan Melechovsky, et al. ∙

research

∙ 10/11/2022

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability

In this paper we propose a novel generative approach, DiffRoll, to tackl...

16 Kin Wai Cheuk, et al. ∙

research

∙ 06/22/2022

Jointist: Joint Learning for Multi-instrument Transcription and Its Applications

In this paper, we introduce Jointist, an instrument-aware multi-instrume...

5 Kin Wai Cheuk, et al. ∙

research

∙ 05/30/2022

A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin

Bitcoin, with its ever-growing popularity, has demonstrated extreme pric...

0 Yanzhao Zou, et al. ∙

research

∙ 04/25/2022

Understanding Audio Features via Trainable Basis Functions

In this paper we explore the possibility of maximizing the information r...

0 Kwan Yee Heung, et al. ∙

research

∙ 03/06/2022

HEAR 2021: Holistic Evaluation of Audio Representations

What audio embedding approach generalizes best to a wide range of downst...

17 Joseph Turian, et al. ∙

research

∙ 02/11/2022

MusIAC: An extensible generative framework for Music Infilling Applications with multi-level Control

We present a novel music generation framework for music infilling, with ...

0 Rui Guo, et al. ∙

research

∙ 02/09/2022

Conditional Drums Generation using Compound Word Representations

The field of automatic music composition has seen great progress in rece...

0 Dimos Makris, et al. ∙

research

∙ 07/11/2021

ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data

Most of the current supervised automatic music transcription (AMT) model...

0 Kin Wai Cheuk, et al. ∙

research

∙ 06/23/2021

Deep Neural Network Based Respiratory Pathology Classification Using Cough Sounds

Intelligent systems are transforming the world, as well as our healthcar...

14 Balamurali B. T., et al. ∙

research

∙ 04/27/2021

Generating Lead Sheets with Affect: A Novel Conditional seq2seq Framework

The field of automatic music composition has seen great progress in the ...

0 Dimos Makris, et al. ∙

research

∙ 04/14/2021

Revisiting the Onsets and Frames Model with Additive Attention

Recent advances in automatic music transcription (AMT) have achieved hig...

0 Kin Wai Cheuk, et al. ∙

research

∙ 02/26/2021

Underwater Acoustic Communication Receiver Using Deep Belief Network

Underwater environments create a challenging channel for communications....

0 Abigail Lee-Leon, et al. ∙

research

∙ 10/21/2020

AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies

In this work, we propose different variants of the self-attention based ...

0 Ha Thi Phuong Thao, et al. ∙

research

∙ 10/20/2020

The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy

Most of the state-of-the-art automatic music transcription (AMT) models ...

4 Kin Wai Cheuk, et al. ∙

research

∙ 10/16/2020

Hit Song Prediction Based on Early Adopter Data and Audio Features

Billions of USD are invested in new artists and songs by the music indus...

0 Dorien Herremans, et al. ∙

research

∙ 10/13/2020

A variational autoencoder for music generation controlled by tonal tension

Many of the music generation systems based on neural networks are fully ...

0 Rui Guo, et al. ∙

research

∙ 09/09/2020

A dataset and classification model for Malay, Hindi, Tamil and Chinese music

In this paper we present a new dataset, with musical excerpts from the th...

0 Fajilatun Nahar, et al. ∙

research

∙ 07/29/2020

Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling

High-level musical qualities (such as emotion) are often abstract, subje...

0 Hao Hao Tan, et al. ∙

research

∙ 07/02/2020

PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding

Generating an image from a provided descriptive text is quite a challeng...

0 Kanish Garg, et al. ∙

research

∙ 06/16/2020

Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance

We present a controllable neural audio synthesizer based on Gaussian Mix...

0 Hao Hao Tan, et al. ∙

research

∙ 06/16/2020

Acoustic prediction of flowrate: varying liquid jet stream onto a free surface

Information on liquid jet stream flow is crucial in many real world appl...

0 Balamurali B. T., et al. ∙

research

∙ 01/25/2020

The impact of Audio input representations on neural network based music transcription

This paper thoroughly analyses the effect of different input representat...

0 Kin Wai Cheuk, et al. ∙

research

∙ 01/25/2020

Regression-based music emotion prediction using triplet neural networks

In this paper, we adapt triplet neural networks (TNNs) to a regression t...

0 Kin Wai Cheuk, et al. ∙

research

∙ 12/27/2019

nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks

Converting time domain waveforms to frequency domain spectrograms is typ...

0 Kin Wai Cheuk, et al. ∙

research

∙ 12/03/2019

Singing Voice Conversion with Disentangled Representations of Singer and Vocal Technique Using Variational Autoencoders

We propose a flexible framework that deals with both singer conversion a...

0 Yin-Jyun Luo, et al. ∙

research

∙ 10/03/2019

Midi Miner – A Python library for tonal tension and track classification

We present a Python library, called Midi Miner, that can calculate tonal...

0 Rui Guo, et al. ∙

research

∙ 10/01/2019

Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks

We present an approach to tackle the speaker recognition problem using T...

0 Kin Wai Cheuk, et al. ∙

research

∙ 09/16/2019

Multimodal Deep Models for Predicting Affective Responses Evoked by Movies

The goal of this study is to develop and analyze multimodal models for p...

0 Ha Thi Phuong Thao, et al. ∙

research

∙ 09/05/2019

Doppler Invariant Demodulation for Shallow Water Acoustic Communications Using Deep Belief Networks

Shallow water environments create a challenging channel for communicatio...

0 Abigail Lee-Leon, et al. ∙

research

∙ 06/25/2019

A novel music-based game with motion capture to support cognitive and motor function in the elderly

This paper presents a novel game prototype that uses music and motion de...

0 Kat Agres, et al. ∙

research

∙ 06/19/2019

Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders

In this paper, we learn disentangled representations of timbre and pitch...

2 Yin-Jyun Luo, et al. ∙

research

∙ 05/28/2019

Towards robust audio spoofing detection: a detailed comparison of traditional and learned features

Automatic speaker verification, like every other biometric system, is vu...

0 Balamurali BT, et al. ∙

research

∙ 05/17/2019

Dance Hit Song Prediction

Record companies invest billions of dollars in new talent around the glo...

0 Dorien Herremans, et al. ∙

research

∙ 12/12/2018

MorpheuS: generating structured music with constrained patterns and tension

Automatic music generation systems have gained in popularity and sophist...

1 Dorien Herremans, et al. ∙

research

∙ 12/11/2018

A Functional Taxonomy of Music Generation Systems

Digital advances have transformed the face of automatic music generation...

0 Dorien Herremans, et al. ∙

research

∙ 12/04/2018

Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross Entropy

Separating a singing voice from its music accompaniment remains an impor...

0 Kin Wah Edward Lin, et al. ∙

research

∙ 11/29/2018

From Context to Concept: Exploring Semantic Relationships in Music with Word2Vec

We explore the potential of a popular distributional semantics vector sp...

0 Ching-Hua Chuan, et al. ∙

research

∙ 06/28/2017

Modeling Musical Context with Word2vec

We present a semantic vector space model for capturing complex polyphoni...

0 Dorien Herremans, et al. ∙

research

∙ 06/27/2017

Proceedings of the First International Workshop on Deep Learning and Music

Proceedings of the First International Workshop on Deep Learning and Mus...

0 Dorien Herremans, et al. ∙
