Benjamin Elizalde

research

∙ 09/14/2023

Training Audio Captioning Models without Audio

Automated Audio Captioning (AAC) is the task of generating natural langu...

0 Soham Deshmukh, et al. ∙

research

∙ 09/11/2023

Natural Language Supervision for General-Purpose Audio Representations

Audio-Language models jointly learn multimodal text and audio representa...

0 Benjamin Elizalde, et al. ∙

research

∙ 05/19/2023

Pengi: An Audio Language Model for Audio Tasks

In the domain of audio processing, Transfer Learning has facilitated the...

0 Soham Deshmukh, et al. ∙

research

∙ 02/20/2023

Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session

Machine Listening, as usually formalized, attempts to perform a task tha...

0 Laurie M. Heller, et al. ∙

research

∙ 11/14/2022

Describing emotions with acoustic property prompts for speech emotion recognition

Emotions lie on a broad continuum and treating emotions as a discrete nu...

0 Hira Dhamyal, et al. ∙

research

∙ 09/28/2022

Audio Retrieval with WavText5K and CLAP Training

Audio-Text retrieval takes a natural language query to retrieve relevant...

0 Soham Deshmukh, et al. ∙

research

∙ 06/09/2022

CLAP: Learning Audio Concepts From Natural Language Supervision

Mainstream Audio Analytics models are trained to learn under the paradig...

0 Benjamin Elizalde, et al. ∙

research

∙ 05/22/2021

COVID-19 Detection Using Recorded Coughs in the 2021 DiCOVA Challenge

COVID-19 has resulted in over 100 million infections and caused worldwid...

0 Benjamin Elizalde, et al. ∙

research

∙ 04/26/2021

Identifying Actions for Sound Event Classification

In Psychology, actions are paramount for humans to perceive and separate...

0 Benjamin Elizalde, et al. ∙

research

∙ 02/20/2020

Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix

Realistic recordings of soundscapes often have multiple sound events co-...

0 Jianyu Fan, et al. ∙

research

∙ 01/17/2018

NELS - Never-Ending Learner of Sounds

Sounds are essential to how humans perceive and interact with the world ...

0 Benjamin Elizalde, et al. ∙

research

∙ 01/08/2018

DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features

Acoustic scene recordings are represented by different types of handcraf...

0 Abelino Jimenez, et al. ∙

research

∙ 11/02/2017

Framework for evaluation of sound event detection in web videos

The largest source of sound events is web videos. Most videos lack sound...

0 Rohan Badlani, et al. ∙

research

∙ 10/30/2017

Content-based Representations of audio using Siamese neural networks

In this paper, we focus on the problem of content-based retrieval for au...

0 Pranay Manocha, et al. ∙

research

∙ 10/11/2017

Audio Concept Classification with Hierarchical Deep Neural Networks

Audio-based multimedia retrieval tasks may identify semantic information...

0 Mirco Ravanelli, et al. ∙

research

∙ 07/22/2016

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording

In this paper we present our work on Task 1 Acoustic Scene Classi- ficat...

0 Benjamin Elizalde, et al. ∙

research

∙ 07/13/2016

AudioPairBank: Towards A Large-Scale Tag-Pair-Based Audio Content Analysis

Recently, sound recognition has been used to identify sounds, such as ca...

0 Sebastian Sager, et al. ∙

research

∙ 07/12/2016

City-Identification of Flickr Videos Using Semantic Acoustic Features

City-identification of videos aims to determine the likelihood of a vide...

0 Benjamin Elizalde, et al. ∙

research

∙ 03/13/2015

The YLI-MED Corpus: Characteristics, Procedures, and Plans

The YLI Multimedia Event Detection corpus is a public-domain index of vi...

0 Julia Bernd, et al. ∙

Benjamin Elizalde

Featured Co-authors

Sign in with Google

Consider DeepAI Pro