Large self-supervised pre-trained speech models require computationally
...
Despite recent strides made in Speech Separation, most models are traine...
Zero-shot video recognition (ZSVR) is a task that aims to recognize vide...
In this paper, we propose ACA-Net, a lightweight, global context-aware
s...
Most of the existing neural-based models for keyword spotting (KWS) in s...
The vulnerability of Deep Neural Networks (DNNs) to adversarial examples...
Learning on a massive amount of speech corpus leads to the recent succes...
Existing self-supervised pre-trained speech models have offered an effec...
Monaural speech enhancement has been widely studied using real networks ...
Transformer based models have provided significant performance improveme...
Noise robustness in keyword spotting remains a challenge as many models ...
Convolutional recurrent networks (CRN) integrating a convolutional
encod...
Learning individual-level treatment effect is a fundamental problem in c...
The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Grand
Ch...
Digital advertising is a critical part of many e-commerce platforms such...
Transformer models have been used in automatic speech recognition (ASR)
...
Recent development of speech signal processing, such as speech recogniti...
Echo and noise suppression is an integral part of a full-duplex communic...
Deep complex U-Net structure and convolutional recurrent network (CRN)
s...
Cross-lingual voice conversion (VC) is an important and challenging prob...
Recent state-of-the-art neural text-to-speech (TTS) synthesis models hav...
In this work, we study leveraging extra text data to improve low-resourc...
The attention-based end-to-end (E2E) automatic speech recognition (ASR)
...
The lack of code-switch training data is one of the major concerns in th...
We propose a new self-supervised approach to image feature learning from...
We propose to learn acoustic word embeddings with temporal context for
q...
This article describes the systems jointly submitted by Institute for
In...
A new class of distances appropriate for measuring similarity relations
...