A regressive text-to-speech (TTS) system uses an attention mechanism to g...
Conversational text-to-speech (TTS) aims to synthesize speech with prope...
Knowledge tracing (KT) serves as a primary part of intelligent education...
Audio-driven talking head synthesis is a challenging task that attracts ...
Black-box attacks can generate adversarial examples without accessing th...
In recent years, neural-network-based methods for multi-speaker text-to-...
The success of DNNs has driven the extensive applications of person re-i...
Speech emotion recognition is an important aspect of human-computer inte...
Multitask learning (MTL) aims to learn multiple tasks simultaneously thr...
Recent years have witnessed dramatic progress in neural machine translati...
Speech emotion recognition is an important aspect of human-computer inte...
Automatic emotion recognition is a challenging task. In this paper, we p...
Domain generalization aims to apply knowledge gained from multiple label...
Speech emotion recognition is an important task in human-machine interac...
Recent successes in learning-based image classification, however, heavil...
The past decade has witnessed the rapid development of feature represent...
This paper focuses on two key problems for audio-visual emotion recognit...
This work investigates how the traditional image classification pipeline...
The Decision-Making Trial and Evaluation Laboratory (DEMATEL) method is wide...