The AI community has made significant strides in developing powerful
fou...
Automated audio captioning (AAC) is an important cross-modality translat...
Automated audio captioning aims at generating natural language descripti...
Controlling chatbot utterance generation with multiple attributes such a...
Previous audio generation mainly focuses on specified sound classes such...
Compared with ample visual-text pre-training research, few works explore...
This paper presents an ontology-aware pretrained language model (OPAL) f...
Building unified conversational agents has been a long-standing goal of ...
In a depression-diagnosis-directed clinical session, doctors initiate a
...
Mental disease detection (MDD) from social media has suffered from poor
...
Depression is a prominent health challenge to the world, and early risk
...
Automated audio captioning, a task that mimics human perception as well ...
Automatic depression detection has attracted increasing amount of attent...
Audio-text retrieval based on natural language descriptions is a challen...
Automated audio captioning aims at generating textual descriptions for a...
Audio tagging aims at predicting sound events occurred in a recording.
T...
Recently, Text-to-SQL for multi-turn dialogue has attracted great intere...
Voice activity detection is an essential pre-processing component for
sp...
Automated Audio Captioning is a cross-modal task, generating natural lan...
Automated audio captioning (AAC) aims at generating summarizing descript...
Sound event detection (SED) is the task of tagging the absence or presen...
How to visually localize multiple sound sources in unconstrained videos ...
Traditional supervised voice activity detection (VAD) methods work well ...
Traditional voice activity detection (VAD) methods work well in clean an...
Depression detection research has increased over the last few decades as...
Captioning has attracted much attention in image and video understanding...
Recent advances in automatic depression detection mostly derive from mod...
Increasing amount of research has shed light on machine perception of au...