Employing additional multimodal information to improve automatic speech
...
Large language models (LLMs) have demonstrated remarkable language abili...
Medical Slot Filling (MSF) task aims to convert medical queries into
str...
The spiking neural network (SNN) using leaky-integrated-and-fire (LIF)
n...
Large-scale pre-trained language models (PLMs) with powerful language
mo...
In the past few years, the emergence of pre-training models has brought
...
Nowadays, most methods in end-to-end contextual speech recognition bias ...
End-to-end (E2E) models have achieved promising results on multiple spee...