Martin Radfar

research

∙ 05/04/2023

End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders

It is challenging to extract semantic meanings directly from audio signa...

0 Jixuan Wang, et al. ∙

research

∙ 10/17/2022

Sub-8-bit quantization for on-device speech recognition: a regularization-free approach

For on-device automatic speech recognition (ASR), quantization aware tra...

0 Kai Zhen, et al. ∙

research

∙ 09/29/2022

ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition

The recurrent neural network transducer (RNN-T) is a prominent streaming...

0 Martin Radfar, et al. ∙

research

∙ 07/05/2022

Compute Cost Amortized Transformer for Streaming ASR

We present a streaming, Transformer-based end-to-end automatic speech re...

0 Yi Xie, et al. ∙

research

∙ 05/11/2022

A neural prosody encoder for end-to-end dialogue act classification

Dialogue act classification (DAC) is a critical task for spoken language...

0 Kai Wei, et al. ∙

research

∙ 04/01/2022

Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding

End-to-end Spoken Language Understanding (E2E SLU) has attracted increas...

0 Xuandi Fu, et al. ∙

research

∙ 11/05/2021

Context-Aware Transformer Transducer for Speech Recognition

End-to-end (E2E) automatic speech recognition (ASR) systems often have d...

0 Feng-Ju Chang, et al. ∙

research

∙ 10/31/2021

Speech Emotion Recognition Using Quaternion Convolutional Neural Networks

Although speech recognition has become a widespread technology, inferrin...

0 Aneesh Muppidi, et al. ∙

research

∙ 10/31/2021

FANS: Fusing ASR and NLU for on-device SLU

Spoken language understanding (SLU) systems translate voice input comman...

0 Martin Radfar, et al. ∙

research

∙ 08/30/2021

Multi-Channel Transformer Transducer for Speech Recognition

Multi-channel inputs offer several advantages over single-channel, to im...

0 Feng-Ju Chang, et al. ∙

research

∙ 08/03/2021

The Performance Evaluation of Attention-Based Neural ASR under Mixed Speech Input

In order to evaluate the performance of the attention based neural ASR u...

0 Bradley He, et al. ∙

research

∙ 02/08/2021

End-to-End Multi-Channel Transformer for Speech Recognition

Transformers are powerful neural architectures that allow integrating di...

0 Feng-Ju Chang, et al. ∙

research

∙ 12/21/2020

Encoding Syntactic Knowledge in Transformer Encoder for Intent Detection and Slot Filling

We propose a novel Transformer encoder-based architecture with syntactic...

0 Jixuan Wang, et al. ∙

research

∙ 11/18/2020

Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding

End-to-end (E2E) spoken language understanding (SLU) systems can infer t...

7 Bhuvan Agrawal, et al. ∙

research

∙ 08/12/2020

End-to-End Neural Transformer Based Spoken Language Understanding

Spoken language understanding (SLU) refers to the process of inferring t...

0 Martin Radfar, et al. ∙
