Chanwoo Kim

research

∙ 08/16/2023

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

Text-to-Text Transfer Transformer (T5) has recently been considered for ...

0 Eunseop Yoon, et al. ∙

research

∙ 12/29/2022

Macro-block dropout for improved regularization in training end-to-end speech recognition models

This paper proposes a new regularization algorithm referred to as macro-...

0 Chanwoo Kim, et al. ∙

research

∙ 11/06/2022

An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space

With the recent developments in cross-lingual Text-to-Speech (TTS) syste...

0 Jihwan Lee, et al. ∙

research

∙ 10/20/2022

Transformer-based Global 3D Hand Pose Estimation in Two Hands Manipulating Objects Scenarios

This report describes our 1st place solution to ECCV 2022 challenge on H...

0 Hoseong Cho, et al. ∙

research

∙ 09/30/2022

Contrastive Corpus Attribution for Explaining Representations

Despite the widespread use of unsupervised models, very few methods are ...

0 Chris Lin, et al. ∙

research

∙ 06/10/2022

Learning to Estimate Shapley Values with Vision Transformers

Transformers have become a default architecture in computer vision, but ...

0 Ian Covert, et al. ∙

research

∙ 04/04/2022

Into-TTS : Intonation Template based Prosody Control System

Intonations take an important role in delivering the intention of the sp...

0 Jihwan Lee, et al. ∙

research

∙ 01/08/2022

Two-Pass End-to-End ASR Model Compression

Speech recognition on smart devices is challenging owing to the small me...

0 Nauman Dawalatabad, et al. ∙

research

∙ 11/19/2021

Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages

In this paper, we propose a three-stage training methodology to improve ...

0 Jiyeon Kim, et al. ∙

research

∙ 11/19/2021

A comparison of streaming models and data augmentation methods for robust speech recognition

In this paper, we present a comparative study on the robustness of two d...

0 Jiyeon Kim, et al. ∙

research

∙ 10/13/2021

Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems

Simultaneous Speech-to-text Translation (SimulST) systems translate sour...

0 Mohd Abbas Zaidi, et al. ∙

research

∙ 05/04/2021

Streaming end-to-end speech recognition with jointly trained neural feature enhancement

In this paper, we present a streaming end-to-end speech recognition mode...

0 Chanwoo Kim, et al. ∙

research

∙ 12/29/2020

Faster Re-translation Using Non-Autoregressive Model For Simultaneous Neural Machine Translation

Recently, simultaneous translation has gathered a lot of attention since...

0 Hyojung Han, et al. ∙

research

∙ 12/14/2020

A review of on-device fully neural end-to-end automatic speech recognition algorithms

In this paper, we review various end-to-end automatic speech recognition...

0 Chanwoo Kim, et al. ∙

research

∙ 02/15/2020

Small energy masking for improved neural network training for end-to-end speech recognition

In this paper, we present a Small Energy Masking (SEM) algorithm, which ...

0 Chanwoo Kim, et al. ∙

research

∙ 01/02/2020

Attention based on-device streaming speech recognition with large speech corpus

In this paper, we present a new on-device automatic speech recognition (...

0 Kwangyoun Kim, et al. ∙

research

∙ 12/28/2019

Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models

In this paper, we propose a refined multi-stage multi-task training stra...

0 Abhinav Garg, et al. ∙

research

∙ 12/22/2019

Power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition

In this paper, we describe the Maximum Uniformity of Distribution (MUD) ...

0 Chanwoo Kim, et al. ∙

research

∙ 12/22/2019

End-to-end training of a large vocabulary end-to-end speech recognition system

In this paper, we present an end-to-end training framework for building ...

7 Chanwoo Kim, et al. ∙

research

∙ 11/11/2019

Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning

End-to-end Speech Translation (ST) models have several advantages such a...

0 Sathish Indurthi, et al. ∙

research

∙ 12/09/2017

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models

In this paper, we describe how to efficiently implement an acoustic room...

0 Chanwoo Kim, et al. ∙
