b'Xuedong Huang'

research

∙ 05/24/2023

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation

Joint speech-language training is challenging due to the large demand fo...

0 Chenyang Le, et al. ∙

research

∙ 05/23/2023

i-Code Studio: A Configurable and Composable Framework for Integrative AI

Artificial General Intelligence (AGI) requires comprehensive understandi...

0 Yuwei Fang, et al. ∙

research

∙ 05/21/2023

i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data

The convergence of text, visual, and audio data is a key step towards hu...

0 ZiYi Yang, et al. ∙

research

∙ 05/03/2022

i-Code: An Integrative and Composable Multimodal Learning Framework

Human intelligence is multimodal; we integrate visual, linguistic, and a...

1 ZiYi Yang, et al. ∙

research

∙ 12/06/2021

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention

Most of today's AI systems focus on using self-attention mechanisms and ...

3 Yichong Xu, et al. ∙

research

∙ 11/22/2021

Florence: A New Foundation Model for Computer Vision

Automated visual understanding of our diverse and open world demands com...

4 Lu Yuan, et al. ∙

research

∙ 10/20/2021

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

With the recent surge of video conferencing tools usage, providing high-...

0 Hassan Taherian, et al. ∙

research

∙ 10/18/2021

Personalized Speech Enhancement: New Models and Comprehensive Evaluation

Personalized speech enhancement (PSE) models utilize additional cues, su...

0 Sefik Emre Eskimez, et al. ∙

research

∙ 01/19/2021

UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data

In this paper, we propose a unified pre-training approach called UniSpee...

21 Chengyi Wang, et al. ∙

research

∙ 12/09/2020

Fusing Context Into Knowledge Graph for Commonsense Reasoning

Commonsense reasoning requires a model to make presumptions about world ...

14 Yichong Xu, et al. ∙

research

∙ 10/18/2020

Mixed-Lingual Pre-training for Cross-lingual Summarization

Cross-lingual Summarization (CLS) aims at producing a summary in the tar...

7 Ruochen Xu, et al. ∙

research

∙ 06/27/2020

Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization

Neural models have become successful at producing abstractive summaries ...

0 Beliz Gunel, et al. ∙

research

∙ 04/04/2020

End-to-End Abstractive Summarization for Meetings

With the abundance of automatic meeting transcripts, meeting summarizati...

1 Chenguang Zhu, et al. ∙

research

∙ 03/19/2020

Boosting Factual Correctness of Abstractive Summarization

A commonly observed problem with abstractive summarization is the distor...

0 Chenguang Zhu, et al. ∙

research

∙ 03/19/2020

Boosting Factual Correctness of Abstractive Summarization with Knowledge Graph

A commonly observed problem with abstractive summarization is the distor...

0 Chenguang Zhu, et al. ∙

research

∙ 01/03/2020

TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising

Text summarization aims to extract essential information from a piece of...

0 ZiYi Yang, et al. ∙

research

∙ 12/25/2019

Make Lead Bias in Your Favor: A Simple and Effective Method for News Summarization

Lead bias is a common phenomenon in news summarization, where early part...

0 Chenguang Zhu, et al. ∙

research

∙ 12/10/2019

Advances in Online Audio-Visual Meeting Transcription

This paper describes a system that generates speaker-annotated transcrip...

15 Takuya Yoshioka, et al. ∙

research

∙ 09/26/2019

SIM: A Slot-Independent Neural Model for Dialogue State Tracking

Dialogue state tracking is an important component in task-oriented dialo...

0 Chenguang Zhu, et al. ∙

research

∙ 05/03/2019

Meeting Transcription Using Virtual Microphone Arrays

We describe a system that generates speaker-annotated transcripts of mee...

0 Takuya Yoshioka, et al. ∙

research

∙ 12/10/2018

SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering

Conversational question answering (CQA) is a novel QA task that requires...

0 Chenguang Zhu, et al. ∙

research

∙ 03/15/2018

Achieving Human Parity on Automatic Chinese to English News Translation

Machine translation has made rapid advances in recent years. Millions of...

0 Hany Hassan, et al. ∙

Xuedong Huang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro