Lei He

research

∙ 09/20/2023

Orbital AI-based Autonomous Refuelling Solution

Cameras are rapidly becoming the choice for on-board sensors towards spa...

0 Duarte Rondao, et al. ∙

research

∙ 07/27/2023

FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene

It has long been an ill-posed problem to predict absolute depth maps fro...

0 Chengrui Wei, et al. ∙

research

∙ 06/26/2023

Progressive Energy-Based Cooperative Learning for Multi-Domain Image-to-Image Translation

This paper studies a novel energy-based cooperative learning framework f...

0 Weinan Song, et al. ∙

research

∙ 06/09/2023

Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding

On the one hand, the dehazing task is an ill-posed problem, which mea...

0 Jie Gui, et al. ∙

research

∙ 06/07/2023

4D Millimeter-Wave Radar in Autonomous Driving: A Survey

The 4D millimeter-wave (mmWave) radar, capable of measuring the range, a...

0 Zeyu Han, et al. ∙

research

∙ 04/08/2023

A Reinforcement Learning-assisted Genetic Programming Algorithm for Team Formation Problem Considering Person-Job Matching

An efficient team is essential for the company to successfully complete ...

0 Yangyang Guo, et al. ∙

research

∙ 03/21/2023

Oral-NeXF: 3D Oral Reconstruction with Neural X-ray Field from Panoramic Imaging

3D reconstruction of medical images from 2D images has increasingly beco...

0 Weinan Song, et al. ∙

research

∙ 03/06/2023

FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model

Neural text-to-speech (TTS) generally consists of cascaded architecture ...

0 Ruiqing Xue, et al. ∙

research

∙ 01/05/2023

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

We introduce a language modeling approach for text to speech synthesis (...

4 Chengyi Wang, et al. ∙

research

∙ 11/16/2022

AlignVE: Visual Entailment Recognition Based on Alignment Relations

Visual entailment (VE) is to recognize whether the semantics of a hypoth...

0 Biwei Cao, et al. ∙

research

∙ 10/31/2022

Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation

Direct speech-to-speech translation (S2ST) is an attractive research top...

0 Kun Wei, et al. ∙

research

∙ 07/14/2022

ConCL: Concept Contrastive Learning for Dense Prediction Pre-training in Pathology Images

Detecting and segmenting objects within whole slide images is essential in compu...

0 Jiawei Yang, et al. ∙

research

∙ 07/11/2022

DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders

Current text to speech (TTS) systems usually leverage a cascaded acousti...

0 Yanqing Liu, et al. ∙

research

∙ 07/05/2022

ReMix: A General and Efficient Framework for Multiple Instance Learning based Whole Slide Image Classification

Whole slide image (WSI) classification often relies on deep weakly super...

0 Jiawei Yang, et al. ∙

research

∙ 06/25/2022

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

Expressive speech synthesis, like audiobook synthesis, is still challeng...

0 Yihan Wu, et al. ∙

research

∙ 05/30/2022

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis

Binaural audio plays a significant role in constructing immersive augmen...

1 Yichong Leng, et al. ∙

research

∙ 05/09/2022

NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality

Text to speech (TTS) has made rapid progress in both academia and indust...

18 Xu Tan, et al. ∙

research

∙ 04/01/2022

AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios

Adaptive text to speech (TTS) can synthesize new voices in zero-shot sce...

4 Yihan Wu, et al. ∙

research

∙ 02/08/2022

InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training

Denoising diffusion probabilistic models (diffusion models for short) re...

7 Zehua Chen, et al. ∙

research

∙ 01/20/2022

Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training

In cross-lingual speech synthesis, the speech in various languages can b...

0 J. Yang, et al. ∙

research

∙ 10/28/2021

LF-YOLO: A Lighter and Faster YOLO for Weld Defect Detection of X-ray Image

X-ray image plays an important role in manufacturing for quality assuran...

0 Moyun Liu, et al. ∙

research

∙ 10/19/2021

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge

End-to-end TTS suffers from high data requirements as it is difficult fo...

0 Mutian He, et al. ∙

research

∙ 10/03/2021

Heterogeneous Dual-Core Overlay Processor for Light-Weight CNNs

Light-weight convolutional neural networks (CNNs) have small complexity ...

0 Tiandong Zhao, et al. ∙

research

∙ 08/30/2021

X2Teeth: 3D Teeth Reconstruction from a Single Panoramic Radiograph

3D teeth reconstruction from X-ray is important for dental diagnosis and...

9 Yuan Liang, et al. ∙

research

∙ 07/27/2021

Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis

Cross-speaker style transfer is crucial to the applications of multi-sty...

0 Shifeng Pan, et al. ∙

research

∙ 07/21/2021

TumorCP: A Simple but Effective Object-Level Data Augmentation for Tumor Segmentation

Deep learning models are notoriously data-hungry. Thus, there is an urgi...

0 Jiawei Yang, et al. ∙

research

∙ 07/14/2021

Diff-Net: Image Feature Difference based High-Definition Map Change Detection

Up-to-date High-Definition (HD) maps are essential for self-driving cars...

7 Lei He, et al. ∙

research

∙ 06/08/2021

Speech BERT Embedding For Improving Prosody in Neural TTS

This paper presents a speech BERT model to extract embedded prosody info...

0 Liping Chen, et al. ∙

research

∙ 04/27/2021

On Addressing Practical Challenges for RNN-Transducer

In this paper, several works are proposed to address practical challenge...

0 Rui Zhao, et al. ∙

research

∙ 04/13/2021

NPE: An FPGA-based Overlay Processor for Natural Language Processing

In recent years, transformer-based models have shown state-of-the-art re...

0 Hamza Khan, et al. ∙

research

∙ 04/08/2021

Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation

Machine Speech Chain, which integrates both end-to-end (E2E) automatic s...

0 Fengpeng Yue, et al. ∙

research

∙ 03/05/2021

Multilingual Byte2Speech Text-To-Speech Models Are Few-shot Spoken Language Learners

We present a multilingual end-to-end Text-To-Speech framework that maps ...

0 Mutian He, et al. ∙

research

∙ 02/02/2021

Atlas-aware ConvNet for Accurate yet Robust Anatomical Segmentation

Convolutional networks (ConvNets) have achieved promising accuracy for v...

3 Yuan Liang, et al. ∙

research

∙ 01/19/2021

SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images

Depth estimation and semantic segmentation play essential roles in scene...

0 Lei He, et al. ∙

research

∙ 12/31/2020

OralViewer: 3D Demonstration of Dental Surgeries for Patient Education with Oral Cavity Reconstruction from a 2D Panoramic X-ray

Patient's understanding on forthcoming dental surgeries is required by p...

0 Yuan Liang, et al. ∙

research

∙ 12/23/2020

Exploring Instance-Level Uncertainty for Medical Detection

The ability of deep learning to predict with uncertainty is recognized a...

7 Jiawei Yang, et al. ∙

research

∙ 11/17/2020

s-Transformer: Segment-Transformer for Robust Neural Speech Synthesis

Neural end-to-end text-to-speech (TTS), which adopts either a recurrent...

0 Xi Wang, et al. ∙

research

∙ 09/30/2020

Explainable Deep Reinforcement Learning for UAV Autonomous Navigation

Modern deep reinforcement learning plays an important role to solve a wi...

0 Lei He, et al. ∙

research

∙ 08/06/2020

Deep Reinforcement Learning based Local Planner for UAV Obstacle Avoidance using Demonstration Data

In this paper, a deep reinforcement learning (DRL) method is proposed to...

0 Lei He, et al. ∙

research

∙ 07/30/2020

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability

Because of its streaming nature, recurrent neural network transducer (RN...

0 Jinyu Li, et al. ∙

research

∙ 06/13/2020

Accurate Anchor Free Tracking

Visual object tracking is an important application of computer vision. R...

12 Shengyun Peng, et al. ∙

research

∙ 05/21/2020

Conversational End-to-End TTS for Voice Agent

End-to-end neural TTS has achieved superior performance on reading style...

0 Haohan Guo, et al. ∙

research

∙ 03/18/2020

Oral-3D: Reconstructing the 3D Bone Structure of Oral Cavity from 2D Panoramic X-ray

Panoramic X-ray and Cone Beam Computed Tomography (CBCT) are two of the ...

3 Weinan Song, et al. ∙

research

∙ 02/19/2020

T-Net: A Template-Supervised Network for Task-specific Feature Extraction in Biomedical Image Analysis

Existing deep learning methods depend on an encoder-decoder structure to...

32 Weinan Song, et al. ∙

research

∙ 01/16/2020

OralCam: Enabling Self-Examination and Awareness of Oral Health Using a Smartphone Camera

Due to a lack of medical resources or oral health awareness, oral diseas...

0 Yuan Liang, et al. ∙

research

∙ 01/07/2020

Effective scaling of blockchain beyond consensus innovations and Moore's law

As an emerging technology, blockchain has achieved great success in nume...

0 Yinqiu Liu, et al. ∙

research

∙ 10/10/2019

CompareNet: Anatomical Segmentation Network with Deep Non-local Label Fusion

Label propagation is a popular technique for anatomical segmentation. In...

0 Yuan Liang, et al. ∙

research

∙ 10/04/2019

Order Acceptance and Scheduling with Sequence-dependent Setup Times: a New Memetic Algorithm and Benchmark of the State of the Art

The Order Acceptance and Scheduling (OAS) problem describes a class of r...

0 Lei He, et al. ∙

research

∙ 08/31/2019

EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks

Inspired by the great success of convolutional neural networks on struct...

0 Lei He, et al. ∙
