Pengfei Hu

research

∙ 09/09/2023

Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video

Synthesizing realistic videos according to a given speech is still an op...

0 Xiuzhe Wu, et al. ∙

research

∙ 07/30/2023

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

Recently, handwritten Chinese character error correction has been greatl...

0 Pengfei Hu, et al. ∙

research

∙ 03/24/2023

HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures

The problem of document structure reconstruction refers to converting di...

0 Jiefeng Ma, et al. ∙

research

∙ 03/08/2023

SEMv2: Table Separation Line Detection Based on Conditional Convolution

Table structure recognition is an indispensable element for enabling mac...

0 Zhenrong Zhang, et al. ∙

research

∙ 12/06/2022

Multimodal Tree Decoder for Table of Contents Extraction in Document Images

Table of contents (ToC) extraction aims to extract headings of different...

0 Pengfei Hu, et al. ∙

research

∙ 12/02/2022

AccEar: Accelerometer Acoustic Eavesdropping with Unconstrained Vocabulary

With the increasing popularity of voice-based applications, acoustic eav...

0 Pengfei Hu, et al. ∙

research

∙ 09/09/2022

Learning Audio-Visual embedding for Person Verification in the Wild

It has already been observed that audio-visual embedding is more robust ...

9 Peiwen Sun, et al. ∙

research

∙ 09/06/2022

High Speed Rotation Estimation with Dynamic Vision Sensors

Rotational speed is one of the important metrics to be measured for cali...

0 Guangrong Zhao, et al. ∙

research

∙ 04/13/2022

Defensive Patches for Robust Recognition in the Physical World

To operate in real-world high-stakes environments, deep learning systems...

0 Jiakai Wang, et al. ∙

research

∙ 04/07/2022

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

General accent recognition (AR) models tend to directly extract low-leve...

0 Qijie Shao, et al. ∙

research

∙ 04/02/2022

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition

In Uyghur speech, consonant and vowel reduction are often encountered, e...

1 Guodong Ma, et al. ∙

research

∙ 02/05/2022

Iota: A Framework for Analyzing System-Level Security of IoTs

Most IoT systems involve IoT devices, communication protocols, remote cl...

0 Zheng Fang, et al. ∙

research

∙ 12/13/2021

PM-MMUT: Boosted Phone-mask Data Augmentation using Multi-modeling Unit Training for Robust Uyghur E2E Speech Recognition

Consonant and vowel reduction are often encountered in Uyghur speech, wh...

1 Guodong Ma, et al. ∙

research

∙ 10/18/2021

VRM-Phase I VKW system description of long-short video customizable keyword wakeup challenge

Keyword wakeup technology has always been a research hotspot in speech p...

0 Yougen Yuan, et al. ∙

research

∙ 09/16/2021

Membership Inference Attacks Against Recommender Systems

Recently, recommender systems have achieved promising performances and b...

18 Minxing Zhang, et al. ∙

research

∙ 06/29/2020

IoTGaze: IoT Security Enforcement via Wireless Context Analysis

Internet of Things (IoT) has become the most promising technology for se...

0 Tianbo Gu, et al. ∙

