Jia Pan

research

∙ 09/15/2023

The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

Previous Multimodal Information based Speech Processing (MISP) challenge...

0 Shilong Wu, et al. ∙

research

∙ 08/28/2023

The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge

This technical report details our submission system to the CHiME-7 DASR ...

0 Ruoyu Wang, et al. ∙

research

∙ 07/12/2023

GRAINS: Proximity Sensing of Objects in Granular Materials

Proximity sensing detects an object's presence without contact. However,...

0 Zeqing Zhang, et al. ∙

research

∙ 07/03/2023

SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation

Domain adaptation (DA) has demonstrated significant promise for real-tim...

0 Liangliang Yao, et al. ∙

research

∙ 06/27/2023

Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation

Transducer is one of the mainstream frameworks for streaming speech reco...

0 Haitao Tang, et al. ∙

research

∙ 06/10/2023

Long-term Microscopic Traffic Simulation with History-Masked Multi-agent Imitation Learning

A realistic long-term microscopic traffic simulator is necessary for und...

0 Ke Guo, et al. ∙

research

∙ 06/01/2023

A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics

During the diagnostic process, clinicians leverage multimodal informatio...

0 Hong-Yu Zhou, et al. ∙

research

∙ 05/10/2023

Fast Event-based Double Integral for Real-time Robotics

Motion deblurring is a critical ill-posed problem that is important in m...

0 Shijie Lin, et al. ∙

research

∙ 05/06/2023

Target-free Extrinsic Calibration of Event-LiDAR Dyad using Edge Correspondences

Calibrating the extrinsic parameters of sensory devices is crucial for f...

0 Wanli Xing, et al. ∙

research

∙ 05/04/2023

CCIL: Context-conditioned imitation learning for urban driving

Imitation learning holds great promise for addressing the complex task o...

0 Ke Guo, et al. ∙

research

∙ 04/27/2023

SMAT: A Self-Reinforcing Framework for Simultaneous Mapping and Tracking in Unbounded Urban Environments

With the increasing prevalence of robots in daily life, it is crucial to...

0 Tingxiang Fan, et al. ∙

research

∙ 04/03/2023

TacGNN:Learning Tactile-based In-hand Manipulation with a Blind Robot

In this paper, we propose a novel framework for tactile-based dexterous ...

0 Linhan Yang, et al. ∙

research

∙ 03/11/2023

The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition

The Multi-modal Information based Speech Processing (MISP) challenge aim...

0 Zhe Wang, et al. ∙

research

∙ 03/05/2023

Tight Collision Probability for UAV Motion Planning in Uncertain Environment

Operating unmanned aerial vehicles (UAVs) in complex environments that f...

0 Tianyu Liu, et al. ∙

research

∙ 03/01/2023

Polymer-Based Self-Calibrated Optical Fiber Tactile Sensor

Human skin can accurately sense the self-decoupled normal and shear forc...

0 Wentao Chen, et al. ∙

research

∙ 02/17/2023

Hybrid Traffic Control and Coordination from Pixels

Traffic congestion is a persistent problem in our society. Existing meth...

0 Michael Villarreal, et al. ∙

research

∙ 01/12/2023

Learning to Control and Coordinate Hybrid Traffic Through Robot Vehicles at Complex and Unsignalized Intersections

Intersections are essential road infrastructures for traffic in modern m...

0 Dawei Wang, et al. ∙

research

∙ 12/07/2022

Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit

Speech pre-training has shown great success in learning useful and gener...

0 Pengcheng Li, et al. ∙

research

∙ 12/07/2022

Progressive Multi-Scale Self-Supervised Learning for Speech Recognition

Self-supervised learning (SSL) models have achieved considerable improve...

0 Genshun Wan, et al. ∙

research

∙ 12/07/2022

Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information

Multilingual end-to-end models have shown great improvement over monolin...

0 Fenglin Ding, et al. ∙

research

∙ 12/06/2022

Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation

In this work, we present a novel method, named AV2vec, for learning audi...

0 Jing-Xuan Zhang, et al. ∙

research

∙ 11/26/2022

Siamese Object Tracking for Vision-Based UAM Approaching with Pairwise Scale-Channel Attention

Although the manipulating of the unmanned aerial manipulator (UAM) has b...

0 Guangze Zheng, et al. ∙

research

∙ 11/15/2022

Monocular BEV Perception of Road Scenes via Front-to-Top View Projection

HD map reconstruction is crucial for autonomous driving. LiDAR-based met...

0 Wenxi Liu, et al. ∙

research

∙ 10/01/2022

RDA: An Accelerated Collision-free Motion Planner for Autonomous Navigation in Cluttered Environments

Motion planning is challenging for autonomous systems in multi-obstacle ...

0 Ruihua Han, et al. ∙

research

∙ 09/20/2022

Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition from Egocentric RGB Videos

Understanding dynamic hand motions and actions from egocentric RGB video...

0 Yilin Wen, et al. ∙

research

∙ 09/01/2022

Monocular Camera-based Complex Obstacle Avoidance via Efficient Deep Reinforcement Learning

Deep reinforcement learning has achieved great success in laser-based co...

0 Jianchuan Ding, et al. ∙

research

∙ 06/30/2022

DynamicFilter: an Online Dynamic Objects Removal Framework for Highly Dynamic Environments

Emergence of massive dynamic objects will diversify spatial structures w...

0 Tingxiang Fan, et al. ∙

research

∙ 06/27/2022

A Generalized Continuous Collision Detection Framework of Polynomial Trajectory for Mobile Robots in Cluttered Environments

In this paper, we introduce a generalized continuous collision detection...

0 Zeqing Zhang, et al. ∙

research

∙ 06/25/2022

Learn to Predict How Humans Manipulate Large-sized Objects from Interactive Motions

Understanding human intentions during interactions has been a long-lasti...

0 Weilin Wan, et al. ∙

research

∙ 06/24/2022

ModLaNets: Learning Generalisable Dynamics via Modularity and Physical Inductive Bias

Deep learning models are able to approximate one specific dynamical syst...

0 Yupu Lu, et al. ∙

research

∙ 05/28/2022

Is Lip Region-of-Interest Sufficient for Lipreading?

Lip region-of-interest (ROI) is conventionally used for visual input in ...

0 Jing-Xuan Zhang, et al. ∙

research

∙ 03/31/2022

End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps

In this paper, we aim to forecast a future trajectory distribution of a ...

6 Ke Guo, et al. ∙

research

∙ 03/30/2022

High-resolution Face Swapping via Latent Semantics Disentanglement

We present a novel high-resolution face swapping method using the inhere...

0 Yangyang Xu, et al. ∙

research

∙ 03/23/2022

Autofocus for Event Cameras

Focus control (FC) is crucial for cameras to capture sharp images in cha...

0 Shijie Lin, et al. ∙

research

∙ 03/19/2022

Reinforcement Learned Distributed Multi-Robot Navigation with Reciprocal Velocity Obstacle Shaped Rewards

The challenges to solving the collision avoidance problem lie in adaptiv...

0 Ruihua Han, et al. ∙

research

∙ 02/23/2022

Visual-tactile sensing for Real-time liquid Volume Estimation in Grasping

We propose a deep visuo-tactile model for realtime estimation of the liq...

0 Fan Zhu, et al. ∙

research

∙ 12/31/2021

An Intelligent Self-driving Truck System For Highway Transportation

Recently, there have been many advances in autonomous driving society, a...

10 Dawei Wang, et al. ∙

research

∙ 10/24/2021

DiffSRL: Learning Dynamic-aware State Representation for Deformable Object Control with Differentiable Simulator

Dynamic state representation learning is an important task in robot lear...

0 Sirui Chen, et al. ∙

research

∙ 10/16/2021

Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation

This paper demonstrates that multilingual pretraining, a proper fine-tun...

0 Guanhua Chen, et al. ∙

research

∙ 09/12/2021

Learning Selective Communication for Multi-Agent Path Finding

Learning communication via deep reinforcement learning (RL) or imitation...

0 Ziyuan Ma, et al. ∙

research

∙ 04/25/2021

An Interval Branch-and-Bound-Based Inverse Kinemetics Algorithm Towards Global Optimal Redundancy Resolution

The general inverse kinematics (IK) problem of a manipulator, namely tha...

0 Yajue Yang, et al. ∙

research

∙ 04/18/2021

Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders

Previous works mainly focus on improving cross-lingual transfer for NLU ...

0 Guanhua Chen, et al. ∙

research

∙ 03/19/2021

USTC-NELSLIP System Description for DIHARD-III Challenge

This system description describes our submission system to the Third DIH...

5 Yuxuan Wang, et al. ∙

research

∙ 01/29/2021

Learning-based Optoelectronically Innervated Tactile Finger for Rigid-Soft Interactive Grasping

This paper presents a novel design of a soft tactile finger with omni-di...

0 Linhan Yang, et al. ∙

research

∙ 01/08/2021

A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection

In this paper, we propose a novel four-stage data augmentation approach ...

0 Qing Wang, et al. ∙

research

∙ 12/18/2020

Crowd-Driven Mapping, Localization and Planning

Navigation in dense crowds is a well-known open problem in robotics with...

0 Tingxiang Fan, et al. ∙

research

∙ 12/06/2020

Design of an Optoelectronically Innervated Gripper for Rigid-Soft Interactive Grasping

Over the past few decades, efforts have been made towards robust robotic...

0 Linhan Yang, et al. ∙

research

∙ 10/27/2020

Optimization-Based Framework for Excavation Trajectory Generation

In this paper, we present a novel optimization-based framework for auton...

0 Yajue Yang, et al. ∙

research

∙ 08/20/2020

Autonomous Social Distancing in Urban Environments using a Quadruped Robot

COVID-19 pandemic has become a global challenge faced by people all over...

0 Tingxiang Fan, et al. ∙

research

∙ 06/09/2020

Over-crowdedness Alert! Forecasting the Future Crowd Distribution

In recent years, vision-based crowd analysis has been studied extensivel...

0 Yuzhen Niu, et al. ∙

Jia Pan

Featured Co-authors

Sign in with Google

Consider DeepAI Pro