Yu Sun

research

∙ 09/17/2023

From Cooking Recipes to Robot Task Trees – Improving Planning Correctness and Task Efficiency by Leveraging LLMs with a Knowledge Network

Task planning for robotic cooking involves generating a sequence of acti...

0 Md Sadman Sakib, et al. ∙

research

∙ 07/18/2023

ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting

Recent 2D-to-3D human pose estimation (HPE) utilizes temporal consistenc...

0 Hongwei Zheng, et al. ∙

research

∙ 07/05/2023

Only Pick Once – Multi-Object Picking Algorithms for Picking Exact Number of Objects Efficiently

Picking up multiple objects at once is a grasping skill that makes a hum...

0 Zihe Ye, et al. ∙

research

∙ 06/05/2023

TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments

Although the estimation of 3D human pose and shape (HPS) is rapidly prog...

0 Yu Sun, et al. ∙

research

∙ 05/31/2023

Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials

Heterogeneous Graph Neural Networks (HGNNs) have gained significant popu...

0 Mingguo He, et al. ∙

research

∙ 05/29/2023

Test-Time Training on Nearest Neighbors for Large Language Models

Many recent efforts aim to augment language models with relevant informa...

0 Moritz Hardt, et al. ∙

research

∙ 04/25/2023

Learning Task-Specific Strategies for Accelerated MRI

Compressed sensing magnetic resonance imaging (CS-MRI) seeks to recover ...

0 Zihui Wu, et al. ∙

research

∙ 02/21/2023

Label Information Enhanced Fraud Detection against Low Homophily in Graphs

Node classification is a substantial problem in graph-based fraud detect...

0 Yuchen Wang, et al. ∙

research

∙ 02/15/2023

Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation

There has been a recent surge of interest in introducing transformers to...

0 Han Li, et al. ∙

research

∙ 02/09/2023

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

In recent years, there has been an increased popularity in image and spe...

0 Pengfei Zhu, et al. ∙

research

∙ 01/09/2023

ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization

Task-agnostic knowledge distillation attempts to address the problem of ...

0 Weixin Liu, et al. ∙

research

∙ 12/13/2022

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages

Software engineers working with the same programming language (PL) may s...

0 Yekun Chai, et al. ∙

research

∙ 11/30/2022

X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection

Detecting sarcasm and verbal irony from people's subjective statements i...

0 Yaqian Han, et al. ∙

research

∙ 11/27/2022

X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications

This paper describes our winning system on SemEval 2022 Task 7: Identify...

0 Junyuan Shang, et al. ∙

research

∙ 11/09/2022

ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Recent cross-lingual cross-modal works attempt to extend Vision-Language...

0 Bin Shan, et al. ∙

research

∙ 11/07/2022

ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech

Speech representation learning has improved both speech understanding an...

0 Xiaoran Fan, et al. ∙

research

∙ 10/31/2022

SDCL: Self-Distillation Contrastive Learning for Chinese Spell Checking

Due to the ambiguity of homophones, Chinese Spell Checking (CSC) has wid...

0 Xiaotian Zhang, et al. ∙

research

∙ 10/27/2022

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts

Recent progress in diffusion models has revolutionized the popular techn...

0 Zhida Feng, et al. ∙

research

∙ 10/21/2022

Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards

Derivative-free prompt learning has emerged as a lightweight alternative...

3 Yekun Chai, et al. ∙

research

∙ 10/18/2022

CPS-MEBR: Click Feedback-Aware Web Page Summarization for Multi-Embedding-Based Retrieval

Embedding-based retrieval (EBR) is a technique to use embeddings to repr...

0 Wenbiao Li, et al. ∙

research

∙ 10/12/2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

Recent years have witnessed the rise and success of pre-training techniq...

12 Qiming Peng, et al. ∙

research

∙ 09/30/2022

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training

Recent Vision-Language Pre-trained (VLP) models based on dual encoder ha...

7 Bin Shan, et al. ∙

research

∙ 09/18/2022

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

Recent efforts of multimodal Transformers have improved Visually Rich Do...

22 Wenjin Wang, et al. ∙

research

∙ 09/15/2022

Test-Time Training with Masked Autoencoders

Test-time training adapts to a new test distribution on the fly by optim...

4 Yossi Gandelsman, et al. ∙

research

∙ 09/02/2022

WOC: A Handy Webcam-based 3D Online Chatroom

We develop WOC, a webcam-based 3D virtual online chatroom for multi-pers...

4 Chuanhang Yan, et al. ∙

research

∙ 08/09/2022

An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition

Named entity recognition (NER) is the task to detect and classify the en...

3 Hang Yan, et al. ∙

research

∙ 07/08/2022

Approximate Task Tree Retrieval in a Knowledge Network for Robotic Cooking

Flexible task planning continues to pose a difficult challenge for robot...

0 Md Sadman Sakib, et al. ∙

research

∙ 05/30/2022

Multi-Object Grasping – Types and Taxonomy

This paper proposes 12 multi-object grasps (MOGs) types from a human and...

0 Yu Sun, et al. ∙

research

∙ 05/29/2022

A General Multiple Data Augmentation Based Framework for Training Deep Neural Networks

Deep neural networks (DNNs) often rely on massive labelled data for trai...

20 Binyan Hu, et al. ∙

research

∙ 05/19/2022

Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

The ever-growing model size and scale of compute have attracted increasi...

8 Yang Xiang, et al. ∙

research

∙ 05/13/2022

Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning

Relational graph neural networks have garnered particular attention to e...

5 Huijuan Wang, et al. ∙

research

∙ 04/01/2022

Robust Neonatal Face Detection in Real-world Clinical Settings

Current face detection algorithms are extremely generalized and can obta...

24 Jacqueline Hausmann, et al. ∙

research

∙ 02/28/2022

Learning Cross-Video Neural Representations for High-Quality Frame Interpolation

This paper considers the problem of temporal video interpolation, where ...

8 Wentao Shangguan, et al. ∙

research

∙ 01/19/2022

CyberRadar: A PUF-based Detecting and Mapping Framework for Physical Devices

The core issue of cyberspace detecting and mapping is to accurately iden...

0 Dawei Li, et al. ∙

research

∙ 01/04/2022

Graph Neural Networks for Double-Strand DNA Breaks Prediction

Double-strand DNA breaks (DSBs) are a form of DNA damage that can cause ...

16 Xu Wang, et al. ∙

research

∙ 12/31/2021

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Conventional methods for the image-text generation tasks mainly tackle t...

6 Han Zhang, et al. ∙

research

∙ 12/23/2021

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

Pre-trained language models have achieved state-of-the-art results in va...

4 Shuohuan Wang, et al. ∙

research

∙ 12/18/2021

Calorie Aware Automatic Meal Kit Generation from an Image

Calorie and nutrition research has attained increased interest in recent...

9 Ahmad Babaeian Jelodar, et al. ∙

research

∙ 12/18/2021

Multi-Object Grasping – Generating Efficient Robotic Picking and Transferring Policy

Transferring multiple objects between bins is a common task for many app...

0 Adheesh Shenoy, et al. ∙

research

∙ 12/15/2021

Putting People in their Place: Monocular Regression of 3D People in Depth

Given an image with multiple people, our goal is to directly regress the...

17 Yu Sun, et al. ∙

research

∙ 12/05/2021

Improving Intention Detection in Single-Trial Classification through Fusion of EEG and Eye-tracker Data

Intention decoding is an indispensable procedure in hands-free human-com...

0 Xianliang Ge, et al. ∙

research

∙ 12/04/2021

Functional Task Tree Generation from a Knowledge Graph to Solve Unseen Problems

A major component for developing intelligent and autonomous robots is a ...

2 Md Sadman Sakib, et al. ∙

research

∙ 12/02/2021

Graph4Rec: A Universal Toolkit with Graph Neural Networks for Recommender Systems

In recent years, owing to the outstanding performance in graph represent...

6 Weibin Li, et al. ∙

research

∙ 11/30/2021

Multi-Object Grasping – Estimating the Number of Objects in a Robotic Grasp

A human hand can grasp a desired number of objects at once from a pile b...

5 Tianze Chen, et al. ∙

research

∙ 11/23/2021

Hierarchical Graph Networks for 3D Human Pose Estimation

Recent 2D-to-3D human pose estimation works tend to utilize the graph st...

4 Han Li, et al. ∙

research

∙ 08/05/2021

Pattern Recognition in Vital Signs Using Spectrograms

Spectrograms visualize the frequency components of a given signal which ...

5 Sidharth Srivatsav Sribhashyam, et al. ∙

research

∙ 08/03/2021

Research Challenges and Progress in Robotic Grasping and Manipulation Competitions

This paper discusses recent research progress in robotic grasping and ma...

2 Yu Sun, et al. ∙

research

∙ 07/05/2021

ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

Pre-trained models have achieved state-of-the-art results in various Nat...

12 Yu Sun, et al. ∙

research

∙ 06/04/2021

ERNIE-Tiny : A Progressive Distillation Framework for Pretrained Transformer Compression

Pretrained language models (PLMs) such as BERT adopt a training paradigm...

8 Weiyue Su, et al. ∙

research

∙ 06/01/2021

Evaluating Recipes Generated from Functional Object-Oriented Network

The functional object-oriented network (FOON) has been introduced as a k...

13 Md Sadman Sakib, et al. ∙
