Kun Yao

research

∙ 08/14/2023

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning

Due to the flexible representation of arbitrary-shaped scene text and si...

0 Xugong Qin, et al. ∙

research

∙ 07/24/2023

MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary

Document dewarping from a distorted camera-captured image is of great va...

0 Beiya Dai, et al. ∙

research

∙ 06/29/2023

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation

One of the mainstream schemes for 2D human pose estimation (HPE) is lear...

0 Zhongwei Qiu, et al. ∙

research

∙ 06/05/2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Structured text extraction is one of the most valuable and challenging a...

0 Wenwen Yu, et al. ∙

research

∙ 05/19/2023

Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding

Transformers achieve promising performance in document understanding bec...

0 Mingliang Zhai, et al. ∙

research

∙ 03/01/2023

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

In this paper, we present StrucTexTv2, an effective document image pre-t...

0 Yuechen Yu, et al. ∙

research

∙ 11/17/2022

CAE v2: Context Autoencoder with CLIP Target

Masked image modeling (MIM) learns visual representation by masking and ...

0 Xinyu Zhang, et al. ∙

research

∙ 11/07/2022

Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining

We present a strong object detector with encoder-decoder pretraining and...

0 Qiang Chen, et al. ∙

research

∙ 08/31/2022

TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers

Table structure recognition is a crucial part of document image analysis...

0 Zengyuan Guo, et al. ∙

research

∙ 07/15/2022

Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter

Typical text spotters follow the two-stage spotting strategy: detect the...

0 Jingjing Wu, et al. ∙

research

∙ 06/01/2022

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

In this paper, we present a model pretraining technique, named MaskOCR, ...

0 Pengyuan Lyu, et al. ∙

research

∙ 03/31/2022

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

Visual appearance is considered to be the most important cue to understa...

0 Mengjun Cheng, et al. ∙

research

∙ 08/06/2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers

Structured text understanding on Visually Rich Documents (VRDs) is a cru...

0 Yulin Li, et al. ∙

research

∙ 10/31/2018

Compressing physical properties of atomic species for improving predictive chemistry

The answers to many unsolved problems lie in the intractable chemical sp...

0 John E. Herr, et al. ∙

research

∙ 12/19/2017

Metadynamics for Training Neural Network Model Chemistries: a Competitive Assessment

Neural network (NN) model chemistries (MCs) promise to facilitate the ac...

0 John E. Herr, et al. ∙

research

∙ 09/22/2016

The Many-Body Expansion Combined with Neural Networks

Fragmentation methods such as the many-body expansion (MBE) are a common...

0 Kun Yao, et al. ∙

Kun Yao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro