Due to the flexible representation of arbitrary-shaped scene text and si...
Document dewarping from a distorted camera-captured image is of great va...
One of the mainstream schemes for 2D human pose estimation (HPE) is lear...
Structured text extraction is one of the most valuable and challenging a...
Transformers achieve promising performance in document understanding bec...
In this paper, we present StrucTexTv2, an effective document image pre-t...
Masked image modeling (MIM) learns visual representation by masking and ...
We present a strong object detector with encoder-decoder pretraining and...
Table structure recognition is a crucial part of document image analysis...
Typical text spotters follow the two-stage spotting strategy: detect the...
In this paper, we present a model pretraining technique, named MaskOCR, ...
Visual appearance is considered to be the most important cue to understa...
Structured text understanding on Visually Rich Documents (VRDs) is a cru...
The answers to many unsolved problems lie in the intractable chemical sp...
Neural network (NN) model chemistries (MCs) promise to facilitate the ac...
Fragmentation methods such as the many-body expansion (MBE) are a common...