The local and global features are both essential for automatic speech
re...
Attention-based encoder-decoder (AED) models have shown impressive
perfo...
With the prosperity of the video surveillance, multiple visual sensors h...
Detecting pedestrians accurately in urban scenes is significant for real...
Arbitrary shape text detection is a challenging task due to the signific...
With the prosperity of e-commerce industry, various modalities, e.g., vi...
Arbitrary shape text detection is a challenging task due to its complexi...
In object detection, non-maximum suppression (NMS) methods are extensive...
The open-set text recognition task is an emerging challenge that require...
Segmentation-based methods have achieved great success for arbitrary sha...
Scene text recognition is a popular topic and can benefit various tasks....
Learning a common latent embedding by aligning the latent spaces of
cros...
Non-autoregressive (NAR) transformer models have been studied intensivel...
Arbitrary shape text detection is a challenging task due to the high
com...
License plate detection is the first and essential step of the license p...
Text-based visual question answering (VQA) requires to read and understa...
State-of-the-art pedestrian detectors have achieved significant progress...
Arbitrary shape text detection is a challenging task due to the high var...
Recognizing text in the wild is a really challenging task because of com...
Classifier ensemble generally should combine diverse component classifie...
Text detection in natural scene images is an important prerequisite for ...