Recently, linear computed tomography (LCT) systems have actively attract...
Supervised visual captioning models typically require a large scale of i...
Generating an informative and attractive title for the product is a cruc...
Micro-computed tomography (micro-CT) is a widely used state-of-the-art
i...
Natural Language Generation (NLG) accepts input data in the form of imag...
For medical image segmentation, contrastive learning is the dominant pra...
Training supervised video captioning model requires coupled video-captio...
Most video-and-language representation learning approaches employ contra...
Vision-and-language (V-L) tasks require the system to understand both vi...
The "Patient Instruction" (PI), which contains critical instructional
in...
Recently, attention based models have been used extensively in many
sequ...
Recent studies on contrastive learning have achieved remarkable performa...
Medical report generation task, which targets to produce long and cohere...
Gene Ontology (GO) is the primary gene function knowledge base that enab...
Transformers have made remarkable progress towards modeling long-range
d...
Medical report generation, which aims to automatically generate a long a...
Video captioning combines video understanding and language generation.
D...
While Machine Comprehension (MC) has attracted extensive research intere...
Recently, image captioning has aroused great interest in both academic a...
Recently, chest X-ray report generation, which aims to automatically gen...
Automatically generating radiology reports can improve current clinical
...
Skip connection, is a widely-used technique to improve the performance a...
Recently, the attention-enhanced multi-layer encoder, such as Transforme...
In spoken question answering, QA systems are designed to answer question...
Spoken Language Understanding (SLU) is an essential part of the spoken
d...
In sequence-to-sequence learning, the attention mechanism has been a gre...
Recently, attention-based encoder-decoder models have been used extensiv...
Existing state-of-the-art autoregressive video captioning methods (ARVC)...
In image-grounded text generation, fine-grained representations of the i...
The potential huge advantage of spectral computed tomography (CT) is its...
Spectral computed tomography (CT) has a great potential in material
iden...
The encode-decoder framework has shown recent success in image captionin...
Spectral computed tomography (CT) reconstructs material-dependent attenu...