The Traveling Salesman Problem (TSP) is a well-known problem in combinat...
The continuous thriving of the Blockchain society motivates research in ...
Contemporary news reporting increasingly features multimedia content, mo...
Pointly Supervised Object Detection (PSOD) has attracted considerable in...
General-purpose language models that can solve various language-domain t...
Investments in movie production are associated with a high level of risk...
Single-frame infrared small target (SIRST) detection aims at separating ...
Large language models (LLMs) have demonstrated excellent zero-shot gener...
GPT-3 (Generative Pre-trained Transformer 3) is a large-scale autoregres...
Deep neural networks for video action recognition easily learn to utiliz...
In the evolution of agriculture to its next stage, Agriculture 5.0, arti...
Visual question answering (VQA) is a hallmark of vision and language rea...
Prompt tuning, or the conditioning of a frozen pretrained language model...
Space-based infrared tiny ship detection aims at separating tiny ships f...
Conversational recommendation system (CRS) is emerging as a user-friendl...
Self-supervised learning for computer vision has achieved tremendous pro...
Despite recent advances of AI, story understanding remains an open and u...
In the vision domain, large-scale natural datasets typically exhibit long-ta...
Noisy labels are commonly found in real-world data, which cause performa...
Proper initialization is crucial to the optimization and the generalizat...
Single-frame infrared small target (SIRST) detection aims at separating ...
Infrared small target detection plays an important role in many infrared...
The existence of multiple datasets for sarcasm detection prompts us to a...
The existence of noisy labels in real-world data negatively impacts the ...
The behaviors of deep neural networks (DNNs) are notoriously resistant t...
Understanding humor is critical to creative language modeling with many ...
The task of video and text sequence alignment is a prerequisite step tow...
The progress of deep learning (DL), especially the recent development of...
Scaling up the vocabulary and complexity of current visual understanding...
Search space is a key consideration for neural architecture search. Rece...
The success of deep neural networks is clouded by two issues that largel...
A Smart Home provides integrating and electronic information services to...
The problem of explaining deep learning models, and model predictions ge...
In recent years, many efforts have demonstrated that modern machine lear...
Biomedical image segmentation based on deep neural network (DNN) is a pro...
An enormous amount of energy is wasted in Proof-of-Work (PoW) mechanisms ...
Emotional content is a crucial ingredient in user-generated videos. Howe...
Understanding narrative content has become an increasingly popular topic...
Understanding of narrative content has become an increasingly popular to...
As a fine-grained video understanding task, dense video captioning invol...
The alignment of heterogeneous sequential data (video to text) is an imp...
The inception of a mobile app often takes the form of a mock-up of the Graph...
Stories are a vital form of communication in human culture; they are emp...
Psychological studies have shown that personality traits are associated ...
An important and difficult challenge in building computational models fo...
Emotional content is a key element in user-generated videos. However, it...