Constructing supervised machine learning models for real-world video ana...
Despite the remarkable performance of Vision Transformers (ViTs) in vari...
The expanding model size and computation of deep neural networks (DNNs) ...
The purpose of federated learning is to enable multiple clients to joint...
Human-Object Interaction (HOI) detection is an important problem to
unde...
Existing image-based activity understanding methods mainly adopt direct
...
Deep neural networks have achieved state-of-the-art performance on vario...
Human activity understanding is crucial for building automatic intellige...
Human-Object Interaction (HOI) Detection is an important problem to
unde...