The integration of Large Language Models (LLMs) into robotics has revolu...
Generative pre-trained transformer (GPT) models have revolutionized the ...
Audio-guided Video Object Segmentation (A-VOS) and Referring Video Objec...
The integration of diverse visual prompts like clicks, scribbles, and bo...
Transformer-based methods have demonstrated superior performance for mon...
Interactive Image Segmentation (IIS) has emerged as a promising techniqu...
A semantic map of the road scene, covering fundamental road elements, is...