The convergence of text, visual, and audio data is a key step towards
hu...
In this paper, we consider an intelligent reflecting surface (IRS)-aided...
Human intelligence is multimodal; we integrate visual, linguistic, and
a...
Automated visual understanding of our diverse and open world demands com...
Photo composition is an important factor affecting the aesthetics in
pho...
Automatic photo cropping is an important tool for improving visual quali...