Vision-language models have recently shown great potential on many compu...
The contemporary visual captioning models frequently hallucinate objects...
In this paper, we propose a novel graph learning framework for phrase
gr...
Traditional neuron models use analog values for information representati...
Spiking neural networks (SNNs) are considered as a potential candidate t...
Spikes are the currency in central nervous systems for information
trans...
The capability for environmental sound recognition (ESR) can determine t...