This paper presents a novel approach to address the Entity Recognition a...
This article investigates the weak approximation towards the invariant
m...
Referring Expression Generation (REG) aims to generate unambiguous Refer...
By combining a logarithm transformation with a corrected Milstein-type
m...
The paper establishes the strong convergence rates of a spatio-temporal ...
In this paper, we revisit the backward Euler method for numerical
approx...
Existing multimodal task-oriented dialog data fails to demonstrate the
d...
In the field of computational finance, it is common for the quantity of
...
Reinforcement learning has been applied to train the dialog systems in m...
Existing multimodal conversation agents have shown impressive abilities ...
Reference Expression Generation (REG) and Comprehension (REC) are two hi...
We study a class of fully-discrete schemes for the numerical approximati...
The first aim of this paper is to examine existence, uniqueness and
regu...
In this paper, we propose and analyze an explicit time-stepping scheme f...
Existing Visual Question Answering (VQA) models have explored various vi...
Most existing approaches to Visual Question Answering (VQA) answer quest...
A slot value might be provided segment by segment over multiple-turn
int...
Visual dialog has witnessed great progress after introducing various
vis...
As opposed to an overwhelming number of works on strong approximations, ...
Keyphrase provides accurate information of document content that is high...
Unlike well-structured text, such as news reports and encyclopedia artic...
Considering the importance of building a good Visual Dialog (VD) Questio...
Multimodal pre-training models, such as LXMERT, have achieved excellent
...
To encourage AI agents to conduct meaningful Visual Dialogue (VD), the u...
Massive MIMO uses a large number of antennas to increase the spectral
ef...
We propose a novel task, Multi-Document Driven Dialogue (MD3), in which ...
Emotion Recognition in Conversations (ERC) is essential for building
emp...
A goal-oriented visual dialogue involves multi-turn interactions between...
A major challenge of multi-label text classification (MLTC) is to
stimul...
A novel class of implicit Milstein type methods is devised and analyzed ...
Knowledge graph (KG) entity typing aims at inferring possible missing en...
The Guesser plays an important role in GuessWhat?! like visual dialogues...
This article aims to reveal the mean-square convergence rate of the back...
We discretize the stochastic Allen-Cahn equation with additive noise by ...
GuessWhat?! is a visual dialogue task between a guesser and an oracle. T...
This paper proposes a deep neural network model for joint modeling Natur...
Wireless channels generally exhibit dispersion in both time and frequenc...
We establish a multi-user extrinsic information transfer (EXIT) chart ar...
This paper presents a strong baseline for real-world visual reasoning (G...
We consider spatially coupled low-density parity-check (SC-LDPC) codes w...
We consider the design of low-density parity-check (LDPC) codes with
clo...
Leveraging both visual frames and audio has been experimentally proven
e...
Canonical Massive MIMO uses time division duplex (TDD) to exploit channe...
To effectively retrieve objects from large corpus with high accuracy is ...
This paper describes our solution to the multi-modal learning challenge ...
The ICML 2013 Workshop on Challenges in Representation Learning focused ...