Liqiang Nie

research

∙ 09/04/2023

Target-Guided Composed Image Retrieval

Composed image retrieval (CIR) is a new and flexible image retrieval par...

0 Haokun Wen, et al. ∙

research

∙ 08/17/2023

Building Emotional Support Chatbots in the Era of LLMs

The integration of emotional support into various conversational scenari...

0 Zhonghua Zheng, et al. ∙

research

∙ 08/14/2023

Temporal Sentence Grounding in Streaming Videos

This paper aims to tackle a novel task - Temporal Sentence Grounding in ...

0 Tian Gan, et al. ∙

research

∙ 08/09/2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

In the text-to-image generation field, recent remarkable progress in Sta...

0 Leigang Qu, et al. ∙

research

∙ 08/06/2023

Semantic-Guided Feature Distillation for Multimodal Recommendation

Multimodal recommendation exploits the rich multimodal information assoc...

0 Fan Liu, et al. ∙

research

∙ 08/06/2023

StyleEDL: Style-Guided High-order Attention Network for Image Emotion Distribution Learning

Emotion distribution learning has gained increasing attention with the t...

0 Peiguang Jing, et al. ∙

research

∙ 07/27/2023

Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration

Training an effective video action recognition model poses significant c...

0 Harry Cheng, et al. ∙

research

∙ 07/24/2023

Towards Generalizable Deepfake Detection by Primary Region Regularization

The existing deepfake detection methods have reached a bottleneck in gen...

0 Harry Cheng, et al. ∙

research

∙ 07/20/2023

General Debiasing for Multimodal Sentiment Analysis

Existing work on Multimodal Sentiment Analysis (MSA) utilizes multimodal...

0 Teng Sun, et al. ∙

research

∙ 06/29/2023

Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation

Multimodal Sarcasm Explanation (MuSE) is a new yet challenging task, whi...

0 Liqiang Jing, et al. ∙

research

∙ 06/01/2023

DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection

Existing deepfake detection methods fail to generalize well to unseen or...

0 Rui Shao, et al. ∙

research

∙ 05/17/2023

Dual Semantic Knowledge Composed Multimodal Dialog Systems

Textual response generation is an essential task for multimodal task-ori...

0 Xiaolin Chen, et al. ∙

research

∙ 05/17/2023

Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval

The composed image retrieval (CIR) task aims to retrieve the desired tar...

0 Haokun Wen, et al. ∙

research

∙ 05/05/2023

Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain

Existing data-to-text generation efforts mainly focus on generating a co...

0 Liqiang Jing, et al. ∙

research

∙ 04/24/2023

ChatLLM Network: More brains, More intelligence

Dialogue-based language models mark a huge milestone in the field of art...

0 Rui Hao, et al. ∙

research

∙ 04/03/2023

Rethinking Context Aggregation in Natural Image Matting

For natural image matting, context information plays a crucial role in e...

0 Qinglin Liu, et al. ∙

research

∙ 03/15/2023

Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation

The last decade has witnessed the proliferation of micro-videos on vario...

3 Xiao Wang, et al. ∙

research

∙ 03/14/2023

Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening

Under the flourishing development in performance, current image-text ret...

0 Min Cao, et al. ∙

research

∙ 02/04/2023

Learning to Agree on Vision Attention for Visual Commonsense Reasoning

Visual Commonsense Reasoning (VCR) remains a significant yet challenging...

0 Zhenyang Li, et al. ∙

research

∙ 12/22/2022

Multi-queue Momentum Contrast for Microvideo-Product Retrieval

The booming development and huge market of micro-videos bring new e-comm...

0 Yali Du, et al. ∙

research

∙ 12/20/2022

Causal Inference for Knowledge Graph based Recommendation

Knowledge Graph (KG), as a side-information, tends to be utilized to sup...

0 Yinwei Wei, et al. ∙

research

∙ 12/12/2022

Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection

Fake news often involves multimedia information such as text and image t...

0 Linmei Hu, et al. ∙

research

∙ 11/11/2022

A Survey of Knowledge-Enhanced Pre-trained Language Models

Pre-trained Language Models (PLMs) which are trained on large text corpu...

0 Linmei Hu, et al. ∙

research

∙ 09/27/2022

Privacy-Preserving Synthetic Data Generation for Recommendation Systems

Recommendation systems make predictions chiefly based on users' historic...

0 Fan Liu, et al. ∙

research

∙ 09/12/2022

Deep Convolutional Pooling Transformer for Deepfake Detection

Recently, Deepfake has drawn considerable public attention due to securi...

0 Tianyi Wang, et al. ∙

research

∙ 07/24/2022

Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem

Several studies have recently pointed that existing Visual Question Answ...

19 Yudong Han, et al. ∙

research

∙ 07/24/2022

Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment Analysis

Existing studies on multimodal sentiment analysis heavily rely on textua...

0 Teng Sun, et al. ∙

research

∙ 07/21/2022

Semantic-aware Modular Capsule Routing for Visual Question Answering

Visual Question Answering (VQA) is fundamentally compositional in nature...

0 Yudong Han, et al. ∙

research

∙ 07/16/2022

Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model

Text response generation for multimodal task-oriented dialog systems, wh...

0 Xiaolin Chen, et al. ∙

research

∙ 07/13/2022

Lipschitz Continuity Retained Binary Neural Network

Relying on the premise that the performance of a binary neural network c...

11 Yuzhang Shang, et al. ∙

research

∙ 06/30/2022

A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA

Knowledge-based Visual Question Answering (VQA) expects models to rely o...

0 Yangyang Guo, et al. ∙

research

∙ 04/29/2022

User-controllable Recommendation Against Filter Bubbles

Recommender systems usually face the issue of filter bubbles: overrecomm...

0 Wenjie Wang, et al. ∙

research

∙ 03/28/2022

Image-text Retrieval: A Survey on Recent Research and Development

In the past few years, cross-modal image-text retrieval (ITR) has experi...

0 Min Cao, et al. ∙

research

∙ 03/18/2022

Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation

Scene Graph Generation, which generally follows a regular encoder-decode...

0 Xingning Dong, et al. ∙

research

∙ 03/10/2022

Disentangled Multimodal Representation Learning for Recommendation

Many multimodal recommender systems have been proposed to exploit the ri...

0 Fan Liu, et al. ∙

research

∙ 03/04/2022

Voice-Face Homogeneity Tells Deepfake

Detecting forgery videos is highly desirable due to the abuse of deepfak...

1 Harry Cheng, et al. ∙

research

∙ 03/01/2022

MERIt: Meta-Path Guided Contrastive Learning for Logical Reasoning

Logical reasoning is of vital importance to natural language understandi...

0 Fangkai Jiao, et al. ∙

research

∙ 02/25/2022

On Modality Bias Recognition and Reduction

Making each modality in multi-modal data contribute is of vital importan...

0 Yangyang Guo, et al. ∙

research

∙ 02/25/2022

Joint Answering and Explanation for Visual Commonsense Reasoning

Visual Commonsense Reasoning (VCR), deemed as one challenging extension ...

0 Zhenyang Li, et al. ∙

research

∙ 01/30/2022

Win the Lottery Ticket via Fourier Analysis: Frequencies Guided Network Pruning

With the remarkable success of deep learning recently, efficient network...

24 Yuzhang Shang, et al. ∙

research

∙ 12/02/2021

Learning Robust Recommender from Noisy Implicit Feedback

The ubiquity of implicit feedback makes it indispensable for building re...

0 Wenjie Wang, et al. ∙

research

∙ 10/31/2021

Hierarchical Deep Residual Reasoning for Temporal Moment Localization

Temporal Moment Localization (TML) in untrimmed videos is a challenging ...

0 Ziyang Ma, et al. ∙

research

∙ 10/12/2021

Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

This paper focuses on tackling the problem of temporal language localiza...

0 Zongmeng Zhang, et al. ∙

research

∙ 08/29/2021

Lipschitz Continuity Guided Knowledge Distillation

Knowledge distillation has become one of the most important model compre...

0 Yuzhang Shang, et al. ∙

research

∙ 08/17/2021

When Product Search Meets Collaborative Filtering: A Hierarchical Heterogeneous Graph Neural Network Approach

Personalization lies at the core of boosting the product search system p...

0 Xiangkun Yin, et al. ∙

research

∙ 07/12/2021

Contrastive Learning for Cold-Start Recommendation

Recommending cold-start items is a long-standing and fundamental challen...

0 Yinwei Wei, et al. ∙

research

∙ 06/08/2021

Review Polarity-wise Recommender

Utilizing review information to enhance recommendation, the de facto rev...

0 Han Liu, et al. ∙

research

∙ 05/10/2021

REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training

Pre-trained Language Models (PLMs) have achieved great success on Machin...

0 Fangkai Jiao, et al. ∙

research

∙ 05/05/2021

AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss

A number of studies point out that current Visual Question Answering (VQ...

0 Yangyang Guo, et al. ∙

research

∙ 04/17/2021

A Graph-guided Multi-round Retrieval Method for Conversational Open-domain Question Answering

In recent years, conversational agents have provided a natural and conve...

12 Yongqi Li, et al. ∙
