Video compression relies heavily on exploiting the temporal redundancy
b...
Leveraging vast training data (SA-1B), the foundation Segment Anything M...
A key puzzle in search, ads, and recommendation is that the ranking mode...
Most contemporary supervised Remote Sensing (RS) image Change Detection ...
In machine learning and big data, the optimization objectives based on
s...
To design a drug given a biological molecule by using deep learning meth...
Knowledge tracing (KT) is a fundamental task in educational data mining ...
Supervised multi-view stereo (MVS) methods have achieved remarkable prog...
Recently, Implicit Neural Representations (INRs) parameterized by neural...
We introduce the task of spotting temporally precise, fine-grained event...
Metacells are disjoint and homogeneous groups of single-cell profiles,
r...
We present GLIPv2, a grounded VL understanding model, that serves both
l...
This paper presents a grounded language-image pre-training (GLIP) model ...
In this paper, we present TransMVSNet, based on our exploration of featu...
In this paper, we present a visual localization pipeline, namely MegLoc,...
This report describes Megvii-3D team's approach towards SimLocMatch Chal...
This report describes Megvii-3D team's approach towards CVPR 2021 Image
...
We present the novel Efficient Line Segment Detector and Descriptor (ELS...
Efficient exploration is one of the most important issues in deep
reinfo...
In neural text editing, prevalent sequence-to-sequence based approaches
...
Cable TV news reaches millions of U.S. households each day, meaning that...
We present a system that converts annotated broadcast video of tennis ma...
Multiple object tracking (MOT) is a crucial task in computer vision soci...
The advancement of artificial intelligence has cast a new light on the
d...
This paper proposes the first-ever algorithmic framework for tuning
hype...
Tuning hyper-parameters for evolutionary algorithms is an important issu...
Drones, or general UAVs, equipped with a single camera have been widely
...
Image pairing is an important research task in the field of computer vis...
Many real-world video analysis applications require the ability to ident...
We propose a novel video inpainting algorithm that simultaneously
halluc...
Following recent successes in applying BERT to question answering, we ex...
We introduce, TextureNet, a neural network architecture designed to extr...
Multi-object tracking (MOT) is an important and practical task related t...
This study uses a novel simulation framework to evaluate whether the tim...
Most work on natural language question answering today focuses on answer...
Time is an important relevance signal when searching streams of social m...