Entity matching (EM) is a challenging problem studied by different
commu...
Towards better explainability in the field of information retrieval, we
...
New technologies and the availability of geospatial data have drawn atte...
Entity resolution (ER) aims at matching records that refer to the same
r...
Sharing trajectories is beneficial for many real-world applications, suc...
As data is a central component of many modern systems, the cause of a sy...
Data integration is a long-standing interest of the data management comm...
Order dependencies (ODs) capture relationships between ordered domains o...
Differential privacy is the state-of-the-art formal definition for data
...
Blocking is a mechanism to improve the efficiency of Entity Resolution (...
Much real-world data come with explicitly defined domain orders; e.g.,
l...
We propose PODS (Predictable Outliers in Data-trendS), a method that, gi...
Random sampling has been widely used in approximate query processing on ...
A number of extensions to the classical notion of functional dependencie...
Counting the fraction of a population having an input within a specified...
Stratified random sampling (SRS) is a fundamental sampling technique tha...
Many analysis and machine learning tasks require the availability of mar...
Concern about how to aggregate sensitive user data without compromising
...