Applications of large open-domain knowledge graphs (KGs) to real-world
p...
There is an increasing adoption of machine learning for encoding data in...
We introduce Saga, a next-generation knowledge construction and serving
...
Structured data, or data that adheres to a pre-defined schema, can suffe...
Outlier detection plays a significant role in various real world applica...
Real-time data analytics systems such as SAP HANA, MemSQL, and IBM Wildf...
Organizations are increasingly relying on data to support decisions. Whe...
Lack of data and data quality issues are among the main bottlenecks that...
Data deduplication is the task of detecting records in a database that
c...
Record fusion is the task of aggregating multiple records that correspon...
The problem of mining integrity constraints from data has been extensive...
We study the problem of recovering the latent ground truth labeling of a...
How should a cleaning system measure the amount of inconsistency in the
...
We introduce a few-shot learning framework for error detection. We show ...
This paper explores the problem of matching entities across different
kn...
We analyze the problem of discovering dependencies from distributed big ...
Traditional modeling of inconsistency in database theory casts all possi...
Data cleaning is the process of detecting and repairing inaccurate or co...
Four key processes in data integration are: data preparation (i.e.,
extr...