Abstract
Real-life graph datasets extracted from Web are inevitably full of incompleteness, conflicts, and redundancies, so graph data cleaning shows its necessity. One of the main issues is to automatically repair the graph with some repairing rules. Although rules like data dependencies have been widely studied in relational data repairing, very few works exist to repair the graph data. In this paper, we introduce an automatic repairing semantic for graphs, called Graph-Repairing Rules (GRRs). This semantic can capture the incompleteness, conflicts, and redundancies in the graphs and indicate how to correct these errors. We study three fundamental problems associated with GRRs, implication, consistency and termination, which show whether a given set of GRRs make sense. Repairing the graph data using GRRs involves a problem of finding isomorphic subgraphs of the graph data for each GRR, which is NP-complete. To efficiently circumvent the complex calculation of subgraph isomorphism, we design a decomposition-And-join strategy to solve this problem. Extensive experiments on real datasets show that our GRR semantic and corresponding repairing algorithms can effectively and efficiently repair real-life graph data.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 773-784 |
| Number of pages | 12 |
| ISBN (Electronic) | 9781538655207 |
| DOIs | |
| Publication status | Published - 24 Oct 2018 |
| Event | 34th IEEE International Conference on Data Engineering, ICDE 2018 - Paris, France Duration: 16 Apr 2018 → 19 Apr 2018 |
Publication series
| Name | Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 |
|---|
Conference
| Conference | 34th IEEE International Conference on Data Engineering, ICDE 2018 |
|---|---|
| Country/Territory | France |
| City | Paris |
| Period | 16/04/18 → 19/04/18 |
Bibliographical note
Publisher Copyright:© 2018 IEEE.
Keywords
- Graph Repair
- Rule