Multiscale histograms: Summarizing topological relations in large spatial datasets

Xuemin Lin, Qing Liu, Yidong Yuan, Xiaofang Zhou

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

16 Citations (Scopus)

Abstract

Summarizing topological relations is fundamental to many spatial applications including spatial query optimization. In this paper, we present several novel techniques to effectively construct cell density based spatial histograms for range (window) summarizations restricted to the four most important topological relations: contains, contained, overlap, and disjoint. We first present a novel framework to construct a multiscale histogram composed of multiple Euler histograms with the guarantee of the exact summarization results for aligned windows in constant time. Then we present an approximate algorithm, with the approximate ratio 19/12, to minimize the storage spaces of such multiscale Euler histograms, although the problem is generally NP-hard. To conform to a limited storage space where only k Euler histograms are allowed, an effective algorithm is presented to construct multiscale histograms to achieve high accuracy. Finally, we present a new approximate algorithm to query an Euler histogram that cannot guarantee the exact answers; it runs in constant time. Our extensive experiments against both synthetic and real world datasets demonstrated that the approximate multiscale histogram techniques may improve the accuracy of the existing techniques by several orders of magnitude while retaining the cost efficiency, and the exact multiscale histogram technique requires only a storage space linearly proportional to the number of cells for the real datasets.

Original languageEnglish
Title of host publicationProceedings - 29th International Conference on Very Large Data Bases, VLDB 2003
EditorsPatricia G. Selinger, Michael J. Carey, Johann Christoph Freytag, Serge Abiteboul, Peter C. Lockemann, Andreas Heuer
PublisherMorgan Kaufmann
Pages814-825
Number of pages12
ISBN (Electronic)0127224424, 9780127224428
Publication statusPublished - 2003
Externally publishedYes
Event29th International Conference on Very Large Data Bases, VLDB 2003 - Berlin, Germany
Duration: 9 Sept 200312 Sept 2003

Publication series

NameProceedings - 29th International Conference on Very Large Data Bases, VLDB 2003

Conference

Conference29th International Conference on Very Large Data Bases, VLDB 2003
Country/TerritoryGermany
CityBerlin
Period9/09/0312/09/03

Fingerprint

Dive into the research topics of 'Multiscale histograms: Summarizing topological relations in large spatial datasets'. Together they form a unique fingerprint.

Cite this