Estimating the Selectivity of Spatial Queries Using the 'Correlation' Fractal Dimension

Belussi, Alberto; Faloutsos, Christos

doi:10.1184/R1/6605282.v1

file.pdf (537.1 kB)

Estimating the Selectivity of Spatial Queries Using the 'Correlation' Fractal Dimension

journal contribution

posted on 2005-03-01, 00:00 authored by Alberto Belussi, Christos Faloutsos

We examine the estimation of selectivities for range and spatial join queries in real spatial databases. As we have shown earlier [FK94], real point sets: (a) violate consistentlythe "uniformity" and "independence" assumptions, (b) can often be described as "fractals", with non-integer (fractal) dimension. In this paper we show that, among the infinite family of fractal dimensions, the so called "Correlation Dimension" D2 is the one that we need to predict the selectivity of spatial join. The main contribution is that, for all the real and synthetic point-sets we tried, the average number of neighbors for a given point of the point-set follows a power law, with D2 as the exponent. This immediately solves the selectivity estimation for spatial joins, as well as for "biased" range queries (i.e., queries whose centers prefer areas of high point density). We present the formulas to estimate the selectivity for the biased queries, including an integration constant (Kshape!,) for each query shape. Finally, we show results on real and synthetic point sets, where our formulas achieve very low relative errors (typically about 10%, versus 40%-100% of the uniformity assumption).

History

Date

2005-03-01

Usage metrics

Keywords

computer sciences

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Estimating the Selectivity of Spatial Queries Using the 'Correlation' Fractal Dimension

History

Date

Usage metrics

Categories

Keywords

Licence

Exports