TY - JOUR
T1 - Fast and accurate kernel density approximation using a divide-and-conquer approach
AU - Jin, Yan Xia
AU - Zhang, Kai
AU - Kwok, James T.
AU - Zhou, Han Chang
PY - 2010/9
Y1 - 2010/9
N2 - Density-based nonparametric clustering techniques, such as the mean shift algorithm, are well known for their flexibility and effectiveness in real-world vision-based problems. The underlying kernel density estimation process can be very expensive on large datasets. In this paper, the divide-and-conquer method is proposed to reduce these computational requirements. The dataset is first partitioned into a number of small, compact clusters. Components of the kernel estimator in each local cluster are then fit to a single, representative density function. The key novelty presented here is the efficient derivation of the representative density function using concepts from function approximation, such that the expensive kernel density estimator can be easily summarized by a highly compact model with very few basis functions. The proposed method has a time complexity that is only linear in the sample size and data dimensionality. Moreover, the bandwidth of the resultant density model is adaptive to local data distribution. Experiments on color image filtering/segmentation show that, the proposed method is dramatically faster than both the standard mean shift and fast mean shift implementations based on kd-trees while producing competitive image segmentation results.
AB - Density-based nonparametric clustering techniques, such as the mean shift algorithm, are well known for their flexibility and effectiveness in real-world vision-based problems. The underlying kernel density estimation process can be very expensive on large datasets. In this paper, the divide-and-conquer method is proposed to reduce these computational requirements. The dataset is first partitioned into a number of small, compact clusters. Components of the kernel estimator in each local cluster are then fit to a single, representative density function. The key novelty presented here is the efficient derivation of the representative density function using concepts from function approximation, such that the expensive kernel density estimator can be easily summarized by a highly compact model with very few basis functions. The proposed method has a time complexity that is only linear in the sample size and data dimensionality. Moreover, the bandwidth of the resultant density model is adaptive to local data distribution. Experiments on color image filtering/segmentation show that, the proposed method is dramatically faster than both the standard mean shift and fast mean shift implementations based on kd-trees while producing competitive image segmentation results.
KW - Image filtering
KW - Kernel density estimation
KW - Mean shift
KW - Nonparametric clustering
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000281732700002
UR - https://openalex.org/W2090660355
UR - https://www.scopus.com/pages/publications/78650080851
U2 - 10.1631/jzus.C0910668
DO - 10.1631/jzus.C0910668
M3 - Journal Article
SN - 1869-1951
VL - 11
SP - 677
EP - 689
JO - Journal of Zhejiang University: Science C
JF - Journal of Zhejiang University: Science C
IS - 9
ER -