Document clustering based on Web-log mining

Zhong Su*, Shao Ping Ma, Qiang Yang, Hong Jiang Zhang

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

6 Citations (Scopus)

Abstract

The effectiveness and efficiency are two problems in clustering algorithms. DBSCAN(density based spatial clustering of applications with noise) is a typical density based clustering(RDBC) algorithm that is very efficient on large databases. A recursive density based clustering algorithm that can adaptively change its parameters intelligently is presented. This clustering algorithm RDBC is based on DBACAN. It can be shown that RDBC require the same time complexity as that of the DBSCAN algorithm. In addition, it is proved both analytically and experimentally that this method yields results more superior than that of DBSCAN.

Original languageEnglish
Pages (from-to)99-104
Number of pages6
JournalRuan Jian Xue Bao/Journal of Software
Volume13
Issue number1
Publication statusPublished - Jan 2002
Externally publishedYes

Keywords

  • Clustering
  • Data mining
  • Databases
  • Web mining

Fingerprint

Dive into the research topics of 'Document clustering based on Web-log mining'. Together they form a unique fingerprint.

Cite this