Adaptive partitioning by local density-peaks: An efficient density-based clustering algorithm for analyzing molecular dynamics trajectories

Song Liu, Lizhe Zhu*, Fu Kit Sheong, Wei Wang, Xuhui Huang

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

We present an efficient density-based adaptive-resolution clustering method APLoD for analyzing large-scale molecular dynamics (MD) trajectories. APLoD performs the k-nearest-neighbors search to estimate the density of MD conformations in a local fashion, which can group MD conformations in the same high-density region into a cluster. APLoD greatly improves the popular density peaks algorithm by reducing the running time and the memory usage by 2–3 orders of magnitude for systems ranging from alanine dipeptide to a 370-residue Maltose-binding protein. In addition, we demonstrate that APLoD can produce clusters with various sizes that are adaptive to the underlying density (i.e., larger clusters at low-density regions, while smaller clusters at high-density regions), which is a clear advantage over other popular clustering algorithms including k-centers and k-medoids. We anticipate that APLoD can be widely applied to split ultra-large MD datasets containing millions of conformations for subsequent construction of Markov State Models.

Original languageEnglish
Pages (from-to)152-160
Number of pages9
JournalJournal of Computational Chemistry
Volume38
Issue number3
DOIs
Publication statusPublished - 30 Jan 2017

Bibliographical note

Publisher Copyright:
© 2016 Wiley Periodicals, Inc.

Keywords

  • Markov State Models
  • clustering algorithm
  • density peaks
  • kNN search
  • molecular dynamics

Fingerprint

Dive into the research topics of 'Adaptive partitioning by local density-peaks: An efficient density-based clustering algorithm for analyzing molecular dynamics trajectories'. Together they form a unique fingerprint.

Cite this