Abstract
We present an efficient density-based adaptive-resolution clustering method APLoD for analyzing large-scale molecular dynamics (MD) trajectories. APLoD performs the k-nearest-neighbors search to estimate the density of MD conformations in a local fashion, which can group MD conformations in the same high-density region into a cluster. APLoD greatly improves the popular density peaks algorithm by reducing the running time and the memory usage by 2–3 orders of magnitude for systems ranging from alanine dipeptide to a 370-residue Maltose-binding protein. In addition, we demonstrate that APLoD can produce clusters with various sizes that are adaptive to the underlying density (i.e., larger clusters at low-density regions, while smaller clusters at high-density regions), which is a clear advantage over other popular clustering algorithms including k-centers and k-medoids. We anticipate that APLoD can be widely applied to split ultra-large MD datasets containing millions of conformations for subsequent construction of Markov State Models.
| Original language | English |
|---|---|
| Pages (from-to) | 152-160 |
| Number of pages | 9 |
| Journal | Journal of Computational Chemistry |
| Volume | 38 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - 30 Jan 2017 |
Bibliographical note
Publisher Copyright:© 2016 Wiley Periodicals, Inc.
Keywords
- Markov State Models
- clustering algorithm
- density peaks
- kNN search
- molecular dynamics
Fingerprint
Dive into the research topics of 'Adaptive partitioning by local density-peaks: An efficient density-based clustering algorithm for analyzing molecular dynamics trajectories'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver