TY - JOUR
T1 - Memory kernel minimization-based neural networks for discovering slow collective variables of biomolecular dynamics
AU - Liu, Bojun
AU - Cao, Siqin
AU - Boysen, Jordan G.
AU - Xue, Mingyi
AU - Huang, Xuhui
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Nature America, Inc. 2025.
PY - 2025/7
Y1 - 2025/7
N2 - Identifying collective variables (CVs) that accurately capture the slowest timescales of protein conformational changes is crucial to comprehend numerous biological processes. Here we introduce memory kernel minimization-based neural networks (MEMnets), a deep learning framework that accurately identifies the slow CVs of biomolecular dynamics. Unlike popular CV-identification methods, which typically assume Markovian dynamics, MEMnets is built on the integrative generalized master equation theory, which incorporates non-Markovian dynamics by encoding them in a memory kernel for continuous CVs. The key innovation of MEMnets is the identification of optimal CVs by minimizing the upper bound for the time-integrated memory kernels through parallel encoder networks. We demonstrate that MEMnets effectively identifies slow CVs involved in the folding of the FIP35 WW domain, revealing two parallel folding pathways. In addition, we illustrate MEMnets’ robust numerical stability in identifying meaningful CVs in large biomolecular dynamic systems with limited sampling by applying it to the clamp opening of bacterial RNA polymerase, a much more complex conformational change.
AB - Identifying collective variables (CVs) that accurately capture the slowest timescales of protein conformational changes is crucial to comprehend numerous biological processes. Here we introduce memory kernel minimization-based neural networks (MEMnets), a deep learning framework that accurately identifies the slow CVs of biomolecular dynamics. Unlike popular CV-identification methods, which typically assume Markovian dynamics, MEMnets is built on the integrative generalized master equation theory, which incorporates non-Markovian dynamics by encoding them in a memory kernel for continuous CVs. The key innovation of MEMnets is the identification of optimal CVs by minimizing the upper bound for the time-integrated memory kernels through parallel encoder networks. We demonstrate that MEMnets effectively identifies slow CVs involved in the folding of the FIP35 WW domain, revealing two parallel folding pathways. In addition, we illustrate MEMnets’ robust numerical stability in identifying meaningful CVs in large biomolecular dynamic systems with limited sampling by applying it to the clamp opening of bacterial RNA polymerase, a much more complex conformational change.
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:001505321800001
UR - https://www.scopus.com/pages/publications/105007758250
U2 - 10.1038/s43588-025-00815-8
DO - 10.1038/s43588-025-00815-8
M3 - Journal Article
SN - 2662-8457
VL - 5
SP - 562
EP - 571
JO - Nature Computational Science
JF - Nature Computational Science
IS - 7
ER -