TY - JOUR
T1 - Pipelines for automating compliance-based elimination and extension (PACE2)
T2 - a systematic framework for high-throughput biomolecular materials simulation workflows
AU - Mushnoori, Srinivas C.
AU - Zang, Ethan
AU - Banerjee, Akash
AU - Hooten, Mason
AU - Merzky, Andre
AU - Turilli, Matteo
AU - Jha, Shantenu
AU - Dutt, Meenakshi
N1 - Publisher Copyright:
© 2023 The Author(s). Published by IOP Publishing Ltd
PY - 2024/1/1
Y1 - 2024/1/1
N2 - The formation of biomolecular materials via dynamical interfacial processes, such as self-assembly and fusion, for diverse compositions and external conditions can be efficiently probed using ensemble Molecular Dynamics (MD). However, this approach requires many simulations when investigating a large composition phase space. In addition, there is difficulty in predicting whether each simulation will yield biomolecular materials with the desired properties or outcomes and how long each simulation will run. These difficulties can be overcome by rules-based management systems, including intermittent inspection, variable sampling, and premature termination or extension of the individual MD simulations. Automating such a management system can significantly improve runtime efficiency and reduce the burden of organizing large ensembles of MD simulations. To this end, a computational framework, the Pipelines for Automating Compliance-based Elimination and Extension (PACE2), is proposed for high-throughput ensemble biomolecular materials simulations. The PACE2 framework encompasses Candidate pipelines, where each pipeline includes temporally separated simulation and analysis tasks. When a MD simulation is completed, an analysis task is triggered, which evaluates the MD trajectory for compliance. Compliant simulations are extended to the next MD phase with a suitable sample rate to allow additional, detailed analysis. Non-compliant simulations are eliminated, and their computational resources are reallocated or released. The framework is designed to run on local desktop computers and high-performance computing resources. Preliminary scientific results enabled by the use of PACE2 framework are presented, which demonstrate its potential and validates its function. In the future, the framework will be extended to address generalized workflows and investigate composition-structure-property relations for other classes of materials.
AB - The formation of biomolecular materials via dynamical interfacial processes, such as self-assembly and fusion, for diverse compositions and external conditions can be efficiently probed using ensemble Molecular Dynamics (MD). However, this approach requires many simulations when investigating a large composition phase space. In addition, there is difficulty in predicting whether each simulation will yield biomolecular materials with the desired properties or outcomes and how long each simulation will run. These difficulties can be overcome by rules-based management systems, including intermittent inspection, variable sampling, and premature termination or extension of the individual MD simulations. Automating such a management system can significantly improve runtime efficiency and reduce the burden of organizing large ensembles of MD simulations. To this end, a computational framework, the Pipelines for Automating Compliance-based Elimination and Extension (PACE2), is proposed for high-throughput ensemble biomolecular materials simulations. The PACE2 framework encompasses Candidate pipelines, where each pipeline includes temporally separated simulation and analysis tasks. When a MD simulation is completed, an analysis task is triggered, which evaluates the MD trajectory for compliance. Compliant simulations are extended to the next MD phase with a suitable sample rate to allow additional, detailed analysis. Non-compliant simulations are eliminated, and their computational resources are reallocated or released. The framework is designed to run on local desktop computers and high-performance computing resources. Preliminary scientific results enabled by the use of PACE2 framework are presented, which demonstrate its potential and validates its function. In the future, the framework will be extended to address generalized workflows and investigate composition-structure-property relations for other classes of materials.
KW - biomolecular materials modeling
KW - biomolecular simulations
KW - ensemble methods
KW - high performance computing resource management
KW - software workflows
UR - http://www.scopus.com/inward/record.url?scp=85177492890&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85177492890&partnerID=8YFLogxK
U2 - 10.1088/2515-7639/ad08d0
DO - 10.1088/2515-7639/ad08d0
M3 - Article
AN - SCOPUS:85177492890
SN - 2515-7639
VL - 7
JO - JPhys Materials
JF - JPhys Materials
IS - 1
M1 - 015006
ER -