TY - JOUR
T1 - Blind assessment of monomeric AlphaFold2 protein structure models with experimental NMR data
AU - Li, Ethan H.
AU - Spaman, Laura E.
AU - Tejero, Roberto
AU - Janet Huang, Yuanpeng
AU - Ramelot, Theresa A.
AU - Fraga, Keith J.
AU - Prestegard, James H.
AU - Kennedy, Michael A.
AU - Montelione, Gaetano T.
N1 - Publisher Copyright:
© 2023 The Authors
PY - 2023/7
Y1 - 2023/7
N2 - Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open-source protein NMR data sets for such “blind” targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) 15N-1H residual dipolar coupling data. For these nine small (70–108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research.
AB - Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open-source protein NMR data sets for such “blind” targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) 15N-1H residual dipolar coupling data. For these nine small (70–108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research.
KW - AlphaFold2
KW - Artificial intelligence
KW - Deep learning
KW - Model validation
KW - NMR data
KW - Protein NMR
UR - http://www.scopus.com/inward/record.url?scp=85160214521&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85160214521&partnerID=8YFLogxK
U2 - 10.1016/j.jmr.2023.107481
DO - 10.1016/j.jmr.2023.107481
M3 - Article
C2 - 37257257
AN - SCOPUS:85160214521
SN - 1090-7807
VL - 352
JO - Journal of Magnetic Resonance
JF - Journal of Magnetic Resonance
M1 - 107481
ER -