TY - JOUR
T1 - Assessment of prediction methods for protein structures determined by NMR in CASP14
T2 - Impact of AlphaFold2
AU - Huang, Yuanpeng Janet
AU - Zhang, Ning
AU - Bersch, Beate
AU - Fidelis, Krzysztof
AU - Inouye, Masayori
AU - Ishida, Yojiro
AU - Kryshtafovych, Andriy
AU - Kobayashi, Naohiro
AU - Kuroda, Yutaka
AU - Liu, Gaohua
AU - LiWang, Andy
AU - Swapna, G. V.T.
AU - Wu, Nan
AU - Yamazaki, Toshio
AU - Montelione, Gaetano T.
N1 - Publisher Copyright:
© 2021 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals LLC.
PY - 2021/12
Y1 - 2021/12
N2 - NMR studies can provide unique information about protein conformations in solution. In CASP14, three reference structures provided by solution NMR methods were available (T1027, T1029, and T1055), as well as a fourth data set of NMR-derived contacts for an integral membrane protein (T1088). For the three targets with NMR-based structures, the best prediction results ranged from very good (GDT_TS = 0.90, for T1055) to poor (GDT_TS = 0.47, for T1029). We explored the basis of these results by comparing all CASP14 prediction models against experimental NMR data. For T1027, NMR data reveal extensive internal dynamics, presenting a unique challenge for protein structure prediction methods. The analysis of T1029 motivated exploration of a novel method of “inverse structure determination,” in which an AlphaFold2 model was used to guide NMR data analysis. NMR data provided to CASP predictor groups for target T1088, a 238-residue integral membrane porin, was also used to assess several NMR-assisted prediction methods. Most groups involved in this exercise generated similar beta-barrel models, with good agreement with the experimental data. However, as was also observed in CASP13, some pure prediction groups that did not use any NMR data generated models for T1088 that better fit the NMR data than the models generated using these experimental data. These results demonstrate the remarkable power of modern methods to predict structures of proteins with accuracies rivaling solution NMR structures, and that it is now possible to reliably use prediction models to guide and complement experimental NMR data analysis.
AB - NMR studies can provide unique information about protein conformations in solution. In CASP14, three reference structures provided by solution NMR methods were available (T1027, T1029, and T1055), as well as a fourth data set of NMR-derived contacts for an integral membrane protein (T1088). For the three targets with NMR-based structures, the best prediction results ranged from very good (GDT_TS = 0.90, for T1055) to poor (GDT_TS = 0.47, for T1029). We explored the basis of these results by comparing all CASP14 prediction models against experimental NMR data. For T1027, NMR data reveal extensive internal dynamics, presenting a unique challenge for protein structure prediction methods. The analysis of T1029 motivated exploration of a novel method of “inverse structure determination,” in which an AlphaFold2 model was used to guide NMR data analysis. NMR data provided to CASP predictor groups for target T1088, a 238-residue integral membrane porin, was also used to assess several NMR-assisted prediction methods. Most groups involved in this exercise generated similar beta-barrel models, with good agreement with the experimental data. However, as was also observed in CASP13, some pure prediction groups that did not use any NMR data generated models for T1088 that better fit the NMR data than the models generated using these experimental data. These results demonstrate the remarkable power of modern methods to predict structures of proteins with accuracies rivaling solution NMR structures, and that it is now possible to reliably use prediction models to guide and complement experimental NMR data analysis.
KW - MipA
KW - integral membrane proteins
KW - machine leaning
KW - protein dynamics
KW - protein structure prediction
KW - solution NMR
KW - structure determination
UR - http://www.scopus.com/inward/record.url?scp=85116508656&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85116508656&partnerID=8YFLogxK
U2 - 10.1002/prot.26246
DO - 10.1002/prot.26246
M3 - Article
C2 - 34559429
AN - SCOPUS:85116508656
SN - 0887-3585
VL - 89
SP - 1959
EP - 1976
JO - Proteins: Structure, Function and Bioinformatics
JF - Proteins: Structure, Function and Bioinformatics
IS - 12
ER -