TY - JOUR
T1 - Analysis
T2 - Analysis of impact metrics for the protein data bank
AU - Markosian, Christopher
AU - Di Costanzo, Luigi
AU - Sekharan, Monica
AU - Shao, Chenghua
AU - Burley, Stephen K.
AU - Zardecki, Christine
N1 - Funding Information:
The RCSB PDB is jointly funded by the National Science Foundation, the National Institutes of Health, and the Department of Energy (NSF-DBI 1338415). We gratefully acknowledge contributions from all members of the Research Collaboratory for Structural Bioinformatics PDB, our Worldwide Protein Data Bank partners, and the Rutgers Office of Research Analytics. We thank Laura Bowering Mullen and Laura Palumbo (Rutgers University), Anthony Molisani (Clarivate Analytics), and Ludo Waltman (Leiden University) for guidance.
Funding Information:
Competing Interests: C.M. declares no competing interests. L.D., M.S., C.S., S.K.B., and C.Z. receive research support from a grant to the RCSB PDB (DBI-1338415) from the National Science Foundation, the National Institutes of Health, and the US Department of Energy.
PY - 2018
Y1 - 2018
N2 - Since 1971, the Protein Data Bank (PDB) archive has served as the single, global repository for open access to atomic-level data for biological macromolecules. The archive currently holds >140,000 structures (>1 billion atoms). These structures are the molecules of life found in all organisms. Knowing the 3D structure of a biological macromolecule is essential for understanding the molecule’s function, providing insights in health and disease, food and energy production, and other topics of concern to prosperity and sustainability. PDB data are freely and publicly available, without restrictions on usage. Through bibliometric and usage studies, we sought to determine the impact of the PDB across disciplines and demographics. Our analysis shows that even though research areas such as molecular biology and biochemistry account for the most usage, other fields are increasingly using PDB resources. PDB usage is seen across 150 disciplines in applied sciences, humanities, and social sciences. Data are also re-used and integrated with >400 resources. Our study identifies trends in PDB usage and documents its utility across research disciplines.
AB - Since 1971, the Protein Data Bank (PDB) archive has served as the single, global repository for open access to atomic-level data for biological macromolecules. The archive currently holds >140,000 structures (>1 billion atoms). These structures are the molecules of life found in all organisms. Knowing the 3D structure of a biological macromolecule is essential for understanding the molecule’s function, providing insights in health and disease, food and energy production, and other topics of concern to prosperity and sustainability. PDB data are freely and publicly available, without restrictions on usage. Through bibliometric and usage studies, we sought to determine the impact of the PDB across disciplines and demographics. Our analysis shows that even though research areas such as molecular biology and biochemistry account for the most usage, other fields are increasingly using PDB resources. PDB usage is seen across 150 disciplines in applied sciences, humanities, and social sciences. Data are also re-used and integrated with >400 resources. Our study identifies trends in PDB usage and documents its utility across research disciplines.
UR - http://www.scopus.com/inward/record.url?scp=85054898013&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054898013&partnerID=8YFLogxK
U2 - 10.1038/sdata.2018.212
DO - 10.1038/sdata.2018.212
M3 - Article
C2 - 30325351
AN - SCOPUS:85054898013
VL - 5
JO - Scientific data
JF - Scientific data
SN - 2052-4463
M1 - 180212
ER -