RCSB Protein Data Bank: Sustaining a living digital data resource that enables breakthroughs in scientific research and biomedical education

Stephen K. Burley, Helen Berman, Cole Christie, Jose M. Duarte, Zukang Feng, John Westbrook, Jasmine Young, Christine Zardecki

Research output: Contribution to journalArticlepeer-review

213 Scopus citations

Abstract

The Protein Data Bank (PDB) is one of two archival resources for experimental data central to biomedical research and education worldwide (the other key Primary Data Archive in biology being the International Nucleotide Sequence Database Collaboration). The PDB currently houses >134,000 atomic level biomolecular structures determined by crystallography, NMR spectroscopy, and 3D electron microscopy. It was established in 1971 as the first open-access, digital-data resource in biology, and is managed by the Worldwide Protein Data Bank partnership (wwPDB; wwpdb.org). US PDB operations are conducted by the RCSB Protein Data Bank (RCSB PDB; RCSB.org; Rutgers University and UC San Diego) and funded by NSF, NIH, and DoE. The RCSB PDB serves as the global Archive Keeper for the wwPDB. During calendar 2016, >591 million structure data files were downloaded from the PDB by Data Consumers working in every sovereign nation recognized by the United Nations. During this same period, the RCSB PDB processed >5300 new atomic level biomolecular structures plus experimental data and metadata coming into the archive from Data Depositors working in the Americas and Oceania. In addition, RCSB PDB served >1 million RCSB.org users worldwide with PDB data integrated with ∼40 external data resources providing rich structural views of fundamental biology, biomedicine, and energy sciences, and >600,000 PDB101.rcsb.org educational website users around the globe. RCSB PDB resources are described in detail together with metrics documenting the impact of access to PDB data on basic and applied research, clinical medicine, education, and the economy.

Original languageEnglish (US)
Pages (from-to)316-330
Number of pages15
JournalProtein Science
Volume27
Issue number1
DOIs
StatePublished - Jan 2018

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Molecular Biology

Keywords

  • 3D electron microscopy
  • FAIR principles
  • NMR spectroscopy
  • PDB
  • PDBx/mmCIF
  • Protein Data Bank
  • RCSB
  • Research Collaboratory for Structure Bioinformatics
  • Worldwide Protein Data Bank
  • biocuration
  • chemical component dictionary
  • crystallography
  • data archive
  • data deposition
  • integrative/hybrid methods
  • macromolecular structure
  • metadata
  • open access
  • validation
  • wwPDB

Fingerprint

Dive into the research topics of 'RCSB Protein Data Bank: Sustaining a living digital data resource that enables breakthroughs in scientific research and biomedical education'. Together they form a unique fingerprint.

Cite this