ProtaBank: A repository for protein design and engineering data

Connie Y. Wang, Paul M. Chang, Marie L. Ary, Benjamin D. Allen, Roberto A. Chica, Stephen L. Mayo, Barry D. Olafson

Research output: Contribution to journalArticlepeer-review

38 Scopus citations


We present ProtaBank, a repository for storing, querying, analyzing, and sharing protein design and engineering data in an actively maintained and updated database. ProtaBank provides a format to describe and compare all types of protein mutational data, spanning a wide range of properties and techniques. It features a user-friendly web interface and programming layer that streamlines data deposition and allows for batch input and queries. The database schema design incorporates a standard format for reporting protein sequences and experimental data that facilitates comparison of results across different data sets. A suite of analysis and visualization tools are provided to facilitate discovery, to guide future designs, and to benchmark and train new predictive tools and algorithms. ProtaBank will provide a valuable resource to the protein engineering community by storing and safeguarding newly generated data, allowing for fast searching and identification of relevant data from the existing literature, and exploring correlations between disparate data sets. ProtaBank invites researchers to contribute data to the database to make it accessible for search and analysis. ProtaBank is available at

Original languageEnglish (US)
Pages (from-to)1113-1124
Number of pages12
JournalProtein Science
Issue number6
StatePublished - Jun 2018

All Science Journal Classification (ASJC) codes

  • Biochemistry
  • Molecular Biology


Dive into the research topics of 'ProtaBank: A repository for protein design and engineering data'. Together they form a unique fingerprint.

Cite this