DOI badge

Last updated on Tue Dec 1 14:24:26 2020.

The Protein Data Bank (PDB) is a repository of experimentally determined three-dimensional structures of biological macromolecules (mostly proteins and nucleic acids). The structures it contains are very useful by themselves for answering biological questions, or for asking even more questions. In addition, the associated metadata (structure annotations) can also answer many interesting questions.

The PDB already provides statistics on some of its metadata, but these are very general in scope. The PDB in Europe (PDBe) provides programmatic access to the database through the PDBe API. By collecting appropriate metadata from the database, one can get much finer insight, for example specific to a particular field of structural biology.

This website presents the results of analyzing metadata from the Protein Data Bank to answer questions I am interested in. Some of these analyses are part of a series of posts on my blog; this website consolidates the results in a format that is easier to keep up-to-date and consult. I will add more analyses as I find the need. Each analysis indicates the date when it was last run, and I will try to update the entire site about once a month. If you notice it is out of date, feel free to contact me and I will update it.

You are free to use all figures and code used to generate them, provided you give credit appropriately.

LS0tCnRpdGxlOiAiSW5zaWdodHMgZnJvbSB0aGUgUERCIgotLS0KClshW0RPSSBiYWRnZV1bZG9pLWJhZGdlXV1bZG9pLWxpbmtdCgoqKkxhc3QgdXBkYXRlZCBvbiBgciBkYXRlKClgLioqCgpUaGUgW1Byb3RlaW4gRGF0YSBCYW5rXVtwZGJdIChQREIpIGlzIGEgcmVwb3NpdG9yeSBvZiBleHBlcmltZW50YWxseSBkZXRlcm1pbmVkCnRocmVlLWRpbWVuc2lvbmFsIHN0cnVjdHVyZXMgb2YgYmlvbG9naWNhbCBtYWNyb21vbGVjdWxlcyAobW9zdGx5IHByb3RlaW5zIGFuZApudWNsZWljIGFjaWRzKS4gVGhlIHN0cnVjdHVyZXMgaXQgY29udGFpbnMgYXJlIHZlcnkgdXNlZnVsIGJ5IHRoZW1zZWx2ZXMgZm9yCmFuc3dlcmluZyBiaW9sb2dpY2FsIHF1ZXN0aW9ucywgb3IgZm9yIGFza2luZyBldmVuIG1vcmUgcXVlc3Rpb25zLiBJbiBhZGRpdGlvbiwKdGhlIGFzc29jaWF0ZWQgKm1ldGFkYXRhKiAoc3RydWN0dXJlIGFubm90YXRpb25zKSBjYW4gYWxzbyBhbnN3ZXIgbWFueQppbnRlcmVzdGluZyBxdWVzdGlvbnMuCgpUaGUgUERCIGFscmVhZHkgcHJvdmlkZXMgW3N0YXRpc3RpY3Mgb24gc29tZSBvZiBpdHMgbWV0YWRhdGFdW3BkYi1zdGF0c10sIGJ1dAp0aGVzZSBhcmUgdmVyeSBnZW5lcmFsIGluIHNjb3BlLiBUaGUgW1BEQiBpbiBFdXJvcGVdW3BkYmVdIChQREJlKSBwcm92aWRlcwpwcm9ncmFtbWF0aWMgYWNjZXNzIHRvIHRoZSBkYXRhYmFzZSB0aHJvdWdoIHRoZSBbUERCZSBBUEldW3BkYmUtc2VhcmNoXS4gQnkKY29sbGVjdGluZyBhcHByb3ByaWF0ZSBtZXRhZGF0YSBmcm9tIHRoZSBkYXRhYmFzZSwgb25lIGNhbiBnZXQgbXVjaCBmaW5lcgppbnNpZ2h0LCBmb3IgZXhhbXBsZSBzcGVjaWZpYyB0byBhIHBhcnRpY3VsYXIgZmllbGQgb2Ygc3RydWN0dXJhbCBiaW9sb2d5LgoKVGhpcyB3ZWJzaXRlIHByZXNlbnRzIHRoZSByZXN1bHRzIG9mIGFuYWx5emluZyBtZXRhZGF0YSBmcm9tIHRoZSBQcm90ZWluIERhdGEKQmFuayB0byBhbnN3ZXIgcXVlc3Rpb25zIEkgYW0gaW50ZXJlc3RlZCBpbi4gU29tZSBvZiB0aGVzZSBhbmFseXNlcyBhcmUgcGFydCBvZgpbYSBzZXJpZXMgb2YgcG9zdHMgb24gbXkgYmxvZ11bYmxvZ107IHRoaXMgd2Vic2l0ZSBjb25zb2xpZGF0ZXMgdGhlIHJlc3VsdHMgaW4gYQpmb3JtYXQgdGhhdCBpcyBlYXNpZXIgdG8ga2VlcCB1cC10by1kYXRlIGFuZCBjb25zdWx0LiBJIHdpbGwgYWRkIG1vcmUgYW5hbHlzZXMKYXMgSSBmaW5kIHRoZSBuZWVkLiBFYWNoIGFuYWx5c2lzIGluZGljYXRlcyB0aGUgZGF0ZSB3aGVuIGl0IHdhcyBsYXN0IHJ1biwgYW5kIEkKd2lsbCB0cnkgdG8gdXBkYXRlIHRoZSBlbnRpcmUgc2l0ZSBhYm91dCBvbmNlIGEgbW9udGguIElmIHlvdSBub3RpY2UgaXQgaXMgb3V0Cm9mIGRhdGUsIGZlZWwgZnJlZSB0byBbY29udGFjdCBtZV1bY29udGFjdF0gYW5kIEkgd2lsbCB1cGRhdGUgaXQuCgpZb3UgYXJlIGZyZWUgdG8gdXNlIGFsbCBmaWd1cmVzIGFuZCBjb2RlIHVzZWQgdG8gZ2VuZXJhdGUgdGhlbSwgW3Byb3ZpZGVkIHlvdQpnaXZlIGNyZWRpdCBhcHByb3ByaWF0ZWx5XVtwZXJtaXNzaW9uc10uCgojIyBDdXJyZW50bHkgYXZhaWxhYmxlIGFuYWx5c2VzCgotIFtOdWNsZW9zb21lIHN0cnVjdHVyZXNdKG51Y2xlb3NvbWUtc3RydWN0dXJlcy5odG1sKQotIFtETkEgbGVuZ3RoIGluIHByb3RlaW4tRE5BIGNvbXBsZXhlc10oZG5hLWxlbmd0aC1pbi1wcm90ZWluLWRuYS1jb21wbGV4ZXMuaHRtbCkKLSBbRE5BIGxlbmd0aCBpbiBzdHJ1Y3R1cmVzIG9mIGZyZWUgRE5BXShmcmVlLWRuYS5odG1sKQoKCltkb2ktYmFkZ2VdOiBodHRwczovL3plbm9kby5vcmcvYmFkZ2UvZG9pLzEwLjUyODEvemVub2RvLjM0NzAxMTkuc3ZnCgpbZG9pLWxpbmtdOiBodHRwczovL2RvaS5vcmcvMTAuNTI4MS96ZW5vZG8uMzQ3MDExOQoKW3BkYl06IGh0dHBzOi8vZW4ud2lraXBlZGlhLm9yZy93aWtpL1Byb3RlaW5fRGF0YV9CYW5rCgpbcGRiLXN0YXRzXTogaHR0cHM6Ly93d3cucmNzYi5vcmcvc3RhdHMvCgpbcGRiZV06IGh0dHBzOi8vd3d3LmViaS5hYy51ay9wZGJlCgpbcGRiZS1zZWFyY2hdOiBodHRwczovL3d3dy5lYmkuYWMudWsvcGRiZS9hcGkvZG9jL3NlYXJjaC5odG1sCgpbYmxvZ106IGh0dHBzOi8vd3d3LmdhdWxsaWVyLm9yZy9lbi9jYXRlZ29yaWVzL2luc2lnaHRzLWZyb20tdGhlLXBkYi8KCltjb250YWN0XTogYWJvdXQuaHRtbCNjb250YWN0CgpbcGVybWlzc2lvbnNdOiBhYm91dC5odG1sI3Blcm1pc3Npb25zCg==