DisProt
In molecular biology, DisProt is a curated biological database collection of intrinsically unstructured proteins.[1][2] It is a community resource annotating protein sequences for intrinsically disorder regions from the literature. DisProt classifies intrinsic disorder based on experimental methods and three ontologies for molecular function, transition and binding partners.
Content | |
---|---|
Description | DisProt contains information and disorder annotation in intrinsically unstructured proteins. |
Data types captured | Annotation of intrinsically unstructured proteins |
Organisms | All |
Contact | |
Research center | Department of Biomedical Sciences at University of Padua |
Primary citation | PMID 31713636 |
Release date | September 2019 |
Access | |
Data format | CVS and JSON |
Website | |
Download URL | |
Web service URL | REST API see info here |
Miscellaneous | |
License | CC-BY |
Version | 8.0 |
Curation policy | Yes - manual curation from the literature |
Historically, the study of disordered proteins has been hampered by the lack of an organised resource collecting them and their properties together. Release 7 of DisProt[3] contains information on more than 800 proteins. Each protein entry in DisProt is characterised by a DisProt identifier which takes the form of the prefix DP followed by a 5 digit protein identifier. For example, DP00016 refers to the Cyclin-dependent kinase inhibitor 1 protein. Release 8 of DisProt[4] contains more than 1400 non ambiguous entries and over 3000 disordered protein regions. DisProt 8 also introduced the concept of a stable DisProt region identifier. DisProt has been widely used to train software methods to predict disordered regions in proteins. In addition, DisProt has been used to understand the properties of intrinsically unstructured proteins.[5]
Website
The DisProt website provides users with an interface to search by keyword, freetext, or by sequence similarity using BLAST. Users can also browse through the entries (proteins or regions) by their identifier, detection method or PubMed ID. The entire data set can be downloaded from the website in either CSV or JSON format.
DisProt web-server exposes some RESTful endpoints allowing programmatic access to DisProt and retrieval of different data types. Available GET routes provide access to all available data given a DisProt ID, a list of entries of a given type or a list of functional terms used for DisProt annotation.
External links
References
- Vucetic, Slobodan; Obradovic, Zoran; Vacic, Vladimir; Radivojac, Predrag; Peng, Kang; Iakoucheva, Lilia M.; Cortese, Marc S.; Lawson, J. David; Brown, Celeste J. (2005-01-01). "DisProt: a database of protein disorder". Bioinformatics. 21 (1): 137–140. doi:10.1093/bioinformatics/bth476. ISSN 1367-4803. PMID 15310560.
- Sickmeier, Megan; Hamilton, Justin A.; LeGall, Tanguy; Vacic, Vladimir; Cortese, Marc S.; Tantos, Agnes; Szabo, Beata; Tompa, Peter; Chen, Jake (2007-01-01). "DisProt: the Database of Disordered Proteins". Nucleic Acids Research. 35 (Database issue): D786–793. doi:10.1093/nar/gkl893. ISSN 1362-4962. PMC 1751543. PMID 17145717.
- Piovesan, Damiano; Tabaro, Francesco; Mičetić, Ivan; Necci, Marco; Quaglia, Federica; Oldfield, Christopher J.; Aspromonte, Maria Cristina; Davey, Norman E.; Davidović, Radoslav (2016-11-28). "DisProt 7.0: a major update of the database of disordered proteins". Nucleic Acids Research. 45 (D1): D219–D227. doi:10.1093/nar/gkw1056. ISSN 1362-4962. PMC 5210544. PMID 27899601.
- Hatos, András; Hajdu-Soltész, Borbála; Monzon, Alexander M.; Palopoli, Nicolas; Álvarez, Lucía; Aykac-Fas, Burcu; Bassot, Claudio; Benítez, Guillermo I.; Bevilacqua, Martina; Chasapi, Anastasia; Chemes, Lucia (2019). "DisProt: intrinsic protein disorder annotation in 2020". Nucleic Acids Research. doi:10.1093/nar/gkz975. PMID 31713636.
- Kovačević JJ (June 2012). "Computational analysis of position-dependent disorder content in DisProt database". Genomics Proteomics Bioinformatics. 10 (3): 158–65. doi:10.1016/j.gpb.2012.01.002. PMC 5056116. PMID 22917189.