Popular AA-level structure descriptors include solvent accessibility, secondary structure, torsion angles, intrinsic disorder and flexibility. Intermediate level annotations, also called 1D descriptors ( 4, 5), describe structural and functional features of the AAs that compose protein chains. Protein-level data can be collected from several resources, including the manually reviewed Swiss-Prot and computationally annotated TrEMBL ( 1, 3). The primary repository of atomic-level information is the Protein Data Bank (PDB) ( 2), which currently houses ∼160 thousand protein structures. The functions and structures of proteins can be annotated at three levels of resolution: atomic, amino-acid (AA) and whole-protein. Īs the amount of sequence data grows rapidly, currently including over 189 million protein coding regions in the UniProt release 2020_04 ( 1), scientists face the huge task to characterize novel proteins functionally and structurally. Future releases will expand the coverage of DescribePROT. The putative annotations included by DescriPROT are useful for a broad range of studies, including: investigations of protein function, applied projects focusing on therapeutics and diseases, and in the development of predictors for other protein sequence descriptors. The predictions can be accesses via an interactive graphical interface that allows simultaneous analysis of multiple descriptors and can be also downloaded in structured formats at the protein, proteome and whole database scale. The pre-computed results are made available instantaneously. Users can search DescribePROT by the amino acid sequence and the UniProt accession number and entry name. The descriptors encompass sequence conservation, position specific scoring matrix, secondary structure, solvent accessibility, intrinsic disorder, disordered linkers, signal peptides, MoRFs and interactions with proteins, DNA and RNAs. The current version includes 7.8 billion predictions for close to 600 million amino acids in 1.4 million proteins. DescribePROT delivers a comprehensive collection of 13 complementary descriptors predicted using 10 popular and accurate algorithms for 83 complete proteomes that cover key model organisms. We present DescribePROT, the database of predicted amino acid-level descriptors of structure and function of proteins.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |