Just How Good Are Protein Disorder Prediction Programs?
Disorder in proteins is vital for biological function, and structural disorder in proteins is more pervasive than you might think. Proteins with disordered regions may also be sticky, clumping together inside and between cells, and are directly implicated in a number of neurodegenerative diseases. Being able to identify disordered regions in proteins is therefore highly important.
Unfortunately, it is challenging and time-consuming to characterize the structural propensities of polypeptides experimentally, and therefore bioinformatics methods for predicting protein disorder from sequence are indispensable.
In recent years, many bioinformaticians have therefore constructed algorithms to differentiate peptide sequences that fold from those that do not. These algorithms can be based on various 'features' derived from physicochemical parameters (such as the charge or hydrophobicity of an amino acid), as well as on evolutionary relatedness.
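To make the idea of sequence-derived 'features' concrete, here is a minimal sketch in Python of two physicochemical features computed over a sliding window: mean hydropathy and mean net charge. It is an illustration only, not the method of any particular predictor. The function name `window_features`, the default window size, and the simplified charge table (histidine treated as neutral) are assumptions made for this example; the hydropathy values are the published Kyte-Doolittle scale.

```python
# Kyte-Doolittle hydropathy scale (higher = more hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Simplified side-chain charges near neutral pH.
# Assumption for this sketch: histidine is treated as uncharged.
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def window_features(seq, window=5):
    """Return a (mean hydropathy, mean net charge) pair for each residue.

    Each pair is averaged over a window centred on the residue; windows
    are truncated at the ends of the sequence.
    """
    half = window // 2
    features = []
    for i in range(len(seq)):
        chunk = seq[max(0, i - half): i + half + 1]
        hydropathy = sum(KD[aa] for aa in chunk) / len(chunk)
        charge = sum(CHARGE.get(aa, 0) for aa in chunk) / len(chunk)
        features.append((hydropathy, charge))
    return features
```

Stretches with low mean hydropathy and high net charge are often associated with disorder, which is why features of this kind are a common starting point; real predictors combine many more features and are trained on experimental data.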
Now that many such prediction programs have become available, it is of obvious value to have a benchmark against which to validate and test their predictions. To meet this need, Nielsen and Mulder generated and validated a representative experimental benchmarking set of site-specific and continuous disorder, using deposited NMR chemical shift data for more than a hundred selected proteins. They then analysed the performance of 26 widely used disorder prediction methods and found that their performance varies noticeably.
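As a rough illustration of what benchmarking against continuous, site-specific reference values can look like, the Python sketch below scores each predictor by the Pearson correlation between its per-residue output and the reference. This is a generic comparison chosen for the example, not necessarily the evaluation used by Nielsen and Mulder. The function names `pearson` and `rank_predictors` and any predictor names passed in are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    if len(x) != len(y):
        raise ValueError("score lists must have the same length")
    n = len(x)
    if n == 0:
        return 0.0
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # correlation is undefined for constant input
    return cov / (sd_x * sd_y)


def rank_predictors(reference, predictions):
    """Sort predictors by agreement with the reference, best first.

    `reference` is a list of per-residue disorder values from experiment;
    `predictions` maps a predictor name to its per-residue scores.
    """
    scored = [(name, pearson(reference, scores))
              for name, scores in predictions.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

Given reference values for one protein and the output of several programs, `rank_predictors(reference, {"predictor_A": [...], "predictor_B": [...]})` returns the programs ordered by how closely they track the experimental values.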
The thorough comparison presented in their research will help protein scientists around the globe to make better-informed choices about which programs are best to use.
This article has been republished from materials provided by the Interdisciplinary Nanoscience Center at Aarhus University. Note: material may have been edited for length and content. For further information, please contact the cited source.
Reference: Jakob T. Nielsen and Frans A. A. Mulder. 2019. Quality and bias of protein disorder predictors. Scientific Reports. DOI: https://doi.org/10.1038/s41598-019-41644-w.