Download Improving Function Prediction Using Patterns of Native Disorder in

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Rosetta@home wikipedia , lookup

Protein design wikipedia , lookup

Circular dichroism wikipedia , lookup

Structural alignment wikipedia , lookup

Protein folding wikipedia , lookup

Bimolecular fluorescence complementation wikipedia , lookup

Protein domain wikipedia , lookup

List of types of proteins wikipedia , lookup

Protein wikipedia , lookup

Cyclol wikipedia , lookup

Protein purification wikipedia , lookup

Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup

Proteomics wikipedia , lookup

Western blot wikipedia , lookup

Protein moonlighting wikipedia , lookup

Protein mass spectrometry wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Homology modeling wikipedia , lookup

Protein structure prediction wikipedia , lookup

Intrinsically disordered proteins wikipedia , lookup

Transcript
Improving Function Prediction Using Patterns of Native Disorder in Proteins
Anna Lobley
Abstract
Instrinsically unstructured (disordered) proteins adopt little or no stable secondary
structure in their native state. Proteins containing long disordered regions are
abundant within eukaryotic genomes and can be predicted successfully from amino
sequence. Disordered regions have been shown to be important for functional
specificity and frequently contain binding motifs or are located at sites of covalent
modification often inferring a regulatory role for the protein. Computational methods
that predict protein function from sequence rely upon the use of homology
information to transfer annotations between proteins. These methods are not
applicable to orphan proteins or cases where whole families of protein sequences are
not annotated. To address the requirement for protein function prediction methods that
are independent of sequence homology and explore the use of information describing
protein disorder, we have implemented a machine learning method for predicting
protein function from sequence. A set of features for encoding disorder information
was designed and their importance in predicting Gene Ontology (GO) categories
demonstrated. The addition of disorder features significantly improved prediction of
many GO categories. The method has
been benchmarked against a competing
method and the practical use of the classifiers demonstrated through the annotation of
a set of orphan and unknown human proteins.