Download Name of Student: Dominik Sommerfeld

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

P-type ATPase wikipedia , lookup

Biochemistry wikipedia , lookup

Magnesium transporter wikipedia , lookup

Protein moonlighting wikipedia , lookup

Molecular evolution wikipedia , lookup

Ubiquitin wikipedia , lookup

Enzyme wikipedia , lookup

Protein wikipedia , lookup

Genetic code wikipedia , lookup

List of types of proteins wikipedia , lookup

Proteasome wikipedia , lookup

Self-assembling peptide wikipedia , lookup

Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup

Homology modeling wikipedia , lookup

Protein (nutrient) wikipedia , lookup

Western blot wikipedia , lookup

G protein–coupled receptor wikipedia , lookup

Peptide synthesis wikipedia , lookup

Point mutation wikipedia , lookup

Catalytic triad wikipedia , lookup

Protein adsorption wikipedia , lookup

Ultrasensitivity wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Cell-penetrating peptide wikipedia , lookup

Ancestral sequence reconstruction wikipedia , lookup

Protein structure prediction wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Ribosomally synthesized and post-translationally modified peptides wikipedia , lookup

Bottromycin wikipedia , lookup

Transcript
Name of Student:
Dominik Sommerfeld
Research Supervisor: Dr. Steven Pelech
Title of Presentation: Determination of the Substrate Specificities of Protein Kinases by Prediction Algorithms and Substrate Peptide Microarrays
Abstract
Background: Protein kinases play a virtually universal role in the regulation of eukaryotic cellular processes by phosphorylating a plethora of protein (and lipid) substrates. Over two thirds of the proteins encoded by
the human genome are subjected to phosphorylation on multiple sites, and there may be in excess of
500,000 phosphorylation sites (P-sites) in the human proteome. To ensure signalling fidelity amidst the vast
complement of potential substrate P-sites, protein kinases must exert specificity for their substrates and act
only on a defined subset of cellular targets. In addition to various extrinsic, contextual factors (including
interaction with protein scaffolds, localization, and expression) substrate recognition by a particular kinase
is also largely determined by molecular recognition of the amino acid sequence surrounding a target P-site.
An important aspect of substrate peptide specificity is that the target P-site falls within a consensus sequence
motif, which exhibits structural and chemical complementarity to the kinase active site. Consensus P-site
motifs for individual kinases have been determined through alignment of known substrate P-sites, screening
of substrate peptide libraries in solution and arrays, and systematic mutation of substrate sequences. However, the substrate consensus sequences for most of the 516 human protein kinases remain unknown.
In the last 5 years there have been several attempts to predict kinase specificities for protein substrate recognition. At Kinexus Bioinformatics, we developed an algorithm to predict the specificities of all typical protein kinases (Kinase Specificity Predictor 1.0). This algorithm utilizes mutual information and charge information theory to identify Specificity Determining Residues (SDRs) in the catalytic domains of kinases and
predict the preferred substrate sequences (13 amino acid residues in length) for 488 typical human protein
kinase catalytic domains. It generates a frequency table or Position Specific Scoring Matrix (PSSM) that
provides the probability of occurrence for each of the 20 amino acids at each of the 12 positions surrounding
a target P-site for a specific protein kinase. While the accuracy of our algorithm has been tested in silico by
comparing predicted consensus sequences with known substrate sequences, it has yet to be validated empirically by in vitro assays.
Methodology: To test the accuracy of our algorithm, we used the PSSM for each of 488 typical human protein kinases to generate a list of ‘optimal’ substrate peptide sequences. We selected 445 peptide sequences
(omitting nearly identical consensus sequences for highly related kinases) and printed them onto glass slide
microarrays. Peptide microarrays are incubated with purified kinases (and ATP), and phosphorylation of the
peptides is detected using the fluorescent Pro-Q Diamond® phosphoprotein/peptide stain, which directly
binds phosphate groups on phosphorylated peptides. The degree of phosphorylation of a peptide substrate,
as measured by the signal intensity from respective spots, is used as a measure of its ‘efficiency’ as a substrate for a particular kinase. Peptide substrates are ranked based on signal intensities (or degree of phosphorylation), and the sequences of the top ranking peptides are aligned and used to derive consensus sequences for the respective kinases.
Progress: We are currently testing 204 purified recombinant human protein kinases in high-throughput substrate phosphorylation assays. To date we have tested 112 kinases, and these different kinases exhibit distinct substrate fingerprints/phosphorylation profiles. Our results indicate that our first generation kinase
specificity algorithm is capable of predicting optimal substrate sequences with high accuracy.
The substrate phosphorylation data obtained from testing all of these protein kinases against our peptide
microarrays will be used to expand the input datasets for training of our algorithm. In addition, adoption of
3D structural information of kinase catalytic domains, the incorporation of a larger number of SDRs, and
adoption of an empirically-derived amino acid interaction table will also be included in the development of a
second generation Kinase Specificity Predictor algorithm with improved accuracy and predictive power for
mapping phosphoproteome interactions.