Download Epitope prediction algorithms

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Monoclonal antibody wikipedia , lookup

Immunomics wikipedia , lookup

Polyclonal B cell response wikipedia , lookup

Molecular mimicry wikipedia , lookup

Transcript
Epitope prediction algorithms
Urmila Kulkarni-Kale
Bioinformatics Centre
University of Pune
Vaccine development
In Post-genomic era:
Reverse Vaccinology
Approach.
•
October 2K5
Rappuoli R. (2000). Reverse
vaccinology.
Curr
Opin
Microbiol. 3:445-450.
© Bioinformatics Centre, UoP
2
Genome Sequence
Proteomics
Technologies
In silico
analysis
DNA
microarrays
High throughput
Cloning and expression
In vitro and in vivo assays for
Vaccine candidate identification
Global
approach
to identify
new vaccine candidates
Octobergenomic
2K5
© Bioinformatics
Centre, UoP
3
In Silico Analysis
Peptide
Multiepitope
vaccines
VACCINOME
Candidate Epitope DB
Epitope prediction
Disease related protein DB
October 2K5
© Bioinformatics
Centre, Database
UoP
Gene/Protein
Sequence
4
What Are Epitopes?
Antigenic determinants or Epitopes are the
portions of the antigen molecules which are
responsible for specificity of the antigens in
antigen-antibody (Ag-Ab) reactions and
that combine with the antigen binding site
of Ab, to which they are complementary.
October 2K5
© Bioinformatics Centre, UoP
5
Types of Epitopes
• Sequential / Continuous epitopes:
• recognized by Th cells
• linear peptide fragments
• amphipathic helical 9-12 mer
• Conformational / Discontinuous epitopes:
• recognized by both Th & B cells
• non-linear discrete amino acid sequences, come together
due to folding
• exposed 15-22 mer
October 2K5
© Bioinformatics Centre, UoP
6
Properties of Epitopes
• They occur on the surface of the protein
and are more flexible than the rest of the
protein.
• They have high degree of exposure to the
solvent.
• The amino acids making the epitope are
usually charged and hydrophilic.
October 2K5
© Bioinformatics Centre, UoP
7
Methods to identify epitopes
1. Immunochemical methods
•
•
•
ELISA : Enzyme linked immunosorbent assay
Immunoflurorescence
Radioimmunoassay
2. X-ray crystallography: Ag-Ab complex is
crystallized and the structure is scanned for
contact residues between Ag and Ab. The contact
residues on the Ag are considered as the epitope.
3. Prediction methods: Based on the X-ray crystal
data available for Ag-Ab complexes, the
propensity of an amino acid to lie in an epitope is
calculated.
October 2K5
© Bioinformatics Centre, UoP
8
Antigen-Antibody (Ag-Ab) complexes
• Non-obligatory heterocomplexes that are made
and broken according to the environment
• Involve proteins (Ag & Ab) that must also exist
independently
• Remarkable feature:
– high affinity and strict specificity of antibodies for their
antigens.
• Ab recognize the unique conformations and spatial
locations on the surface of Ag
• Epitopes & paratopes are relational entities
October 2K5
© Bioinformatics Centre, UoP
9
Antigen-Antibody complex
October 2K5
© Bioinformatics Centre, UoP
10
Ab-binding sites:
Sequential & Conformational Epitopes!
Paratope
Sequential Conformational
Ab-binding sites
October 2K5
© Bioinformatics Centre, UoP
11
B cell epitope prediction algorithms :
•
•
•
•
•
Hopp and Woods –1981
Welling et al –1985
Parker & Hodges - 1986
Kolaskar & Tongaonkar – 1990
Kolaskar & Urmila Kulkarni - 1999
T cell epitope prediction algorithms :
•
•
•
•
Margalit, Spouge et al - 1987
Rothbard & Taylor – 1988
Stille et al –1987
Tepitope -1999
October 2K5
© Bioinformatics Centre, UoP
12
Hopp & Woods method
• Pioneering work
• Based on the fact that only the hydrophilic
nature of amino acids is essential for an
sequence to be an antigenic determinant
• Local hydrophilicity values are assigned to
each amino acid by the method of repetitive
averaging using a window of six
• Not very accurate
October 2K5
© Bioinformatics Centre, UoP
13
Welling’s method
• Based on the % of each aa present in
known epitopes compared with the % of
aa in the avg. composition of a protein.
• assigns an antigenicity value for each
amino acid from the relative occurrence
of the amino acid in an antigenic
determinant site.
• regions of 7 aa with relatively high
antigenicity are extended to 11-13 aa
depending on the antigenicity values of
neighboring residues.
October 2K5
© Bioinformatics Centre, UoP
14
Parker & Hodges method
• Utilizes 3 parameters :
– Hydrophilicity : HPLC
– Accessibility : Janin’s scale
– Flexibility : Karplus & Schultz
• Hydrophilicity parameter was calculated using
HPLC from retention co-efficients of model
synthetic peptides.
• Surface profile was determined by summing the
parameters for each residue of a seven-residue
segment and assigning the sum to the fourth
residue.
• One of the most useful prediction algorithms
October 2K5
© Bioinformatics Centre, UoP
15
Kolaskar & Tongaonkar’s method
• Semi-empirical method which uses
physiological properties of amino acid
residues
• frequencies of occurrence of amino acids in
experimentally known epitopes.
• Data of 169 epitopes from 34 different
proteins was collected of which 156 which
have less than 20 aa per determinant were
used.
• Antigen: EMBOSS
October 2K5
© Bioinformatics Centre, UoP
16
CEP Server
• Predicts the conformational epitopes from
X-ray crystals of Ag-Ab complexes.
• uses percent accessible surface area and
distance as criteria
October 2K5
© Bioinformatics Centre, UoP
17
An algorithm to map sequential and conformational
epitopes of protein antigens of known structure
October 2K5
© Bioinformatics Centre, UoP
18
October 2K5
© Bioinformatics Centre, UoP
19
CE: Beyond validation
• High accuracy:
– Limited data set to evaluate the algorithm
– Non-availability of true negative data sets
• Prediction of false positives?
– Are they really false positives?
• Limitation:
Different Abs
(HyHEL10 & D1.3)
have over-lapping
binding sites
– Limited by the availability of 3D structure data of
antigens
October 2K5
© Bioinformatics Centre, UoP
20
CE: Features
• The first algorithm for the prediction of
conformational epitopes or antibody
binding sites of protein antigens
• Maps both: sequential & conformational
epitopes
• Prerequisite: 3D structure of an antigen
October 2K5
© Bioinformatics Centre, UoP
21
CEP: Conformational Epitope Prediction Server
http://bioinfo.ernet.in/cep.htm
October 2K5
© Bioinformatics Centre, UoP
22
T-cell epitope prediction algorithms
• Considers amphipathic helix segments,
tetramer and pentamer motifs (charged
residues or glycine) followed by 2-3
hydrophobic residues and then a polar
residue.
• Sequence motifs of immunodominant
secondary structure capable of binding to
MHC with high affinity.
• Virtual matrices which are used for
predicting MHC polymorphism and anchor
residues.
October 2K5
© Bioinformatics Centre, UoP
23
• Case study: Design & development of
peptide vaccine against Japanese
encephalitis virus
October 2K5
© Bioinformatics Centre, UoP
24
We Have Chosen JE Virus,
Because
 JE virus is endemic in South-east Asia
including India.
 JE virus causes encephalitis in children
between 5-15 years of age with fatality rates
between 21-44%.
 Man is a "DEAD END" host.
October 2K5
© Bioinformatics Centre, UoP
25
We Have Chosen JE Virus,
Because
• Killed virus vaccine purified from mouse brain
is used presently which requires storage at
specific temperatures and hence not cost
effective in tropical countries.
• Protective prophylactic immunity is induced
only after administration of 2-3 doses.
• Cost
of
vaccination,
transportation is high.
October 2K5
© Bioinformatics Centre, UoP
storage
and
26
Predicted structure of JEVS
Mutations: JEVN/JEVS
October 2K5
© Bioinformatics Centre, UoP
27
October 2K5
© Bioinformatics Centre, UoP
28
CE of JEVN Egp
October 2K5
© Bioinformatics Centre, UoP
29
Species and Strain specific properties:
TBEV/ JEVN/JEVS
• Loop1 in TBEV:
• Loop1 in JEVN:
• Loop1 in JEVS:
LA EEH QGGT
HN EKR ADSS
HN KKR ADSS
Antibodies recognising TBEV and JEVN would require
exactly opposite pattern of charges in their CDR regions.
Further, modification in CDR is required to recognise
strain-specific region of JEVS.
October 2K5
© Bioinformatics Centre, UoP
30
Multiple alignment of Predicted TH-cell epitope in the JE_Egp
with corresponding epitopes in Egps of other Flaviviruses
426
457
JE
DFGSIGGVFNSIGKAVHQVFGGAFRTLFGGMS
MVE DFGSVGGVFNSIGKAVHQVFGGAFRTLFGGMS
WNE DFGSVGGVFTSVGKAIHQVFGGAFRSLFGGMS
KUN DFGSVGGVFTSVGKAVHQVFGGAFRSLFGGMS
SLE DFGSIGGVFNSIGKAVHQVFGGAFRTLFGGMS
DEN2 DFGSLGGVFTSIGKALHQVFGAIYGAAFSGVS
YF
DFSSAGGFFTSVGKGIHTVFGSAFQGLFGGLN
TBE DFGSAGGFLSSIGKAVHTVLGGAFNSIFGGVG
COMM DF S GG
S GK H V G
F G
Multiple alignment of JE_Egp with Egps of other Flaviviruses in
the YSAQVGASQ region.
151
183
JE
SENHGNYSAQVGASQAAKFTITPNAPSITLKLG
MVE STSHGNYSTQIGANQAVRFTISPNAPAITAKMG
WNE VESHG----KIGATQAGRFSITPSAPSYTLKLG
KUN VESHGNYFTQTGAAQAGRFSITPAAPSYTLKLG
SLE STSHGNYSEQIGKNQAARFTISPQAPSFTANMG
DEN2 HAVGNDTG-----KHGKEIKITPQSSTTEAELT
YF
QENWN--------TDIKTLKFDALSGSQEVEFI
October 2K5
© Bioinformatics Centre, UoP
31
TBE VAANETHS----GRKTASFTIS--SEKTILTMG
Peptide Modeling
Initial random conformation
Force field: Amber
Distance dependent dielectric constant 4rij
Geometry optimization: Steepest descents & Conjugate gradients
Molecular dynamics at 400 K for 1ns
Peptides are:
SENHGNYSAQVGASQ
NHGNYSAQVGASQ
YSAQVGASQ
YSAQVGASQAAKFT
NHGNYSAQVGASQAAKFT
SENHGNYSAQVGASQAAKFT
149
168
October 2K5
© Bioinformatics Centre, UoP
33
October 2K5
© Bioinformatics Centre, UoP
34
Relevant Publications & Patent
• Urmila Kulkarni-Kale, Shriram Bhosale, G. Sunitha Manjari, Ashok
Kolaskar, (2004). VirGen: A comprehensive viral genome resource.
Nucleic Acids Research 32:289-292.
• Urmila Kulkarni-Kale & A. S. Kolaskar (2003). Prediction of 3D
structure of envelope glycoprotein of Sri Lanka strain of Japanese
encephalitis virus. In Yi-Ping Phoebe Chen (ed.), Conferences in
research and practice in information technology. 19:87-96.
• A. S. Kolaskar & Urmila Kulkarni-Kale (1999) Prediction of threedimensional structure and mapping of conformational antigenic
determinants of envelope glycoprotein of Japanese encephalitis virus.
Virology. 261:31-42.
Patent: Chimeric T helper-B cell peptide as a vaccine for Flaviviruses.
Dr. M. M. Gore, Dr. S.S. Dewasthaly, Prof. A.S. Kolaskar, Urmila Kulkarni-Kale
Sangeeta Sawant WO 02/053182 A1
October 2K5
© Bioinformatics Centre, UoP
35
Important references
•
•
•
•
•
•
Hopp, Woods, 1981, Prediction of protein antigenic determinants from
amino acid sequences, PNAS U.S.A 78, 3824-3828
Parker, Hodges et al, 1986, New hydrophilicity scale derived from high
performance liquid chromatography peptide retention data: Correlation
of predicted surface residues with antigenicity and X-ray derived
accessible sites, Biochemistry:25, 5425-32
Kolaskar, Tongaonkar, 1990, A semi empirical method for prediction of
antigenic determinants on protein antigens, FEBS 276, 172-174
Men‚ndez-Arias, L. & Rodriguez, R. (1990), A BASIC microcomputer
program forprediction of B and T cell epitopes in proteins, CABIOS, 6,
101-105
Peter S. Stern (1991), Predicting antigenic sites on proteins, TIBTECH, 9,
163-169
A.S. Kolaskar and Urmila Kulkarni-Kale, 1999 - Prediction of threedimensional structure and mapping of conformational epitopes of
envelope glycoprotein of Japanese encephalitis virus,Virology, 261, 31-42
October 2K5
© Bioinformatics Centre, UoP
36