Download Prediction of B cell epitopes

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Signal transduction wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Paracrine signalling wikipedia , lookup

Western blot wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Antibody wikipedia , lookup

Protein structure prediction wikipedia , lookup

Monoclonal antibody wikipedia , lookup

Polyclonal B cell response wikipedia , lookup

Transcript
Prediction of B cell epitopes
Pernille Haste Andersen
Immunological Bioinformatics
CBS, DTU
[email protected]
B cells and antibodies

Antibodies are produced by B lymphocytes
(B cells)

Antibodies circulate in the blood


They are referred to as “the first line of
defense” against infection
Antibodies play a central role in immunity
by attaching to pathogens and recruiting
effector systems that kill the invader
What is a B cell epitope?
B cell epitopes

Accessible and
recognizable structural
feature of a pathogen
molecule (antigen)

Antibodies are developed
to bind the epitope with
high affinity by using the
complementarity
determining regions
(CDRs)
Antibody Fab
fragment
B cell epitope
Motivations for prediction of B cell epitopes
 Prediction of B cell epitopes can potentially
guide experimental epitope mapping
 Predictions of antigenicity in proteins can be
used for selecting subunits in rational vaccine
design
 Predictions of B cell epitopes may also be
valuable for interpretation of results from
experiments based on antibody affinity binding
such as ELISA, RIA and western blotting
Computational Rational Vaccine
Design
>PATHOGEN PROTEIN
KVFGRCELAAAMKRHGLDNYR
GYSLGNWVCAAKFESNF
Rational Vaccine
Design
B cell epitopes, linear or
discontinuous?
 Classified into linear (~10%) and discontinuous
epitopes (~90%)
 Databases: AntiJen, IEDB, BciPep, Los Alamos
HIV database, Protein Data Bank
 Large amount of data available for linear epitopes
 Few data available for discontinuous epitopes
 In general, B cell epitope prediction methods have
relatively low performances
Discontinuous B cell epitopes
An example: The epitope of
the outer surface protein A
from Borrelia Burgdorferi
(1OSP)
SLDEKNSVSVDLPGEMKVLV
SKEKNKDGKYDLIATVDKLEL
KGTSDKNNGSGVLEGVKADK
CKVKLTISDDLGQTTLEVFKE
DGKTLVSKKVTSKDKSSTEE
KFNEKGEVSEKIITRADGTRLE
YTGIKSDGSGKAKEVLKG
•
..\Discotope\1OSP_epitope\1OSP_epitope.psw
A data set of 3D discontinuous
epitopes
 A data set of 75 discontinuous epitopes was
compiled from structures of antibodies/protein
antigen complexes in the PDB
 The data set has been used for developing a method
for predictions of discontinuous B cell epitopes
 Since about 30 of the PDB entries represented
Lysozyme, I have used homology grouping (25
groups of non-homologous antigens) and 5 fold
cross-validation for training of the method
 Performance was measured using ROC curves on a
per antigen basis, and by weighted averaging of AUC
values
Epitope log-odds ratios
Table 1. The Parker hydrophilicity
scale and epitope log-odds ratiosa
 Frequencies of amino acids in epitopes
Amino acid
Parker
Log-odds
Ratios
D
2.460
0.691
compared to frequencies of non-epitopes
E
1.860
0.346
N
1.640
1.242
S
1.500
-0.145
Q
1.370
1.082
G
1.280
0.189
K
1.260
1.136
T
1.150
-0.233
R
0.870
1.180
P
0.300
1.164
H
0.300
1.098
C
0.110
-3.519
A
0.030
-1.522
Y
-0.780
0.030
V
-1.270
-1.474
M
-1.410
0.273
I
-2.450
-0.713
F
-2.780
-1.147
L
-2.870
-1.836
W
-3.000
-0.064
Several discrepancies compared to the
Parker hydrophilicity scale which is often
used for epitope prediction
Both methods are used for predictions
using a sequential average of scores
Predictive performance of B cell epitopes:
Parker
0.614 AUC
Epitope log–odds
0.634 AUC
a
Amino acids are listed with descending
hydrophilicity using the values of the Parker
scale.
3D information: Contact numbers
Surface exposure and
structural protrusion can
be measured by residue
contact numbers
The predictive performance:
Parker
Epitope log–odds
Contact numbers
0.614 AUC
0.634 AUC
0.647 AUC
DiscoTope : Prediction of Discontinuous
epiTopes using 3D structures
A combination of:
– Sequentially averaged epitope logodds values of residues in spatial
proximity
– Contact numbers
.LIST..FVDEKRPGSDIVED……ALILKDENKTTVI.
-0.145
+0.691+0.346+1.136+1.180+1.164
+0.346
Contact number :
K 10
+1.136
Sum of log-odds
values
DiscoTope prediction
value
DiscoTope : Prediction of Discontinuous
epiTopes
 Improved prediction of residues in discontinuous B
cell epitopes in the data set
 The predictive performance on B cell epitopes:
Parker
0.614 AUC
Epitope log–odds
0.634 AUC
Contact numbers
0.647 AUC
DiscoTope
0.711 AUC
Evaluation example AMA1
• Apical membrane antigen 1 from
Plasmodium falciparum (not used
for training/testing)
• Two epitopes were identified using
phage-display, point-mutation
(black side chains) and sequence
variance analysis (side chains of
polyvalent residues in yellow)
• Most residues identified as epitopes
were successfully predicted by
DiscoTope (green backbone)
..\Discotope\1Z40_epitope\1Z40_movie.mov
DiscoTope is available as web server:
http://www.cbs.dtu.dk/services/DiscoTope/
Future improvements
 Add epitope predictions for protein-protein
complexes
 Visualization of epitopes integrated in web server
 Testing a score for sequence variability fx based on
entropy of positions in the antigens
 Combination with glycosylation site predictions
 Combination with predictions of trans-membrane
regions
 Assembling predicted residues into whole epitopes
Presentation of the web server
Presentation of the web server
output
Acknowledgements
DiscoTope
Ole Lund
 Ideas, supervision and support
Morten Nielsen
 Ideas, development of method and web server
Nicholas Gauthier
 Improving the method, improving the web server