Download Alignment between domain region and whole enzyme

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Protein wikipedia , lookup

Ancestral sequence reconstruction wikipedia , lookup

Point mutation wikipedia , lookup

Clinical neurochemistry wikipedia , lookup

Interactome wikipedia , lookup

Drug design wikipedia , lookup

Biochemistry wikipedia , lookup

Amino acid synthesis wikipedia , lookup

NADH:ubiquinone oxidoreductase (H+-translocating) wikipedia , lookup

Enzyme inhibitor wikipedia , lookup

Proteolysis wikipedia , lookup

Biosynthesis wikipedia , lookup

G protein–coupled receptor wikipedia , lookup

Nuclear magnetic resonance spectroscopy of proteins wikipedia , lookup

Western blot wikipedia , lookup

Structural alignment wikipedia , lookup

Ligand binding assay wikipedia , lookup

Protein–protein interaction wikipedia , lookup

Enzyme wikipedia , lookup

Protein structure prediction wikipedia , lookup

Metalloprotein wikipedia , lookup

Two-hybrid screening wikipedia , lookup

Anthrax toxin wikipedia , lookup

Transcript
1
An in-silico analysis showing the interaction of FAD ligand with cytokinin
2
dehydrogenase enzyme and with its domain part in rice
3
ABSTRACT
4
The cytokinin dehydrogenase enzyme plays an important role in the high grain
5
production of rice. This enzyme interacts with the FAD ligand. This study shows
6
the domain region of the enzyme which was identified by protparam tool,
7
generated homology models using modeller9.12 tool, models were validated using
8
errat, procheck saves server, prosa and anolea server. Then the active sites were
9
predicted by using castP server. Finally the interaction was studied between the
10
FAD and cytokinin dehydrogenase and with its domain part. The stronger
11
interaction was found between the FAD and the domain of the cytokinin
12
degydrogenase with the binding enegy of -10.91. ALA6, ARG1, AR536, TYR10,
13
ASP55, ALA57, CYS38 of FAD is binding to the domain part of cytokinin
14
degydrogenase in rice.
15
INTRODUCTION
16
Cytokinin dehydrogenase of Oryza sativa (japonica) plays an important role in
17
high grain production. The cytokinin dehydrogenase helps in the oxidation of
18
cytokinins. This protein belongs to the family of N(6)-substituted adenine
19
derivatives, which is a plant hormone having isopentenyl group. This protein helps
20
in modulateing the number of reproductive organs by regulating the cytokinin
21
accumulation in inflorescence meristems. The CKX2 is also known as OsCKX2.
22
The CKX2 is generally located on Os01g0197700. FAD or flavoprotein is the
23
ligand of Cytokinin dehydrogenase 2 protein in O.sativa.CKX2 is located on
24
chromosome number 1. CKX2 has 5137 basepairs. Cytokinin positively regulates
25
the growth of shoot apical meristem that helps in more seed production. The Gn1a
1
26
gene of rice encodes cytokinin oxidase as a major quantitative trait locus. zinc
27
finger transcription factor directly regulates the OsCKX2 expression in the
28
reproductive meristem [1]. Reduced expression of OsCKX2 causes cytokinin
29
accumulation in inflorescence meristems leading to the increase in number of
30
reproductive organs which results more grain production [2]. CKX helps for high
31
grain production in wheat, maize, barely, sorghum, foxtail millet [3]. Cytokinin
32
dehydrogenase plays an important role in the formation of crown root system in
33
rice, which helps in high grain production [4]. The main objective of this study is
34
to find out the best interaction among the ligand FAD and enzyme. Here the
35
interaction is studied by considering the whole amino acid sequence and then only
36
the domain part. Domain is the function region of a protein or domain. So
37
generally the binding is more stronger at domain part. This study involves retrieval
38
of amino acid sequence, predicting the domain region of enzyme, finding the
39
common residues among domain part and the whole sequence, generating
40
homologous models, validating and evaluating models, predicting the active site,
41
interaction of FAD with the enzyme.
42
MATERIAL AND METHODS
43
Sequence retrieval
44
The amino acid sequences of cytokinin dehydrogenase was retrieved from uniprot
45
in fasta format.
46
47
Finding the domain part
2
48
The sequence of domain in the protein was retrieved by using protparam tool.
49
However the domain region can be found from CDD (conserved domain database)
50
too.
51
Alignment between whole protein sequence and domain
52
Multiple sequenece alignment is generally done to find out the conserved residues
53
among multiple sequences[5]. From the alignment we found the regions of
54
common residues among the domain part and whole sequence depending upon
55
which we could analyse the stronger binding region where ligand (FAD) is
56
binding. The alignment result is shown in fig.1.
57
Predicting the structures of domain and whole protein
58
As the NMR crystallographic structure of enzyme is not available at PDB, so the
59
tertiary structure was generated on the basis of homology modeling concept [6]-
60
[8].The models for whole protein and domain parts were generated using
61
modeller9.12 tool. The align2d.py, model-single.py, evaluate-model.py files were
62
run on the python script by setting the template, target and the number of models to
63
be generated. In this study 10 models were generated for both domain part and for
64
whole enzyme sequence. The best model was selected on the basis of lowest
65
DOPE score. The final generated models were visualized by using visualizing tools
66
i.e, pymol, discovery studio and swiss-pdb viewer. Multiple visualiser tools were
67
used to understand the structures more properly (table1,table2). The final models
68
were shown in fig.2 and fig.3.
69
70
71
Model validation and optimization
3
72
Expasy server provides many online tools with the help of which the generated
73
protein models can be evaluated and validated [9]. The generated models were then
74
validated by using Errat, Verify3d, Procheck saves server. The backbone
75
confirmation of the protein was studied by using procheck saves server from where
76
the residues lying in favoured region, allowed region and outlier regions can be
77
found. When more residues found in favoured region and less residues found in
78
outlier region, then it represents that protein is stable for further study (table3).
79
Then the final Z-score of the models were obtained from prosa server, which
80
showed the overall model quality (table.4b). The models were then submitted to
81
Anolea server which gave the QMEAN score and Z-score (table.4a).
82
Active site prediction
83
The active site or binding site is the region where the ligand binds to receptor. In
84
this study the active site was predicted by using castp server [10], [11]. The active
85
sites of both enzyme and domain are shown in fig.4 and fig.5. The active sites
86
shows the region where the ligand can bind or not. The binding energy dependent
87
upon the type of residue to which it is binding i.e, hydrophobic or hydrophilic or
88
polar or no-polar etc. the residues found at active sites are given in fig.6 and fig.7
89
respectively.
90
Protein-ligand interaction
91
The interaction of ligand (FAD) was checked with whole enzyme and only with
92
the domain parts separately to find out the regions where the lignad binds properly.
93
The interaction was studied by using autodock tool [12], [13]. The pdb files of both
94
enzyme, domain, ligand were uploaded one after another by setting the docking
95
parameters. The autodock and autogrid files were generally run on python script
96
after which the interactions found by analyzing the dock.dlg files. Less the binding
4
97
energy more stronger is the interaction. Howerevr the binding energy is dependent
98
on the length of amino acids. The interaction result is shown in fig.8 and fig.9
99
respectively.
100
RESULT
101
Retrieved sequence and finding the domain region
102
The amino acid sequence of enzyme was retrieved from uniprot with the uniprotID
103
of Q4ADV8. The protein contains 565 number of amino acids. The domain region
104
is found from 74-225. The information about the domain region was found from
105
both uniprot and from protparam output.
106
Alignment between domain region and whole enzyme
107
The domain region was extracted from the whole protein sequence and subjected
108
to clustalw2 for multiple sequence alignment. From this study the conserved
109
residues among domain region and whole enzyme sequence were obtained. The
110
match regions were denoted with *, mis-match regions with . and the gaps were
111
denoted with -. Here the fig.1 shows the common residues.
112
Homology modelling analysis
113
The homologous models for both enzyme and domains were generated. The
114
helices are denoted red colours, helices with yellow colours and the loops are
115
denoted with green colours respectively. The VDW radius, beta factor, mass of
116
structures, stability of the objects were found from yasara tool. Then total number
117
of atoms, formal charge sum, molecular surface area and solvent accessible surface
118
area were found from pymol tool.
119
Model validation and optimization
5
120
The backbone confirmations of the structures were generated from rampage server.
121
91.8% of the residues lie in most favoured region whereas 3.3% of the residues lie
122
in outlier region. The QMEAN score and Z-scores were found from both anolea
123
server and from prosa server.
124
Active site analysis result
125
The active sites were found from castp server. The balls with green colour shows
126
the binding regions of whole protein and the domain region. Many pockets were
127
found, but out of them the first volume was selected. The pockets of domain were
128
mainly ALA, ARG, CYS, HIS, GLY, VAL, SER, TYU, LEU, PHE, ILE, THR,
129
PRO and similarly the pockets of whole enzyme were PHE, ILE, LEU, VAL, SER,
130
HIS, GLY, CYS, ALA, MET, GLU, THR, ASP, TYR, ASN, GLN, PRO, TRP,
131
GLN respectively.
132
Protein-ligand interaction
133
The pdb files of protein, domain and the ligand were submitted and by adding
134
polar hydrogens to protein and domain the interaction was studied. The binding
135
energy which was found between the interaction of whole protein-ligand (FAD)
136
was -11.23, with -14.81 intermolecular energy, 3.58 torsion energy and 13.66
137
internal energy. Similarly the interaction which was performed between the
138
domain region and ligand (FAD) showed -10.91 binding energy, -14.49
139
intermolecular energy, 3.58 torsion energy and 14.04 internal energy respectively.
140
From the interaction it is found that the FAD is binding to ALA6, ARG1, AR536,
141
TYR10, ASP55, ALA57, CYS38 at the domain part and in whole protein the
142
domain is binding to ARG134, ASP128, SER131, TYR83, SER85, LEU135,
143
ARG86, GLN136, LEU135, LEU47.
6
144
DISCUSSION
145
From the above study it is found that the model of whole enzyme and the domain
146
are suitable for future analysis as most of the residues lie in favoured region. The
147
main objective of this study was to find the stronger interaction of ligand (FAD)
148
with the domain part of enzyme and with the whole sequence, which showed that
149
the binding interaction is more stronger between the domain part and FAD than in
150
comparision to the interaction between FAD and whole amino acid sequence.
151
REFERENCES
152
1-
Li S, Zhao B, Yuan D, Duan M, Qian Q, Tang L, Wang B, Liu X, Zhang J,
153
Wang J, Sun J, Liu Z, Feng YQ, Yuan L, Li C, Rice zinc finger protein DST
154
enhances grain production through controlling Gn1a/OsCKX2 expression. ,
155
10(8):3167-72. doi: 10.1073/pnas.1300359110. 2013.
156
2-
Ashikari M, Sakakibara H, Lin S, Yamamoto T, Takashi T, Nishimura A,
157
Angeles ER, Qian Q, Kitano H, Matsuoka M, Cytokinin oxidase regulates
158
rice grain production. Science j, 309(5735):741-5. 2005.
159
3-
Mameaux S, Cockram J, Thiel T, Steuernagel B, Stein N, Taudien S, Jack
160
P, Werner P, Gray JC, Greenland AJ, Powell W, Molecular, phylogenetic
161
and comparative genomic analysis of the cytokinin oxidase/dehydrogenase
162
gene family in the Poaceae. , 10(1):67-82. doi: 10.1111/j.1467-
163
7652.2011.00645.x. 2012.
164
4-
Gao S, Fang J, Xu F, Wang W, Sun X, Chu J, Cai B, Feng Y, Chu C,
165
Cytokinin oxidase/dehydrogenase4 Integrates Cytokinin and Auxin
166
Signaling to Control Rice Crown Root Formation. Plant
167
Physiol.165(3):1035-1046. 2014.
7
168
5-
Thompson, JD., Higgins, DG., Gibson, TJ, CLUSTAL W: improving the
169
sensitivity of progressive multiple sequence alignment through sequence
170
weighting, position-specific gap penalties and weight matrix choice. Nucleic
171
Acids Research, Vol. 22, No. 22 4673-4680. 1994.
172
6-
Bilal S, Ali SB, Fazal S, Mir A, Generation of a 3D model for human
173
cereblon using comparative modeling. journal of Bioinformatics and
174
Sequence Analysis Vol. 5(1), pp. 10-15. 2013
175
7-
Singh B, Gupta A, Sharada M, Potukuchi Comparative modeling and
176
analysis of 3-D structure of Hsp 70, in Cancer irroratus. An International
177
Journal, 1(2): 1-4, 2009.
178
8-
spatial restraints. J Mol Biol 1993; 234: 779-815.
179
180
Sali A, Blundelll TL, Comparative protein modeling by satisfaction of
9-
Elisabeth Gasteiger, Christine Hoogland, Alexandre Gattiker,Séverine
181
Duvaud, Marc R. Wilkins, Ron D. Appel, and Amos Bairoch, Protein
182
Analysis Tools on the ExPASy Server , Protein Identification and Analysis
183
Tools on the ExPASy Server, Protein Analysis Tools on the ExPASy Server
184
571The Proteomics Protocols Handbook.
185
10-
Namasivayam SKR, J.Muthukumaran, Prediction of Three Dimensional
186
Model and Active Site Analysis of Inducible
187
Serine Protease Inhibitor -2 (ISPI -2) in Galleria Mellonella. J Comput Sci
188
Syst Biol, Volume 1, 119-125. 2008.
189
190
11-
R Rambabu, Peri SR, Allam AR, Computational analysis and function
prediction of a hypothetical protein 1RW0. International Journal of
8
191
Computational Bioinformatics and In Silico Modeling, Vol. 1, No. 5. 58-62.
192
2012.
193
194
12-
Adinarayana KPS, Devi RK, Protein-Ligand interaction studies on 2, 4, 6-
195
trisubstituted triazine derivatives as anti-malarial DHFR agents using
196
AutoDock. Bioinformation 6(2): 74-77. 2011.
197
198
199
13-
Seeliger D, de Groot BL, Ligand docking and binding site analysis with
PyMOL and Autodock/Vina. J Comput Aided Mol Des. 24:417–422. 2010.
9
200
201
Fig.1 multiple sequence alignment between domain and whole enzyme
202
203
204
Fig.2 Model generated for whole enzyme
10
Fig.3 Model generated for domain
205
206
Fig.4 Binding site of whole enzyme
Fig.5 Binding site of domain
207
208
Fig.6 residues present at the active site of whole enzyme
209
210
Fig.7 residues present at the active site of domain of the enzyme
11
211
212
Fig.8 FAD binding to whole enzyme
12
213
214
Fig.9 FAD binding to domain
13
215
Table.1 information found after visualizing the structures using Pymol tool
Analysis
Protein structure
Domain structure
Atom count
4233
1311
Formal charge sum
-6.0
0.0
Molecular surface area
52371.758A0
16504.783 A0
Solvent accessible surface area
23178.145 A0
10648.261 A0
216
217
Table.2 information found after visualizing the structures using Yasara tool
Analysis
Protein structure
Domain structure
VDW radius
85.337
34.261
Beta factor
109.7
111.3
Mass of object
55813.069g/mol
17316.863g/mol
Stability of the object
971.13kcal/mol
286.78kcal/mol
218
219
Table.3 The backbone confirmation of models found from rampage server
Analysis
Protein structure Domain structure
Number of residues in favoured region
517 ( 91.8%)
159 ( 88.3%)
Number of residues in allowed region
34 ( 6.0%)
15 ( 8.3%)
Number of residues in outlier region
12 ( 2.1%)
6 ( 3.3%)
220
14
221
222
Table.4 Model validation result
4a-
Result found from anolea server
Analysis
protein
Domain
QMEAN score
0.452
0.628
Z-score
-3.206
-1.483
223
224
4b-
Result found from prosa server
Analysis
protein
Domain
Z-score
-8.32
-4.21
225
15