Download Alignment between domain region and whole enzyme

1 An in-silico analysis showing the interaction of FAD ligand with cytokinin 2 dehydrogenase enzyme and with its domain part in rice 3 ABSTRACT 4 The cytokinin dehydrogenase enzyme plays an important role in the high grain 5 production of rice. This enzyme interacts with the FAD ligand. This study shows 6 the domain region of the enzyme which was identified by protparam tool, 7 generated homology models using modeller9.12 tool, models were validated using 8 errat, procheck saves server, prosa and anolea server. Then the active sites were 9 predicted by using castP server. Finally the interaction was studied between the 10 FAD and cytokinin dehydrogenase and with its domain part. The stronger 11 interaction was found between the FAD and the domain of the cytokinin 12 degydrogenase with the binding enegy of -10.91. ALA6, ARG1, AR536, TYR10, 13 ASP55, ALA57, CYS38 of FAD is binding to the domain part of cytokinin 14 degydrogenase in rice. 15 INTRODUCTION 16 Cytokinin dehydrogenase of Oryza sativa (japonica) plays an important role in 17 high grain production. The cytokinin dehydrogenase helps in the oxidation of 18 cytokinins. This protein belongs to the family of N(6)-substituted adenine 19 derivatives, which is a plant hormone having isopentenyl group. This protein helps 20 in modulateing the number of reproductive organs by regulating the cytokinin 21 accumulation in inflorescence meristems. The CKX2 is also known as OsCKX2. 22 The CKX2 is generally located on Os01g0197700. FAD or flavoprotein is the 23 ligand of Cytokinin dehydrogenase 2 protein in O.sativa.CKX2 is located on 24 chromosome number 1. CKX2 has 5137 basepairs. Cytokinin positively regulates 25 the growth of shoot apical meristem that helps in more seed production. The Gn1a 1 26 gene of rice encodes cytokinin oxidase as a major quantitative trait locus. zinc 27 finger transcription factor directly regulates the OsCKX2 expression in the 28 reproductive meristem [1]. Reduced expression of OsCKX2 causes cytokinin 29 accumulation in inflorescence meristems leading to the increase in number of 30 reproductive organs which results more grain production [2]. CKX helps for high 31 grain production in wheat, maize, barely, sorghum, foxtail millet [3]. Cytokinin 32 dehydrogenase plays an important role in the formation of crown root system in 33 rice, which helps in high grain production [4]. The main objective of this study is 34 to find out the best interaction among the ligand FAD and enzyme. Here the 35 interaction is studied by considering the whole amino acid sequence and then only 36 the domain part. Domain is the function region of a protein or domain. So 37 generally the binding is more stronger at domain part. This study involves retrieval 38 of amino acid sequence, predicting the domain region of enzyme, finding the 39 common residues among domain part and the whole sequence, generating 40 homologous models, validating and evaluating models, predicting the active site, 41 interaction of FAD with the enzyme. 42 MATERIAL AND METHODS 43 Sequence retrieval 44 The amino acid sequences of cytokinin dehydrogenase was retrieved from uniprot 45 in fasta format. 46 47 Finding the domain part 2 48 The sequence of domain in the protein was retrieved by using protparam tool. 49 However the domain region can be found from CDD (conserved domain database) 50 too. 51 Alignment between whole protein sequence and domain 52 Multiple sequenece alignment is generally done to find out the conserved residues 53 among multiple sequences[5]. From the alignment we found the regions of 54 common residues among the domain part and whole sequence depending upon 55 which we could analyse the stronger binding region where ligand (FAD) is 56 binding. The alignment result is shown in fig.1. 57 Predicting the structures of domain and whole protein 58 As the NMR crystallographic structure of enzyme is not available at PDB, so the 59 tertiary structure was generated on the basis of homology modeling concept [6]- 60 [8].The models for whole protein and domain parts were generated using 61 modeller9.12 tool. The align2d.py, model-single.py, evaluate-model.py files were 62 run on the python script by setting the template, target and the number of models to 63 be generated. In this study 10 models were generated for both domain part and for 64 whole enzyme sequence. The best model was selected on the basis of lowest 65 DOPE score. The final generated models were visualized by using visualizing tools 66 i.e, pymol, discovery studio and swiss-pdb viewer. Multiple visualiser tools were 67 used to understand the structures more properly (table1,table2). The final models 68 were shown in fig.2 and fig.3. 69 70 71 Model validation and optimization 3 72 Expasy server provides many online tools with the help of which the generated 73 protein models can be evaluated and validated [9]. The generated models were then 74 validated by using Errat, Verify3d, Procheck saves server. The backbone 75 confirmation of the protein was studied by using procheck saves server from where 76 the residues lying in favoured region, allowed region and outlier regions can be 77 found. When more residues found in favoured region and less residues found in 78 outlier region, then it represents that protein is stable for further study (table3). 79 Then the final Z-score of the models were obtained from prosa server, which 80 showed the overall model quality (table.4b). The models were then submitted to 81 Anolea server which gave the QMEAN score and Z-score (table.4a). 82 Active site prediction 83 The active site or binding site is the region where the ligand binds to receptor. In 84 this study the active site was predicted by using castp server [10], [11]. The active 85 sites of both enzyme and domain are shown in fig.4 and fig.5. The active sites 86 shows the region where the ligand can bind or not. The binding energy dependent 87 upon the type of residue to which it is binding i.e, hydrophobic or hydrophilic or 88 polar or no-polar etc. the residues found at active sites are given in fig.6 and fig.7 89 respectively. 90 Protein-ligand interaction 91 The interaction of ligand (FAD) was checked with whole enzyme and only with 92 the domain parts separately to find out the regions where the lignad binds properly. 93 The interaction was studied by using autodock tool [12], [13]. The pdb files of both 94 enzyme, domain, ligand were uploaded one after another by setting the docking 95 parameters. The autodock and autogrid files were generally run on python script 96 after which the interactions found by analyzing the dock.dlg files. Less the binding 4 97 energy more stronger is the interaction. Howerevr the binding energy is dependent 98 on the length of amino acids. The interaction result is shown in fig.8 and fig.9 99 respectively. 100 RESULT 101 Retrieved sequence and finding the domain region 102 The amino acid sequence of enzyme was retrieved from uniprot with the uniprotID 103 of Q4ADV8. The protein contains 565 number of amino acids. The domain region 104 is found from 74-225. The information about the domain region was found from 105 both uniprot and from protparam output. 106 Alignment between domain region and whole enzyme 107 The domain region was extracted from the whole protein sequence and subjected 108 to clustalw2 for multiple sequence alignment. From this study the conserved 109 residues among domain region and whole enzyme sequence were obtained. The 110 match regions were denoted with *, mis-match regions with . and the gaps were 111 denoted with -. Here the fig.1 shows the common residues. 112 Homology modelling analysis 113 The homologous models for both enzyme and domains were generated. The 114 helices are denoted red colours, helices with yellow colours and the loops are 115 denoted with green colours respectively. The VDW radius, beta factor, mass of 116 structures, stability of the objects were found from yasara tool. Then total number 117 of atoms, formal charge sum, molecular surface area and solvent accessible surface 118 area were found from pymol tool. 119 Model validation and optimization 5 120 The backbone confirmations of the structures were generated from rampage server. 121 91.8% of the residues lie in most favoured region whereas 3.3% of the residues lie 122 in outlier region. The QMEAN score and Z-scores were found from both anolea 123 server and from prosa server. 124 Active site analysis result 125 The active sites were found from castp server. The balls with green colour shows 126 the binding regions of whole protein and the domain region. Many pockets were 127 found, but out of them the first volume was selected. The pockets of domain were 128 mainly ALA, ARG, CYS, HIS, GLY, VAL, SER, TYU, LEU, PHE, ILE, THR, 129 PRO and similarly the pockets of whole enzyme were PHE, ILE, LEU, VAL, SER, 130 HIS, GLY, CYS, ALA, MET, GLU, THR, ASP, TYR, ASN, GLN, PRO, TRP, 131 GLN respectively. 132 Protein-ligand interaction 133 The pdb files of protein, domain and the ligand were submitted and by adding 134 polar hydrogens to protein and domain the interaction was studied. The binding 135 energy which was found between the interaction of whole protein-ligand (FAD) 136 was -11.23, with -14.81 intermolecular energy, 3.58 torsion energy and 13.66 137 internal energy. Similarly the interaction which was performed between the 138 domain region and ligand (FAD) showed -10.91 binding energy, -14.49 139 intermolecular energy, 3.58 torsion energy and 14.04 internal energy respectively. 140 From the interaction it is found that the FAD is binding to ALA6, ARG1, AR536, 141 TYR10, ASP55, ALA57, CYS38 at the domain part and in whole protein the 142 domain is binding to ARG134, ASP128, SER131, TYR83, SER85, LEU135, 143 ARG86, GLN136, LEU135, LEU47. 6 144 DISCUSSION 145 From the above study it is found that the model of whole enzyme and the domain 146 are suitable for future analysis as most of the residues lie in favoured region. The 147 main objective of this study was to find the stronger interaction of ligand (FAD) 148 with the domain part of enzyme and with the whole sequence, which showed that 149 the binding interaction is more stronger between the domain part and FAD than in 150 comparision to the interaction between FAD and whole amino acid sequence. 151 REFERENCES 152 1- Li S, Zhao B, Yuan D, Duan M, Qian Q, Tang L, Wang B, Liu X, Zhang J, 153 Wang J, Sun J, Liu Z, Feng YQ, Yuan L, Li C, Rice zinc finger protein DST 154 enhances grain production through controlling Gn1a/OsCKX2 expression. , 155 10(8):3167-72. doi: 10.1073/pnas.1300359110. 2013. 156 2- Ashikari M, Sakakibara H, Lin S, Yamamoto T, Takashi T, Nishimura A, 157 Angeles ER, Qian Q, Kitano H, Matsuoka M, Cytokinin oxidase regulates 158 rice grain production. Science j, 309(5735):741-5. 2005. 159 3- Mameaux S, Cockram J, Thiel T, Steuernagel B, Stein N, Taudien S, Jack 160 P, Werner P, Gray JC, Greenland AJ, Powell W, Molecular, phylogenetic 161 and comparative genomic analysis of the cytokinin oxidase/dehydrogenase 162 gene family in the Poaceae. , 10(1):67-82. doi: 10.1111/j.1467- 163 7652.2011.00645.x. 2012. 164 4- Gao S, Fang J, Xu F, Wang W, Sun X, Chu J, Cai B, Feng Y, Chu C, 165 Cytokinin oxidase/dehydrogenase4 Integrates Cytokinin and Auxin 166 Signaling to Control Rice Crown Root Formation. Plant 167 Physiol.165(3):1035-1046. 2014. 7 168 5- Thompson, JD., Higgins, DG., Gibson, TJ, CLUSTAL W: improving the 169 sensitivity of progressive multiple sequence alignment through sequence 170 weighting, position-specific gap penalties and weight matrix choice. Nucleic 171 Acids Research, Vol. 22, No. 22 4673-4680. 1994. 172 6- Bilal S, Ali SB, Fazal S, Mir A, Generation of a 3D model for human 173 cereblon using comparative modeling. journal of Bioinformatics and 174 Sequence Analysis Vol. 5(1), pp. 10-15. 2013 175 7- Singh B, Gupta A, Sharada M, Potukuchi Comparative modeling and 176 analysis of 3-D structure of Hsp 70, in Cancer irroratus. An International 177 Journal, 1(2): 1-4, 2009. 178 8- spatial restraints. J Mol Biol 1993; 234: 779-815. 179 180 Sali A, Blundelll TL, Comparative protein modeling by satisfaction of 9- Elisabeth Gasteiger, Christine Hoogland, Alexandre Gattiker,Séverine 181 Duvaud, Marc R. Wilkins, Ron D. Appel, and Amos Bairoch, Protein 182 Analysis Tools on the ExPASy Server , Protein Identification and Analysis 183 Tools on the ExPASy Server, Protein Analysis Tools on the ExPASy Server 184 571The Proteomics Protocols Handbook. 185 10- Namasivayam SKR, J.Muthukumaran, Prediction of Three Dimensional 186 Model and Active Site Analysis of Inducible 187 Serine Protease Inhibitor -2 (ISPI -2) in Galleria Mellonella. J Comput Sci 188 Syst Biol, Volume 1, 119-125. 2008. 189 190 11- R Rambabu, Peri SR, Allam AR, Computational analysis and function prediction of a hypothetical protein 1RW0. International Journal of 8 191 Computational Bioinformatics and In Silico Modeling, Vol. 1, No. 5. 58-62. 192 2012. 193 194 12- Adinarayana KPS, Devi RK, Protein-Ligand interaction studies on 2, 4, 6- 195 trisubstituted triazine derivatives as anti-malarial DHFR agents using 196 AutoDock. Bioinformation 6(2): 74-77. 2011. 197 198 199 13- Seeliger D, de Groot BL, Ligand docking and binding site analysis with PyMOL and Autodock/Vina. J Comput Aided Mol Des. 24:417–422. 2010. 9 200 201 Fig.1 multiple sequence alignment between domain and whole enzyme 202 203 204 Fig.2 Model generated for whole enzyme 10 Fig.3 Model generated for domain 205 206 Fig.4 Binding site of whole enzyme Fig.5 Binding site of domain 207 208 Fig.6 residues present at the active site of whole enzyme 209 210 Fig.7 residues present at the active site of domain of the enzyme 11 211 212 Fig.8 FAD binding to whole enzyme 12 213 214 Fig.9 FAD binding to domain 13 215 Table.1 information found after visualizing the structures using Pymol tool Analysis Protein structure Domain structure Atom count 4233 1311 Formal charge sum -6.0 0.0 Molecular surface area 52371.758A0 16504.783 A0 Solvent accessible surface area 23178.145 A0 10648.261 A0 216 217 Table.2 information found after visualizing the structures using Yasara tool Analysis Protein structure Domain structure VDW radius 85.337 34.261 Beta factor 109.7 111.3 Mass of object 55813.069g/mol 17316.863g/mol Stability of the object 971.13kcal/mol 286.78kcal/mol 218 219 Table.3 The backbone confirmation of models found from rampage server Analysis Protein structure Domain structure Number of residues in favoured region 517 ( 91.8%) 159 ( 88.3%) Number of residues in allowed region 34 ( 6.0%) 15 ( 8.3%) Number of residues in outlier region 12 ( 2.1%) 6 ( 3.3%) 220 14 221 222 Table.4 Model validation result 4a- Result found from anolea server Analysis protein Domain QMEAN score 0.452 0.628 Z-score -3.206 -1.483 223 224 4b- Result found from prosa server Analysis protein Domain Z-score -8.32 -4.21 225 15

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Download Alignment between domain region and whole enzyme