* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Molecular electrostatic potentials and Mulliken charge populations of DNA mini-sequences ´ R. Santamaria
Survey
Document related concepts
Transcript
Chemical Physics 227 Ž1998. 317–329 Molecular electrostatic potentials and Mulliken charge populations of DNA mini-sequences R. Santamaria ) , G. Cocho, L. Corona, E. Gonzalez ´ Instituto de Fısica, UNAM, Apdo. Post. 20-364, Mexico D.F. 01000, Mexico ´ Received 14 March 1997 Abstract Molecular electrostatic potentials ŽMEPs. are used to investigate the biochemical reactivity of DNA nucleic acid bases ŽNABs.. As a complementary scheme, a Mulliken population analysis is performed to determine internal charge distribution changes when nucleic acids take part in pair complexes and DNA mini-sequences. From the fact that MEPs are capable enough for predicting the formation of different hydrogen bonded pairs among NABs, it is inferred that for these compounds the electrostatic force plays a dominant role. Particular attention is paid to Watson–Crick dimers as they exhibit different preferred sites for an electrophilic attack to these of single nucleic acids. Finally, from the mini-sequences studied here, it is concluded that recognition of DNA segments proceeds at relatively short distances as a result of a poor influence of NABs on sugar and phosphate fragments. All quantum mechanical computations are performed within the density functional formulation. q 1998 Elsevier Science B.V. PACS: 87.15.Kg; 87.15.By; 31.15.Ew Keywords: Molecular electrostatic potentials; Mulliken populations; Reactivity; Nucleic acids; Density functional theory 1. The molecular electrostatic potential The determination of the deoxyribonucleic acid ŽDNA. chemical active sites becomes of great interest for understanding a variety of specific reactions between DNA and other molecular fragments. In turn, knowledge of the shape and relative locations of chemical active sites allows for DNA chemistry modeling. A large number of studies indicate that molecular electrostatic potentials represent a suitable quantum mechanical tool for elucidating reactive centers w1–3x. ) Corresponding author. E-mail: [email protected] The molecular electrostatic potential ŽMEP. is defined as w4x V Ž r . s Ý Zar< R a y r < y Hd 3 r 1 r Ž r 1 . r< r 1 y r < , a Ž 1. where Za and R a denote the charge and position of nucleus a , respectively. Eq. Ž1. renders the electrostatic interaction between the unperturbed charge distribution of the molecule and a positive unit charge located at point r. Shortcomings from the application of the MEP to studying reactivity, at the early stage of the recognition process, are preponderantly due to the inability of a point charge to indicate orientation 0301-0104r98r$19.00 q 1998 Elsevier Science B.V. All rights reserved. PII S 0 3 0 1 - 0 1 0 4 Ž 9 7 . 0 0 3 2 0 - 0 318 R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 of an incoming partner molecule w5x, and to the approximation resorted to for computing the electron density r . Because of the long-range nature of the forces involved in Eq. Ž1., interaction at far distances between a partner molecule and DNA is mostly dominated by the electrostatic force. However, as the separation between both fragments decreases other well-known relaxation forces must be taken into account, such as those coming from mutual polarization, exchange of electrons, charge transfer, dispersion, etc. w6,7x. Encouraged by the simplicity and general good performance of the electrostatic potential scheme, and within the accuracy limits of Eq. Ž1., we use the MEP approach as a tool to explore the reactive field of B-DNA nucleotides in a single strand configuration. It is important to note that expression Ž1. is not the only identity that relates the potential V with the electron density r . In fact, the Poisson expression: = 2 V Ž r . s 4 pr Ž r . Ž 2. leads to an alternative way to compute V from r . Nevertheless, in this work, electrostatic potential computations are done according to Eq. Ž1.. As a matter of further insight of DNA reactivity, we consider of interest to also carry out a Mulliken population w8x analysis on nucleotide sequences. Although such analysis may be arguable, for instance net atomic charges are incapable of reproducing the exact dipole moment w9x, it is clear that Mulliken populations yield one of the simplest pictures of charge distribution. On the other hand, while Mulliken charges render net atomic populations in the molecule, electrostatic potentials yield the electric field out of the molecule produced by the internal charge distribution. Therefore, Mulliken populations appear as a complementary tool to the MEP approximation for studying reactivity, and correlation between results from both schemes is expected. In order to apply a MEP and Mulliken population analysis, the use of accurate geometries is required. Thus, in Section 2, we briefly describe the way in which we generate DNA nucleotide sequences and the quantum mechanical method to compute the electron density r . In Section 3, population charges and MEPs of nucleic acid bases and their pair com- plexes in the gas phase are estimated, with the aim of elucidating charge distribution changes of these compounds when they are incorporated into the DNA helix. At the second part of Section 3, we discuss the same features for the case of single mini-strands of B-DNA. Finally, Section 4 gives the main conclusions derived from a MEP and Mulliken population analysis on the different DNA fragments, and the corresponding expectations for the whole DNA system. 2. Geometries of the B-DNA nucleotide sequences The large size of DNA prevents the application of any accurate quantum mechanical method to determine its structure and electronic properties. However, it is important to recall that only four different nucleotides constitute the basic building blocks of DNA. Then, we have recourse to accurate nucleotide structures as a departure point to create arbitrary DNA sequences. In a previous investigation, the geometries of nucleic acid bases and these of their pair complexes were determined w10x. The method under use was a density functional approach which in particular has shown to give reliable results for these kind of compounds w11x. Hence, by making use of the same scheme the structures of the B-DNA nucleotides: adenine ŽA., guanine ŽG., cytosine ŽC. and thymine ŽT., were obtained w12x. From such structures, we now face the possibility of building single and even double strand sequences of B-DNA. The process to create the chain from its basic components essentially follows the next steps. We first proceed to overlap the sugar rings of all nucleotides. This is a process that becomes possible as the sugar ring represents a common structure to all monomeric units. Next, we locate nucleotide structures in such a way that nucleic acids lie in the Z s 0 plane, without loosing the previous overlap among sugar rings. This is also an event that becomes possible since nucleic acids are to great extent planar. In order to find the position of the helix origin, we put in a stacking configuration a couple of nucleotides. The position of the origin is found out by matching the location of the phosphate oxygen with R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 largest Z-component of the first nucleotide, with the oxygen that links the sugar ring of the second one. The matching process requires of the fact that ten B-DNA bases yield a complete helix turn, and therefore the twist angle between bases is ; 368 w13x. From this step we obtain an average helix rise per ˚ We note that such a value falls inside base of 3.0 A. ˚ w13x. Once the experimental range of 3.36 " 0.42 A the rise per base along the Z-direction and the twist angle around the helix axis are known, the desired sequence of nucleotides in a single strand conformation can be built up by repeating the above steps 1. Complementary nucleotides to these which appear in the single strand, in order to form a double helix, may be produced by transforming the Cartesian coordinates Ž x, y, z . of the partner nucleotide to the coordinates Ž x, yy, yz .. With the use of the nucleic acid pair complexes w10x as coupling molecules, it is possible to build the entire complementary strand, thereby forming the required double helix of B-DNA nucleotides. Local variations in the helix geometry are possible by allowing subtle changes on parameters, as for example; the rise per base, twist angle between nucleotides, propeller twist angle between bases, etc. w14x. Also, it is important to note that, since the helix chain was not allowed for an energetic relaxation, the resulting structure represents a first approximation to the true helix in the gas phase. Nevertheless, the DNA molecule is a dynamic entity which preserves its identity under certain limits and, as a result, our approximate B-DNA structure should to good extent retain the important electronic features of the true system. To our best knowledge, this is the first B-DNA structure whose monomeric species are obtained from all-electron self-consistent-field calculations. Finally, in this work we focus attention on nucleotides in a single strand configuration and in the usual direction recognized as 5X ™ 3X . In fact, we limit our analysis to doublets, considered as the shortest sequences, since we are interested on charge 1 In addition to the nucleotide coordinates of Ref. w12x, bonds of terminal oxygens, linked to the deoxyribose ring, were saturated with hydrogen in order to simulate continuation of the helix chain. In Fig. 3 such a hydrogen appears under notation H15. 319 distribution changes of nucleic bases when these are incorporated into nucleotides in a stacking geometry. We resort to density functional theory ŽDFT. to perform all computations. It is understood that DFT is one of the most flexible and reliable quantum mechanical techniques, appropriate for the relative large size compounds studied here, since simpler methods unfortunately fail on accuracy, while higher accurate schemes demand for time-consuming computations w15x. Hence DFT seems to be the suitable approach to study reactivity. In particular the method is based on the use of the wave function as traditional quantum mechanical methods do, and on the electron density. Both variables play a major role for solving self-consistently Schrodinger-type monoelec¨ tronic equations that emerge in the theory, and represent key ingredients for evaluating relevant quantities, among which MEPs and Mulliken population charges are of our concern. Further details about the application of the DFT approximation to DNA fragments are available from previous works w10–12x, and details about the method itself from literature w16–18x. 3. Mulliken charge populations and molecular electrostatic potentials 3.1. The case of the nucleic acid bases and their pair complexes We show Mulliken populations of the nucleic acid bases in Fig. 1. It is found that all nitrogens and oxygens accumulate negative charge as a result of molecular relaxation. The excess is taken from nearby hydrogens and carbons which appear with positive sign. Negative populations around those nitrogens and oxygens with no hydrogen link promote the formation of deep electrostatic potential wells in the neighborhood, thus pointing out chemical active sites Žalso depicted at Fig. 1.. For the case of purine ŽA and G. molecules we find 3 electrostatic potentials wells, while for the case of pyrimidine ŽT and C. species only 2 are located. Their minima are reported in Table 1. Cytosine and guanine yield the deepest minima in comparison with their counterparts thymine and adenine. 320 R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 Fig. 1. Mulliken charge populations and molecular electrostatic potentials of DNA nucleic acid bases. Small rings identify minima in the electrostatic potential surfaces. Therefore, the G and C fragments yield the highest probabilities for interaction with alkylating agents or for an electrophilic attack. Our results are in good agreement with those of Ref. w3x, except for cytosine. According to that reference, nitrogen N3 of cytosine renders a value of y93.5 kcalrmol, apparently more negative than O8 of the same molecule by just 6.8 kcalrmol. A result that, under the own words of its authors, is rather remarkable. The number, location and shape of reactive sites associated to nucleic acids represent important electronic features, in addition to geometric factors, that R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 321 Table 1 Minima of molecular electrostatic potentials ŽMEPs. for the case of DNA nucleic acids a, b Center MEP Žbase. MEP Žbaserpair. MEP Žbaserdinucletide. 08 N7 O10 N3 N3 N1 N7 O8 O7 N3 70 ŽC. 66 ŽG. 61 ŽG. 60 ŽC. 52 ŽA. 51 ŽA. 49 ŽA. 47 ŽT. 42 ŽT. 37 ŽG. 39 ŽCrCG. 72 ŽGrCG. 63 ŽGrCG. 155.5 " 8.5 ŽCrAC, CG, CT, CC, CC. 163 " 2 ŽGrAG, CG. 144 " 2 ŽGrAG, CG. 153.5 " 6.5ŽCrAC, CG, CT, CC, CC. 135.5 " 5.5 ŽArAT, AG, AC. 140 " 4 ŽArAT, AG, AC. 156.5 " 3.5 ŽArAT, AG, AC. 134 " 1 ŽTrAT, CT. 149 " 0 ŽTrAT, CT. 136.5 " 0.5 ŽGrAG, CG. 51 ŽArAT. 44 ŽArAT. 40 ŽTrAT. 45 ŽTrAT. 48 ŽGrGC. a All minima, in ykcalrmol, are found at the planes of nucleic acid bases. Abbreviations: A s adenine; C s cytosine; G s guanine; T s thymine. For pairs ArAT, for example, denotes adenine in the Watson–Crick pair AT, while for dinucleotides, it denotes adenine in the dinucleotide AT with T above A as depicted in Fig. 3. b must be taken into account in order to infer the kind of compounds that NABs may interact with w7x. As an example, let us consider the case of guanine and cytosine. Both molecules contain the type of nitrogens and oxygens responsible for the presence of potential wells, then we can visualize such atoms as acceptor-like elements. On the other hand, hydrogens are recognized as donor-type entities with opposite sign. Hence, formation of different hydrogen bridges between G and C becomes possible due to the presence of donor and acceptor species in both fragments. As a result, different dimer residues are expected to appear, among which the Watson–Crick pair ŽFig. 2. represents a particular situation. Table 2 gives all feasible forms of H-bonded pairs among the NABs. The different conformers are described according to the donor and acceptor atoms taking part in the binding. In all these structures at least two hydrogen links appear 2 , and in most cases donor atoms from one fragment fall deep inside the potential wells created by the acceptor atoms of the partner molecule. Experimental evidence about the occurrence of these H-bonded pairs in vacuo has so far been missing. However, in solution, pair formation of nucleic acid bases can be indirectly inferred. Nucleic 2 In general, pairs with only one hydrogen bridge are energetically less stable than those which contain two or more bridges. Thus, in this work, a H-bonded complex should be understood as containing two or more H-links. residues immersed in a non-aqueous medium tend to pair with higher rate than these in an aqueous solution. In the latter case water molecules come into competition to produce H-bonded compounds, leading to lower association constants among NABs that show on enthalpy changes. Nevertheless, problems arise when one tries to elucidate the kind of microscopic conformation adopted by dimers. A direct way to obtain structural information is by forming crystalline complexes among the different derivatives of the NABs, and performing the corresponding X-ray diffraction analysis. Henceforth, when one compares results from experiment with these predicted from Table 2, it is found that most of them indeed crystallize in a diversity of arrangements w19x. Observations under this scheme show that nucleic acid bases are capable of linking with themselves producing homologous pairs, or with other bases resulting in mixed complexes where two, three or more NAB derivatives coexist in the H-bonded infinite network. Water molecules, and either tautomeric species of NABs, may also engage in the binding to give rise to alternative crystallographic forms. Interestingly enough is the fact that contrary to NABs immersed in solution, in crystals, Watson–Crick linkages among the different nucleic acid derivatives seem to be energetically less stable than other types of H-bonding. Differences in their stability are attributed to different environments. For the particular case of Watson–Crick pair complexes in vacuo, we have also carried out a 322 R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 Table 2 Different pairs of acceptor and donor atoms for each nucleic acid base a, b, c Adenine Guanine Thymine Cytosine N1, H12 N1, H13 N3, H13 N3, H15 N7, H11 N3, H13 N3, H16 N7, H15 O10, H14 O7, H13 O7, H14 O8, H10 O8, H14 N3, H9 O8, H13 a As a matter of example, adenine is capable of producing two hydrogen bridges with cytosine if atoms N1 and H12 of A link with atoms H9 and N3 of C, respectively. b In particular, the AT Watson–Crick pair corresponds to the binding of the set N1, H12 of adenine with the set O8, H14 of thymine. For the GC Watson–Crick pair, the corresponding sets are O10, H14 of guanine and N3, H9 of cytosine. The extra hydrogen bridge in the GC pair is implicitly contemplated in this binding since two links are enough to determine the kind of pair conformation. c Take note that, in addition to mixed dimers, homologous pairs should be also contemplated from the table. Mulliken analysis. Similar charge population results to these reported for single residues are found, refer to Fig. 2. No Mulliken charge transfer is noted from one molecule to the other since each fragment that builds the complex essentially retains its zero initial charge. However, substantial electronic reorganization occurs within each nucleic acid at the interaction region. If we compare Mulliken charges of those atoms taking part in the binding, before and after the formation of the complex, it is observed that hydrogens increase their positive charge while nitrogens and oxygens increase their negative charge. The new electron distribution induces local charge polarization between the partner molecules forming the dimer. It calls our attention that carbons 2, 5 and 6 of adenine, 4 of thymine, 5 and 6 of guanine, and 2 and 4 of cytosine also exhibit considerable charge readjustment, even when they are located relatively far away from the binding region. MEPs for the Watson–Crick nucleic complexes are likewise depicted in Fig. 2. In pair complexes some of the original electrostatic potential wells attached to single NABs disappear. For instance, a 323 total of 3 wells are found for single adenine before forming the complex, and only 2 of them are left once the formation of the AT dimer has taken place. The main reason is the near presence of atom H14 of thymine which produces a hydrogen bridge with N1 of adenine. Equivalent results about the lost of binding sites are observed for complex GC, where the formation of hydrogen bridges N3–H14 and O8–H12 essentially eliminates two potential wells initially attached to atoms N3 and O8 of cytosine. On the other hand, electrostatic potential wells that survive in the process of dimer formation largely preserve their original shape, and only some of them are cut or confined within a shorter region. Such potential wells in turn become the characteristic reactive centers of the Watson–Crick pair complexes. The lowest values of the electrostatic potential surfaces corresponding to the AT and GC compounds are shown in Table 1. In comparison with single nucleic acids, the relative order among minima is greatly modified by pair formation. For example, the most drastic change is recorded for site O8 of cytosine. In the case of pairs, active site O8 shows the shallowest minimum, contrary to the case of single nucleic acids where it represents the deepest one. For other pair conformations among NABs, with equal demand of time-consuming computations, similar results are expected. In fact, Watson–Crick dimers represent just an example where chemical active centers of single nucleic acids are potentially capable of linking a molecular partner in different conformations. Still, the question remains about possible changes of MEPs and population charges of NABs when other DNA constituents remain in their vicinity, a topic to be investigated next. 3.2. The case of the B-DNA single strand In this section we analyze charge populations and electrostatic potentials when nucleotides Žnucleic acid bases in conjunction with sugar and phosphate subunits. appear in a stacking configuration, or equiva- Fig. 2. Mulliken charge populations and molecular electrostatic potentials of DNA pair complexes. Small rings identify minima in the electrostatic potential surfaces. For comparison, charge populations of nucleic acids before pairing are also included. 324 R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 lently in a single strand conformation. This analysis represents a more general study than previous studies performed on single nucleic acids by other authors as we now take into account the additional contributions of phosphate groups and sugar compounds. Such contributions can not be neglected due to the long range nature of the forces that participate in the electrostatic potential expression. On the other hand, because of the large size of the DNA chain, we concentrate our study on the shortest complexes, this is doublets of nucleotides. Sixteen possible structures is the total number of compounds; however, we only need analyze the four combinations between purines and pyrimidines to determine electron distribution changes. Hence, we choose pairs AT, AG, CT and CG as examples. Figs. 3 and 4 illustrate the AT and AG doublets and establish atom numbering. We first pay attention to sugar and phosphate subunits and later on to the nucleic acids themselves. Careful comparisons of Mulliken populations among corresponding atoms, taking part in sugars and phosphates of the above mini-strands, indicate small differences. In Fig. 5, plots of Mulliken net charges versus atom label are shown. Only three curves are depicted there since all remaining data fall on one of such curves. It is not difficult to see that the largest discrepancies in sugar compounds are recorded for atoms C3, C4, O6 and H9, while for phosphate fragments differences appear at oxygens O3, O4 and O5. The deviations are indeed small and, in most instances, curves follow a similar trend. Changes computed on atoms C3, C4 and H9 of sugar molecules are attributed to a poor influence of the adjacent nucleic bases that participate in the doublet, while these changes recalled on oxygens are mainly due to the location of the phosphate molecule. The phosphate may stay in between or at the end of the strand, leading to different kinds of binding which in turn originate different atomic charges because of the finite size of the chain. In spite of that, atomic population discrepancies in sugar compounds and Fig. 3. Molecular electrostatic potentials, at the planes of nucleic acids, for the DNA sequence AT. Dark and transparent surfaces belong to A and T, respectively. Small rings identify relative minima and strips gaps of 10 kcalrmol, starting from 120 downwards, in the electrostatic potential surfaces. Refer to Fig. 1 for atom numbering in the nucleic acid bases. R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 325 Fig. 4. Molecular electrostatic potentials, at the planes of nucleic acids, for the DNA sequence AG. Dark and transparent surfaces belong to A and G, respectively. Small rings identify relative minima and strips gaps of 10 kcalrmol, starting from 120 au downwards, in the electrostatic potential surfaces. Refer to Figs. 1 and 3 for atom numbering in the nucleic acid bases, sugar and phosphate compounds. phosphate groups in general appear negligible enough that, it is our opinion, they are incapable of producing decisive changes on reactivity. Such an argument is based on the fact that DNA assumes a dynamical role in a variety of biochemical processes. Therefore, the flexibility of DNA is one of the main reasons that such a molecule is capable of preserving, even under different conformations, the important genetic code. However, take note that it may be equally valid the conclusion that the observed differences in population charges of sugar and phosphates are important enough that such subunits can be considered as the first recognition sites to discriminate among the attached nucleic acid bases w20–22x. Nevertheless, we remark and insist on the molecular flexibility of DNA to preserve the genetic information under relatively wide structural and electronic limits. Consequently, we may say that sugar and phosphate molecular groups preserve their electronic identity within some margins and constitute not only a struc- tural, but also a common electronic backbone of DNA according to a Mulliken population analysis. Mulliken population comparisons have been also carried out for nucleic acid bases in the four possible single strands. We find that charge populations of A in the sequence AG are almost identical to these of A in AT, see Fig. 6. The same situation is observed for G in AG and CG, for T in AT and CT, and for C in CG and CT. Then, within the doublet approximation, Mulliken charges of nucleic bases are indifferent to the contiguous base, whether this last one is situated above or under the reference nucleic acid. As a matter of fact, major charge population changes in nucleic bases are produced by the near presence of sugar and phosphate compounds. For instance, adenine alone and within the mini-strands shows substantially different charge populations, refer to Fig. 6. Equivalent results occur for the other bases. The new charge distribution of nucleic bases in the ministrands is mainly perceived at their junction with the 326 R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 Fig. 5. Mulliken populations of the eight sugar and phosphate subunits taking part in the mini-sequences AG, AT, CG and CT. Only three curves are shown since all data fall in one of such curves: sugar and phosphate of adenine in AG Ž –v – ., of guanine in CG Ž –B– . and of thymine in AT Ž –'– .. Refer to Fig. 3 for atom numbering in sugar and phosphate groups. sugar molecule Žsee, e.g., N9 in Fig. 6. and in the case of purines, the redistribution includes the sixmembered rings. On the other hand, MEPs of A inside the sequences AT and AG are depicted at Figs. 3 and 4. In these figures, the two surfaces generated by the A nucleotide show similar not only in shape, but also in reference to the depth of potential wells. However, this is not the case when we compare the adenine surface with that generated by the G nucleotide. Guanine with its phosphate group yield an extended surface with just one isolated well Žin the vicinity of N3. that simulates an island, see Fig. 4. In contrast, adenine and its corresponding phosphate molecule exhibit a less extended surface and two isolated wells Žone near to N1 and the other near to N3.. On the other hand, if we compare MEPs produced by purines against these of pyrimidines, clear differences emerge. Pyrimidines present surfaces separated from those of the phosphate group due to the inter- position of H-bonds of either the amino group NH of cytosine or radical CH of thymine, refer for instance to Fig. 3. Hence, electrostatic potential wells of pyrimidines, because of their relative small size, become slightly more inaccessible within the DNA complex. Still note that the external part of the electrostatic potential surfaces produced by phosphate groups of purines and pyrimidines resemble each other. In fact, main differences in MEPs of dinucleotides are introduced by the nucleic acids themselves. For larger single strands we expect similar results. In Table 1, we consider the lowest values in the MEP surface associated to nucleic acid bases. Six different dinucleotide systems have been now contemplated there to carry out some statistics, namely; AG, AT, AC, CG, CT and CC. Values appear with deviations in order to summarize results in one central value and be able to establish relative order among minima. In particular, we observe that chemi- R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 327 Fig. 6. Mulliken populations of adenine alone and in the mini-strands AG and AT. Refer to Fig. 1 for atom numbering. cal active center N7 of guanine and adenine leads to the deepest potential wells. They are closely followed by these produced by O8 and N3 of cytosine, and O7 of thymine. Note also that the relative order among minima in this last column differs from previous columns of Table 1. Therefore, preferred sites for electrophilic attack of nucleic bases participating in dinucleotides are different to these previously predicted for single nucleic acids and base pairs. However, as a common factor among values of Table 1, all minima are found at the planes of nucleic acid bases Žregardless the bases stay alone, forming pairs or building dinucleotides.. Other authors w3x have found secondary minima out of the planes of nucleic acids A and C. Nevertheless, a careful DFT analysis indicates that it is due to the nonplanarity of amino hydrogens. When nucleic bases take part in base pairs, such hydrogens become planar and the secondary minima disappear. In dinucleotides, minima out of the molecular plane are observed for A in AG and AT, and for C in CG and CT. The minima of A and C are located in opposite direction to the other nucleotide building the dinucleotide, in this case G or T. For the interesting mini-sequence AC, which involves both A and C, we observe a minimum out of the plane of A but do not observe a minimum out of the plane of C. A similar situation occurs for C in C 1C 2 , where there is an out of plane minimum attributed to the first cytosine ŽC 1 . but not found in the second one ŽC 2 .. Hence, the stacking conformation of nucleotides becomes an important factor, in addition to the planarity of amino hydrogens, to make disappear such kind of secondary minima. Based on the above results of Mulliken charge populations and MEP for phosphates, sugars and nucleic acid bases, we conclude that recognition of specific DNA sequences proceeds at short distances. The assertion derives from the fact that Mulliken charges and MEPs associated to phosphate and sugar subunits keep to great part their own electronic identity as a result of a relative poor influence from adjacent nucleic acid bases. Therefore, a partner molecule approaching to DNA is expected to be first 328 R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 attracted by the strong field created by the phosphate group w23x and, as the partner gets closer, potential wells of nucleic acid bases come into play in the selective process favoring a close contact recognition. This result is experimentally evidenced in gene regulatory proteins w24x. Finally, our results should be useful to understand processes related to genetic recognition, such as DNA transcription, where discrimination of nucleotides plays an important role. Also, they should be of help to build more realistic models about the dynamics of DNA replication. In this last case we call attention to the molecular machine approximation, where one might refer to a molecular machine as an assembly of polymeric molecules with one of them sliding over the other, in order to transport some energy or certain type of mass. Note that the transportation of mass may be also understood as the transportation of information w25x. Of course, the sliding Žor sticking. of molecules should depend on the potential wells that one molecule finds along the other one. 4. Conclusions In this work we have shown the simplicity and importance of molecular electrostatic potentials and Mulliken charge populations to elucidate reactive centers of DNA. The application of both schemes to the nucleic acid bases revealed the existence of a variety of H-bonded associates among NABs. Experiments based on their derivatives confirm the dominant role of the electrostatic force, that participates in the MEPs, for predicting the pairing of compounds of such nature. Particular attention given to Watson–Crick dimers showed that, electronic charge is mainly reaccommodated at the interaction region between fragments, inducing local polarization, with the consequent lost of some binding sites. Surviving electrostatic potential wells in the process of dimer formation remain essentially with the same shape, except at some instances where they show clipped sides. A study on nucleotides indicated that sugar molecules and phosphate groups represent not only a structural, but also an electronic backbone of DNA. Nucleic acid bases in dinucleotides do not show important charge readjustment for the presence of a contiguous base, weather this last one is positioned above or under the reference nucleic acid. Then, it is assumed that selective interactions between partner molecules with DNA may be characterized as close contact interactions since nucleic acids are uncapable of substantially altering the charge populations and molecular electrostatic potentials of sugar and phosphate subunits that could indicate the presence of one or another adjacent nucleic acid base. Nucleic acids taking part in dinucleotides yield different preferred sites for an electrophilic attack to these found in single nucleic acids and pair complexes. This is due to the additional contributions of phosphate and sugar compounds, which are not negligible because of the long range nature of the electrostatic forces that appear in the MEP expression. However, a common factor among all reactive centers is the fact that they appear with their lowest values at the planes of nucleic acids, with some secondary sites observed out of molecular planes under special circumstances. Finally, it is important to mention that, the set of hydrogen atoms that participate in the binding of nucleic acid bases not only ensures a weak linking between bases, but also help to preserve the identity of each base inside the DNA complex. This is due to the limited charge reorganization that hydrogen atoms can suffer in comparison with heavier elements that, otherwise, could appear at the interaction region. Features like weak bonding make possible the process of exact duplication, while electronic identity preservation of nucleic acids in DNA makes possible the saving and interpretation of the genetic information. References w1x P. Politzer, D.G. Truhlar ŽEds.., Chemical Applications of Atomic and Molecular Electrostatic Potentials, Plenum, New York, 1981. w2x J.S. Murray, K. Sen ŽEds.., Molecular Electrostatic Potentials: Concepts and Applications, Elsevier, New York, 1996. w3x R. Bonaccorsi, A. Pullman, E. Scrocco, J. Tomasi, Theor. Chim. Acta 24 Ž1972. 51. w4x J.J.M. Wiener, M.E. Grice, J.S. Murray, P. Politzer, J. Chem. Phys. 104 Ž1996. 5109. w5x P.C. Mishra, R.D. Tewari, Int. J. Quantum Chem. 32 Ž1987. 181. R. Santamaria et al.r Chemical Physics 227 (1998) 317–329 w6x K. Morokuma, K. Kitauro, in: P. Politzer, D.G. Truhlar ŽEds.., Chemical Applications of Atomic and Molecular Electrostatic Potentials, Plenum, New York, 1981, pp. 215– 242. w7x I. Alkorta, J.J. Perez, Int. J. Quantum Chem. 57 Ž1996. 123. w8x R.S. Mulliken, J. Chem. Phys. 23 Ž1955. 1833. w9x C.G. Mohan, A. Kumar, P.C. Mishra, Int. J. Quantum Chem. 60 Ž1996. 699. w10x R. Santamaria, A. Vazquez, J. Comput. Chem. 15 Ž1994. ´ 981. w11x R. Santamaria, E. Charro, A. Zacarias, M. Castro Ž1997, submitted for publication.. w12x R. Santamaria, A. Quiroz-Gutierrez, C. Juarez, J. Mol. Struct. ´ ´ ´ ŽTheochem. 357 Ž1995. 161. w13x R.E. Dickerson, Sci. Am. 249 Ž1983. 86. w14x C.R. Calladine, J. Mol. Biol. 161 Ž1982. 343. w15x G. Bakalarski, P. Grochowski, J.S. Kwiatkowski, B. Lesyng, J. Leszczynski, Chem. Phys. 204 Ž1996. 301. w16x R.G. Parr, W. Yang, Density Functional Theory of Atoms and Molecules, International Series of Monographs on Chemistry, 16, Oxford University Press, Oxford, 1989. 329 w17x J.K. Labanowski, J.W. Andzelm ŽEds.., Density Functional Methods in Chemistry, Springer, New York, 1991. w18x J.M. Seminario, P. Politzer ŽEds.., Modern Density Functional Theory: A Tool for Chemistry, Theoretical and Computational Chemistry Series, vol. 2, Elsevier, Amsterdam, 1995. w19x D. Voet, A. Rich, Prog. Nucleic Acid Res. Mol. Biol. 10 Ž1970. 196, and references therein. w20x T. Shinoda, N. Shima, M. Tsukada, J. Theor. Biol. 151 Ž1991. 433. w21x T. Shinoda, N. Shima, M. Tsukada, Int. J. Quantum Chem. 47 Ž1993. 59. w22x T. Shinoda, N. Shima, M. Tsukada, Int. J. Quantum Chem. 49 Ž1994. 849. w23x P.K. Weiner, R. Langridge, J.M. Blaney, R. Schaefer, P.A. Kollman, Proc. Natl. Acad. Sci. USA 79 Ž1982. 3754. w24x A. Klug, in: D.A. Chambers ŽEd.., DNA: The Double Helix, vol. 758, Ann. N.Y. Acad. Sci., New York, 1995. w25x R.M. Stroud, C.R. Cantor, T.D. Pollard ŽEds.., Annual Review of Biophysics and Biomolecular Structure, vol. 23, Annual Reviews Inc., Palo Alto, CA, 1994.