* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download SOLID-STATE PHYSICS II 2007 O. Entin-Wohlman vs.
Survey
Document related concepts
Hydrogen atom wikipedia , lookup
Potential energy wikipedia , lookup
Time in physics wikipedia , lookup
Conservation of energy wikipedia , lookup
Internal energy wikipedia , lookup
Electromagnetism wikipedia , lookup
Condensed matter physics wikipedia , lookup
Nuclear physics wikipedia , lookup
Aharonov–Bohm effect wikipedia , lookup
Gibbs free energy wikipedia , lookup
Nuclear structure wikipedia , lookup
Relativistic quantum mechanics wikipedia , lookup
Density of states wikipedia , lookup
Superconductivity wikipedia , lookup
Electrical resistivity and conductivity wikipedia , lookup
Theoretical and experimental justification for the Schrödinger equation wikipedia , lookup
Transcript
SOLID-STATE PHYSICS II 2007 O. Entin-Wohlman 1. SEMICONDUCTORS The energy gap. When one plots the electronic density of states of a certain material vs. the energy, one observes that there are regions on the energy axis in which the density of states vanishes. The electronic spectrum can then be viewed as ‘bands’ separated by ‘gaps’. This is a consequence of the structure of the material (which may be, for example, a crystal or an amorphous system). Another property of the material at hand is the concentration of electrons. This concentration determines the Fermi energy, which is the highest energy that is occupied at zero temperature. We then encounter, at zero temperature, two possible situations: either the topmost band is completely filled, or it is just partially filled. In a metallic system, the Fermi energy is located well within one of the bands, which is then termed ‘the conduction band’. In an insulator, the topmost occupied band is completely filled, and is then termed ‘the valence band’. The next highest band, which is completely empty at zero temperature, is the conduction band of the insulator. The magnitude of the gap separating the conduction band from the valence one varies. When it is of order of few eV ’s, the material remains essentially an insulator even at room temperature, 1 eV ' 1.6 × 10−12 erg , kB ' 1.4 × 10−16 erg/K , kB T ' 4.2 × 10−14 erg , at room temperature , or 1 eV ' 104 Kelvin . (1.1) Thus the thermal energy is not sufficient to overcome a gap of order a few eV ’s, but does suffice when the energy gap is of order of fractions of an eV . In the latter case, the material is a ‘semiconductor’. The distinction between a metal and a semiconductor stems from the temperature dependence of their respective electrical conductivities: the conductivity of a metal, σ= ne2 τ , m 1 (1.2) decreases as the temperature is raised, since τ , the mean free time in-between collisions of the electrons, decreases as the temperature increases. Here, e is the electron charge, m is its mass and n is the electronic concentration. The dependence of τ −1 on the temperature follows usually a power law, and therefore it is not particularly strong. On the other hand, the conductivity of a semiconductor increases exponentially as the temperature is raised, since the number of electrons making the jump over the gap into the conduction band by virtue of the thermal energy is increasing exponentially. Semiconductors also exhibit the phenomenon of photoconductivity: the conductivity of a semiconductor increases when light is shined on it, since the energy gap (if it is sufficiently small) can be overcome by the energy of the light. Finally, the conductivities of various semiconductors can vary by several orders of magnitude. In this respect, one usually distinguishes between intrinsic semiconductors, in which the conductivity is dominated by the electrons which are excited thermally into the conductance band, and extrinsic semiconductors, whose electronic behavior is determined by the electrons contributed to the conduction band by impurities. One should keep in mind that the value of the energy gap itself depends on the temperature: the gap shrinks as the temperature is raised (by a few percents). This is because (i) the band structure is susceptible to the temperature (the material expands thermally); and (b) the effect of lattice vibrations (phonons). The energy gap is usually measured by monitoring the absorbtion of incident radiation, or by studying the temperature dependence of the intrinsic conductivity. The absorbtion of radiation increases abruptly when its frequency ω is such that ~ω overcomes the threshold of the gap. However, this is true only for direct transitions, which occur when the maximum of the valence band is right below the minimum of the conduction band. When this is not the case, and the maximum of the valence band is not right below the minimum of the conduction band, the minimal energy transitions are indirect transitions. Then, a phonon or more are required to supply the missing crystal momentum between the k−space locations of the minimum and the maximum. The energy of the phonon is then also absorbed, making the required photon energy less than the threshold. The band structure of a semiconductor. It is plausible to assume that the number of electrons in the conduction band, as well as the number of holes in the valence band, of a typical semiconductor is small; therefore we can approximate the relevant portions of the 2 dispersion relations of the both bands by quadratic forms ³ k2 k2 k2 ´ 1 ε(k) = εc + ~2 + 2 + 3 , for electrons , 2m1 2m2 2m3 ³ k2 k2 k2 ´ 1 ε(k) = εv − ~2 + 2 + 3 , for holes . 2m1 2m2 2m3 (1.3) Here, εc denotes the bottom of the conduction band and εv denotes the top of the valence band. In both these expressions, the three ‘masses’, m1 , m2 , and m3 are simply the coefficients of the expansions of the true dispersion relations around the respective maximum and minimum. Therefore, these are just effective masses, and their values have still to be determined. ∗ ∗ ∗ exercise: Why there are no linear in k−terms in the dispersion? ∗ ∗ ∗ exercise: Find the density of states for the ellipsoidal pockets described by Eqs. (1.3). ∗ ∗ ∗ solution: The constant energy surfaces of the electrons are ellipsoids. Let us first determine the volume enclosed by such a constant energy surface, of energy ε. Since energies in the conductance band are measured relative to εc , the constant energy surface is given by k2 k2 ´ ~2 ³ k12 + 2 + 3 =1. (1.4) ε − εc 2m1 2m2 2m3 p This is the equation of an ellipsoid, with three axes, given by Ri = 2mi (ε − εc )/~, i = 1, 2, or 3. The volume in k−space of our ellipsoid is (4π/3)R1 R2 R3 = (4π/3)[2ε − √ εc ]/~2 )3/2 m1 m2 m3 . We divide this volume by the volume of a unit cell in k−space, 8π 3 , and find that the total number of states contained inside a constant surface energy ε is 4π ³ 2(ε − εc ) ´3/2 √ 1 Nc (ε) = m1 m2 m3 3 . (1.5) 2 3 ~ 8π The density of states, Nc (ε), (the number of states per unit volume, per energy, per spin), is the ε−derivative of Nc (ε) with respect to ε, ∂N (ε) 1 Nc (ε) = = 3 2 ∂ε ~π r (ε − εc )m1 m2 m3 . 2 (1.6) A similar calculation can be carried out for the holes; the result then contains |ε − εv | in place of ε − εc . ∗ ∗ ∗ exercise: What is the difference (if there is any) between the electronic density of states of a semiconductor, and that of the free electron gas (which describes rather well a metal)? 3 The effective masses are measured by the technique of cyclotron resonance. This is accomplished by applying a (constant) magnetic field, B. Let us consider the electron pocket, described by the dispersion (1.3) (the first equation there). We use the semiclassical equations of motion to describe the motion of the electrons, employing the coordinate scheme in which the magnetic field and the velocity vectors are decomposed along the three principal axes of the effective mass tensor, ´ dvi e³ e mi = − v × B = − ²ij` vj B` . dt c c i (1.7) Here, ²ij` is the fully anti-symmetric Levi-Chivita tensor, and repeated indices are summed over. We now assume an oscillatory solution for the velocity, with frequency ω, such that d2 vi /dt2 = −ω 2 vi . Then, by taking a second temporal derivative in Eq. (1.7), we obtain ³ e ´2 ³ e ´2 1 1 2 −ω vi = ²ij` ²ji1 `1 B` B`1 vi1 = − ²ij` ²i1 j`1 B` B`1 vi1 . (1.8) c mi mj c mi mj There are two possibilities, due to the properties of the Levi-Chivita tensor: either i1 = i and then ` = `1 or i 6= i1 , in which case i1 = ` and ` = i. Therefore we obtain ³ e ´2 X B 2 ³ e ´2 X B B v i ` ` ` 2 2 −ω vi = − vi (²ij` ) + (²ij` )2 . c mi mj c mi mj j` j` (1.9) (Here, the summation symbol has been introduced explicitly for clarity.) For example, for i = 1 Eq. (1.9) becomes ³ ´ ³ e ´2 B X 1 ωc2 − ω 2 v1 = Bi m i v i , c m1 m2 m3 i=1,2,3 where we have defined ωc2 = ³ e ´2 P mi Bi2 , m1 m2 m3 c i=1,2,3 (1.10) (1.11) the cyclotron frequency squared. In general, Eq. (1.9) represents three linear homogeneous equations, and one can easily verify that the determinant of that system of equations vanishes when the frequency ω equals the cyclotron frequency. Since for that frequency the electrons’ motion is at resonance with the magnetic field, one can measure it (from peaks in the absorption spectrum as function of the magnetic field). One can deduce the cyclotron mass, m∗ , s P 2 1 i=1,2,3 mi Bi P = . m∗ m1 m2 m3 i=1,2,3 Bi2 4 (1.12) The measurement can be carried out for various orientations of the magnetic field, and then one can deduce information about the masses mi . Such measurements require that the mean-free time in-between collisions of the electrons will be larger than the cyclotron period, so that electron will complete several revolutions before being scattered away. In other words, the measurement requires rather low temperatures and relatively pure samples. The number density of charge carriers at thermal equilibrium. Quite generally, the electrons in the conduction band of a semiconductor are there either because they were transferred from the valence band (by the thermal energy), or because they come from various impurities present in the system. For the same reasons, there are holes in the valence band. In any event, the electronic concentration of the electrons in the conduction band, nc , at thermal equilibrium, is given by Z nc (T ) = ∞ dεNc (ε) εc 1 eβ(ε−µ) + 1 . (1.13) Here, β ≡ 1/(kB T ) denotes the inverse temperature, µ is the chemical potential, (it is a function of the temperature) and Nc (ε) is the density of states, found above [see Eq. (1.6)]. Note that we have taken the width of the conduction band to be unrestricted, whereas our result for the density of states is valid only in relatively small pockets around the minima of the conduction band. This approximation will turn out to be justified. Note also the implicit assumption that the presence of impurities does not affect the density of states. Employing similar approximations, the density of holes in the valence band, pc , is given by Z εv ´ Z εv ³ 1 1 = dεNv (ε) β(µ−ε) . (1.14) pc (T ) = dεNv (ε) 1 − β(ε−µ) e +1 e +1 −∞ −∞ The exact location of the chemical potential in the energy scheme depends on the nature of the impurities, their number, etc. However, we will make the assumption that the chemical potential is located somewhere within the energy gap between the conduction and the valence bands, and that the temperature is not too high, such that εc − µ À kB T , and µ − εv À kB T . (1.15) In other words, all energies, kB T , εc , εv , and the chemical potential µ are well separated, and not close to each other. ∗ ∗ ∗ exercise: Give numerical estimates for the temperature and the width of the energy gap for Eqs. (1.15) to hold. 5 The approximation (1.15) allows to replace the Fermi distribution by the Boltzmann one, 1 eβ(ε−µ) +1 ' e−β(ε−µ) , and 1 eβ(µ−ε) +1 ' e−β(µ−ε) . (1.16) Then, inserting the condition (1.15) into Eq. (1.13), we perform the resulting integration as follows r Z m1 m2 m3 ∞ p dε (ε − εc )e−β(ε−µ) 2 εc r Z m1 m2 m3 ∞ p −β(εc −µ) 1 =e dε (ε − εc )e−β(ε−εc ) ~3 π 2 2 εc r Z εc +kB T p 1 m1 m2 m3 ' e−β(εc −µ) 3 2 dε (ε − εc ) ~π 2 εc r m1 m2 m3 2 −β(εc −µ) 1 =e (kB T )3/2 ≡ Nc (T )e−β(εc −µ) . 3 2 ~π 2 3 1 nc (T ) ' 3 2 ~π An analogous calculation for the hole concentration gives r m1 m2 m3 2 −β(µ−εv ) 1 pv (T ) ' e (kB T )3/2 ≡ Pv (T )e−β(µ−εv ) . 3 2 ~π 2 3 (1.17) (1.18) (Note that the effective masses, mi , i = 1, 2, or 3 need not to be identical for the valence and for the conduction bands. Therefore, the temperature-dependent coefficients Nc (T ) and Pv (T ) need not to be identical.) Although the above calculation does not actually give the concentration of the charge carriers, since we still do not know the chemical potential, it does tell us that when one of the concentrations is known, we can find the other. The reason is that the chemical potential is cancelled when we consider the product, nc (T )pv (T ) = Nc (T )Pv (T )e−β(εc −εv ) = Nc (T )Pv (T )e−βEgap . (1.19) The pure (intrinsic) case. When the semiconductor is very clean, we may assume that all electrons in the conduction band come from the valence one, and therefore the density of holes must equal the density of the electrons. We denote the chemical potential of this case by µi , and the concentration of electrons in the conduction band or of holes in the valence band by ni (T ). Then, from Eqs. (1.17) and (1.18), we have e2βµi = εc + εv kB T Pv (T ) Pv (T ) β(εc +εv ) e V µi (T ) = + ln . Nc (T ) 2 2 Nc (T ) (1.20) Hence, since the chemical potential is known, we can compute the electron (or the hole) concentration. We also note that at zero temperature (where the chemical potential is 6 identical to the Fermi energy), the chemical potential of the pure semiconductor lies exactly at the middle of the gap. Finally we note that the change of the chemical potential with the temperature is not large, since the ln-term in Eq. (1.20), being determined by the ratio of the effective hole masses and the effective electron masses, is of order unity. All these considerations are valid when the temperature is sufficiently low, so that all our assumptions above are valid. The doped (extrinsic) case. When the semiconductor is doped, the impurities contribute electrons to the conduction band (or holes to the valence band). In that case, nc (T ) 6= pv (T ). Let us then denote ∆n(T ) = nc (T ) − pv (T ) , and nc (T )pv (T ) = n2i (T ) . (1.21) The second relation here is simply another way to present our result Eq. (1.19). It is straightforward to solve these two equations, and to obtain r³ ∆n ´2 ∆n nc = + n2i + , 2 r³ 2 ´ ∆n 2 ∆n + n2i − pv = . 2 2 (1.22) (We suppress the dependence on the temperature for brevity.) We see that if |∆n| À ni , then (depending on the sign of ∆n) either the concentration of the charge carriers is mainly that of the holes (∆n is negative), and then the semiconductor is of the “p-type”, or the concentration is mainly that of the electrons (∆n is positive) and then the semiconductor is of the “n-type” . We can proceed a bit further with the above considerations, and re-write nc for example, from Eq. (1.17), in the form −β(εc −µ) nc = Nc e = = np np Nc Pv e −βEgap /2 Nc Pv e−βEgap /2 o × o nr N c −β(− Egap +εc −µ) 2 o e Pv n o E k T v) −β(− gap +εc −µ+ B2 ln P 2 N c × e . (1.23) The first factor here is just ni , while the second one, using Eq. (1.20), contains µi (the chemical potential of the pure semiconductor) in the exponent. Therefore we have found that nc = eβ(µ−µi ) ni , pv = e−β(µ−µi ) ni . 7 (1.24) It follows that ∆n = 2 sinhβ(µ − µi ) . ni (1.25) In order to proceed farther, we need to calculate the concentration of electrons contributed by the impurities, or captured by them (and then there is a contribution to the hole concentration). Impurities which contribute electrons to the semiconductor are called ‘donors’, and those that capture electrons and hence contribute holes are called ‘acceptors’. Impurity energy levels. Let us first estimate the energy of an electron belonging to a certain donor. For simplicity it is assumed that each donor contributes just one electron, for example, an arsenic ion (charge 5e) in a germanium (charge 4e) crystal. One might have thought that the energy of the extra electron of the arsenic is its ionization energy, 9.81 eV , which is huge (as compared to the band structure energies). However, this value is for a free arsenic ion, while in our case the arsenic ion sits in the germanium medium. This fact reduces that ‘ionization’ energy drastically, for two main reasons. 1. The ionization energy is related to the Coulomb energy between the ion and the extra electron. The semiconductor has a relatively high (static) dielectric constant (of about 10-20), and therefore this energy is reduced by at least an order of magnitude. The reason for the high dielectric constant is related to the relatively small values of the energy gaps. In a metal, for example, there is no gap at all, and the static dielectric constant indeed is infinite. The smaller the energy gap is, the larger is the dielectric constant. 2. The electron released from the donor into the conduction band has there [see Eq. [1.3)] a parabolic relation between the energy and the momentum, but the mass is not the free electron mass. In general, that mass is about a factor of 10 smaller than the free electron mass; by moving into the conduction band the electron gains kinetic energy. We may combine these two sources to estimate the ground state energy using the Bohr radius, a0 , Ed = n me4 o n m∗ 1 o n o n o me4 e2 −3 ≡ 2 V × ' 13.6 eV × 10 . a0 ~ ~2 m ²2 (1.26) This binding energy should be measured relative to the conduction band; in other words, the additional electronic level contributed by the donor is at an energy distance Ed below the conduction band, so that it takes a much smaller energy to excite the electron from the donor level into the conduction band (as compared to exciting it from the valence band). 8 Although this estimate is done for a single donor, we may assume that all donors contribute their electrons to the localized level located below the conduction band. This is true as long as the concentration of donors is such that they do not interact among themselves. Similar considerations hold for the energy levels of the acceptors; those are located close to the valence band, so that it is easy for the electrons to be excited from the valence band to the acceptors’ level, leaving behind them holes in the valence band. Population of impurity energy levels. An impurity energy level can be in one of the three following configurations: a. it can host no electrons at all, and then it is empty, and the corresponding energy is zero; b. it can host an electron of spin up or an electron of spin down, and then it is singly occupied and its energy is Ed ; or c. it can host two electrons, one for each spin direction, and then it is doubly-occupied, and its energy is 2Ed + U , where U is the (repulsive) electrostatic energy it costs to bring two electrons to the same level. Taking the thermal average we find the thermal equilibrium population of an impurity energy level. The partition function of our little system, which is coupled to an electron bath of chemical potential µ and temperature T , is Z = 1 + 2e−β(Ed −µ) + e−β(2Ed +U −2µ) , (1.27) and the average population of the level is then hni = ´ 1 ³ −β(Ed −µ) 2e + 2e−β(2Ed +U −2µ) . Z (1.28) ∗ ∗ ∗ exercise: Under which circumstances Eq. (1.28) reproduces the Fermi distribution? explain. Usually, the on-level Coulomb repulsive interaction, U , is huge; of order of several eV ’s. The exponents including it are then almost zero. As a result Eq. (1.28) becomes hni = 1 1 β(Ed −µ) e 2 +1 , (1.29) so that the concentration of the electrons coming to the conductance band from the donors, nd , is nd = Nd hni , where Nd is the concentration of the donors. 9 (1.30) ∗ ∗ ∗ exercise: Show that the concentration of holes contributed to the valence band by the acceptors is given by pa = Na 1 β(µ−Ea ) e 2 +1 . (1.31) Here, Na is the concentration of acceptors, and Ea is the energy level of a single acceptor, measured relatively to the top of the valence band. The charge carrier density of a doped semiconductor. Let us now consider a doped semiconductor, in which the concentration of donors is Nd , and the concentration of acceptors is Na , such that Nd > Na . At zero temperature, the valence band is filled completely and the conduction band is empty. In addition, in each unit volume, Na electrons will ‘fall’ from the donor energy levels to the acceptor levels and fill them (remember that each acceptor has, e.g., one electron less than the host material, and each donor has one extra electron). Thus, at zero temperature, all acceptor levels (whose number per unit volume is Na ) are filled, the valence band is filled, and there are Nd − Na electrons in the donor levels. As the temperature is raised, the electrons will be redistributed among all the levels, (namely, the levels in the valence band, in the conduction band, and those belonging to donors and to the acceptors), but their total number will remain the same. At a finite temperature T there are nd electrons on the donor levels and nc electrons in the conduction band (per unit volume), and pv holes in the valence band, and pa holes on the acceptor levels. The total number of holes must be equal to the total number of electrons, Nd − Na = nc + nd − pv − pa . (1.32) This equation fixes the chemical potential µ of our system, and allows for a full calculation of all densities. We note that Eq. (1.32) ensures the neutrality of the system: Nd is the number of donors, which are charged positively, likewise Na is the number of acceptors, which are charged negatively. Therefore the total positive charge in the system is Nd + pv + pa , while the total negative charge of the system is Na + nc + nd (all per unit volume). However, in order to simplify the calculation, we make the assumption [compare with Eq. (1.15)] Ed − µ À kB T , and µ − Ea À kB T . (1.33) When these approximations are introduced into Eqs. (1.29), (1.30), and (1.31), we find that nd ' 2Nd e−β(Ed −µ) ¿ Nd , and pa ' 2Na e−β(µ−Ea ) ¿ Na , 10 (1.34) such that our condition Eq. (1.32) becomes ∆n ≡ nc − pv = Nd − Na . (1.35) In summary, [see Eqs. (1.21) and (1.22)], our set of equations for the charge carrier densities is Nd − Na = 2 sinhβ(µ − µi ) , ni (1.36) and n c pv = ´2 i1/2 1 h i 1 h³ Nd − Na + 4n2i ± Nd − Na . 2 2 (1.37) In the mostly intrinsic case ni À |Nd − Na |, and in the mostly extrinsic one ni ¿ |Nd − Na |. ∗ ∗ ∗ exercise: Show that in the mostly intrinsic case, the charge carrier densities are given by n 1 ' ni ± (Nd − Na ) , pv 2 c (1.38) while for the mostly extrinsic regime nc ' Nd − Na , and pv ' n2i , for Nd > Na , Nd − Na (1.39) pv ' Na − Nd , and nc ' n2i , for Na > Nd . Na − Nd (1.40) and 11 2. THE P-N JUNCTION Introduction. A p-n junction consists of two semi-infinite semiconductors, which we imagine to fill the entire space. One of them has more donors than acceptors, and so it is negatively charged, (it is the n-part of the junction) and the other has more acceptors than donors, and so it is positively charged (and hence it is the p-member of the junction). These two semiconductors are attached to one another at x = 0, such that along the x−axis the combined system is negatively charged for x > 0, and is positively charged at x < 0, while along the other two directions y and z it is homogeneous (namely, its properties do not vary with position). A very crude model to describe the system will be to assume that N , x>0 0 , x>0 d Nd (x) = , Na (x) = . 0 , x<0 Na , x < 0 (2.1) The non-uniformity in impurity concentrations induces non-uniformity in the density of the charge carriers (the electrons and the holes). It is plausible to expect that this nonuniformity is significant at and around x = 0 and decays far away as |x| → ∞. Hence, there is formed a ‘layer’ around x = 0, which is called the ‘depletion layer’, where the electron and hole concentration depends on x. In order to determine the properties of the depletion layer, we can view the abrupt change in the impurity concentration, as described by Eq. (2.1), as causing some electrostatic potential, φ(x), which we need to determine in a self-consistent way, since the charge carriers which are moving around will tend to screen any electrostatic potential as best as they can. Screening in a free electron gas. Suppose we have free electrons (that have only kinetic energy) moving against a positive background, so that the entire system is neutral. This describes a simple metal. Now let us introduce into this system a positively-charged particle, at a given position, and hold it there firmly. This charge will attract the electrons to it, and will thus create in its neighborhood a surplus of negative charge, which will reduce (i.e., ‘screen’) its positive charge. The relevant equations that govern this behavior are from electrostatics. Firstly, we have Poisson’s equation connecting the charge density of the positive ‘extra’ charge (which was 12 introduced into our neutral system) and the potential it creates, −∇2 φext (r) = 4πρext (r) . (2.2) Secondly, we have Poisson’s equation connecting the ‘true’ charge density, formed in the system after all the re-shuffling of the electrons, with the potential it creates −∇2 φ(r) = 4πρ(r) . (2.3) Now, the true charge density ρ(r), consists of the extra charge density inserted into the system, and the charge density it induces because it attracts electrons, ρ(r) = ρext (r) + ρind (r) . (2.4) To these (exact) electrostatic equations, we add a semi-classical approximation: we find a relation between the induced charge density and the (full) potential which the electrons ‘feel’. This approximation, known as the Thomas-Fermi approximation, is carried out as follows. At equilibrium, the chemical potential, µ, in the system fixes the electron density, n, n(µ) = X k 1 eβ(Ek −µ) +1 , (2.5) where β = 1/(kB T ) is the inverse of the temperature. When there is a constant electrostatic potential, φ, acting on the electrons, we may just add it to µ, i.e., µ → µ + eφ. When the electrostatic potential, φ(r), varies sufficiently smoothly, we nonetheless add it to the chemical potential. The result, according to Eq. (2.5), is that the electron density is changed, n(µ) → n(µ+eφ(r)). The change in the electron density, times −e, is just the induced charge density. Thus we have ρind (r) = −e[n(µ + eφ(r)) − n(µ)] ' −e2 ∂n φ(r) . ∂µ (2.6) Note that the derivative is to be found for φ = 0. ∗ ∗ ∗ exercise: Calculate explicitly the quantity e2 ∂n for the free electron gas in which ∂µ Ek = ~2 k 2 /2m, estimate its numerical value, and in particular determine its dimensions. Let us now write our equations in Fourier space. Equations (2.2) and (2.3) become φext (q) = 4π 4π ext ρ (q) , φ(q) = 2 ρ(q) , 2 q q 13 (2.7) and Eqs. (2.4) and (2.6) are h ∂n i ρ(q) = ρext (q) + ρind (q) , ρind (q) = −e2 φ(q) . ∂µ (2.8) Simple algebraic manipulations give us the relation between the true potential in the system, and the extra charge or extra potential that have caused this potential (in the initially neutral system) φ(q) = φext (q) 4πρext (q) h i= . 2 + 4πe2 ∂n q 2 ∂n e 1 + 4π ∂µ q2 ∂µ (2.9) We deduce from this calculation the dielectric function, ε(q), of the free electron gas in the Thomas-Fermi approximation, ε(q) = 1 + 4π h 2 ∂n i e . q2 ∂µ (2.10) For example, when the extra charge density is simply a (positive) point charge, ρext (r) = eδ(r) then ρext (q) = e, and the potential it creates is obviously φext (r) = e/r, which in Fourier space becomes φext (q) = 4πe/q 2 . The true potential created in the system is φ, which according to Eq. (2.9) is given in this example by φ(q) = ∂n 4πe , κ2 ≡ 4πe2 . 2 +κ ∂µ (2.11) e φ(r) = e−κr , r (2.12) q2 Transforming back to real space, so that instead of the long-range Coulomb potential, that decays at long distances as 1/r, the potential screened by the free electron gas decays exponentially. The exercise above will tell us how good this screening is, namely, how far (in space) extends the the potential (2.12). ∗ ∗ ∗ exercise: Work out explicitly the integral leading to the result (2.12). Self-consistent determination of the electrostatic potential in a p-n junction. Coming back to our p-n junction, we will calculate the electrostatic potential created by the junction (or rather, by the depletion layer) employing the same type of approximation 14 as the Thomas-Fermi approximation used above. Namely, we simply shift the chemical potential in Eqs. (1.17) and (1.18) by eφ(x), leading to nc (x) = Nc (T )e−β(εc −eφ(x)−µ) , pv (x) = Pv (T )e−β(µ+eφ(x)−εv ) . (2.13) Confining ourselves to the completely extrinsic case, in which the charge carrier density is entirely determined by the impurity concentration, we write down the charge carrier densities far away from the junction nc (∞) ≡ Nd = Nc (T )e−β(εc −eφ(∞)−µ) , pv (−∞) ≡ Na = Pv (T )e−β(µ+eφ(−∞)−εv ) . (2.14) These two equations give eφ(∞) + µ = εc + kB T ln Nd , Nc (T ) −eφ(−∞) − µ = −εv + kB T ln Na , Pv (T ) (2.15) so that we know the total electrostatic potential drop across the junction is eφ(∞) − eφ(−∞) = Egap + kB T ln Nd Na . Nc (T )Pv (T ) (2.16) The spatial variation of the electrostatic potential is determined, as usual, by the Poisson equation −∇2 φ = − 4πρ(x) d2 φ = , 2 dx ² (2.17) in which ² is the static dielectric constant of the semiconductor. (The static dielectric constant of the host material of the p-n junction takes into account all static screening effects which are there before the material turned into a p-n junction.) The charge density giving rise to the spatially-varying electrostatic potential is h i ρ(x) = e Nd (x) − Na (x) + pv (x) − nc (x) . (2.18) The meaning of the various factors here is simple. In the square brackets we have the number of completely ionized donors (positive charge density) + the number of completely ionized acceptors (negative charge density) + hole concentration (pv ) + electron concentration (negative charge, nc ). In principle, we have now all that is needed to find the electrostatic potential, since we can just insert Eqs. (2.1) and (2.13) into Eq. (2.18), and then in Poisson’s equation (2.17). However, the result will be an horrendous equation, so let us try to simplify it a bit. Combining 15 Eqs. (2.13) with Eqs. (2.14) we find nc (x) = Nd e−βe(φ(∞)−φ(x)) , pv (x) = Na e−βe(φ(x)−φ(−∞)) , (2.19) from which we see clearly that as x → ±∞, and the respective exponentials approach unity, the electron and hole concentrations approach the concentration of the donors and the acceptors, respectively. We may thus say that at x larger than a certain distance, say dn , Nd (x) − nc (x) → 0, while for x smaller than a certain distance, say −dp , Na (x) − pv (x) → 0. Therefore, a plausible approximate expression for the charge density ρ(x) will be 0, eN , d ρ(x) = −eNa , 0, dn > x > 0 , . 0 > x > −dp , −dp > x . x > dn , (2.20) In other words, outside the depletion layer, which extends from −dp up to dn , the electron density cancels the donor density (on the right side) and the hole density cancels the acceptor density (on the left side). The solution of Poisson’s equation with the approximate charge density, Eq. (2.20), is now simple. We have already found the values of φ at ±∞, [see Eq. (2.16)] and we know (from electrostatics) that φ(x) and its first derivative (which is minus the electric field) should be continuous. Therefore φ(∞) , ´ d φ(∞) − 2πeN (x − dn )2 , ² ³ ´ φ(x) = a φ(−∞) + 2πeN (x + dp )2 , ² φ(−∞) , ³ x > dn , dn > x > 0 , 0 > x > −dp , −d > x . . (2.21) p It remains to apply the boundary conditions at x = 0. First, we look at the electric field at the origin and require that it will be continuous. This gives us Nd dn = Na dp . (2.22) Second, the continuity of the electrostatic potential itself gives φ(∞) − φ(−∞) ≡ ∆φ = 16 ³ 2πe ´ ² (Na d2p + Nd d2n ) . (2.23) We therefore find for the boundaries of the depletion layer the result s s (Na /Nd )²∆φ (Nd /Na )²∆φ dn = , dp = . (Nd + Na )2πe (Nd + Na )2πe (2.24) ∗ ∗ ∗ exercise: Plot the electrostatic potential across the depletion layer of a p-n junction, plot the charge carrier density, and estimate the width of the depletion layer as function of the temperature. The biased p-n junction–simple considerations. So far, we have considered the p-n at equilibrium. Let us now examine what happens when the junction is biased by a potential drop, V . In that case the full change of the electrostatic potential is modified as compared to its equilibrium value. We may try to estimate the effect of the bias by using the approximation ∆φ ≡ φ(∞) − φ(−∞) = (∆φ)0 − V , (2.25) where (∆φ)0 is the equilibrium value [given by Eq. (2.16)]. One result of the bias voltage is that [see Eq. (2.24)] the depletion layer thickness changes: the layer shrinks when V > 0 (i.e., V raises the the potential of the p-side), and expands when V < 0. The other result is that an electrical current will flow across the junction. Denoting by J the density current and by j the electrical current, we have je = −eJe , and jh = eJh . (2.26) Let us consider first the flow of the holes. Firstly, there are the holes which are generated in the n-side of the junction. These are called “minority carriers”. There are not many of them, but once they are in the depletion layer, they will be immediately swallowed into the p-side by the strong electric field that exists in that layer. This current is called ‘the hole generation current’ and is not too sensitive to the bias V . Secondly, there is the current of holes going from the p-side of the junction to the n-side, which is called ‘the hole recombination current’ (because eventually these holes will combine with the electrons in the n-side and will disappear). These holes have to overcome the potential barrier, and therefore we expect that Jhrec ∝ e−eβ((∆φ)0 −V ) . 17 (2.27) Since at equilibrium, i.e., when V = 0, we expect the two currents to balance one another, ¯ ¯ Jhrec ¯ = Jhgen , we find that V =0 Jhrec = Jhgen eβeV , (2.28) so that the total current of holes flowing from the p-side of the junction to the n-side is given by Jh = Jhrec − Jhgen = Jhgen (eβeV − 1) . (2.29) Carrying out the analogous calculation for the electron current, and combining the hole current with the electron current, the total electric current across the junction is j = e(Jhgen + Jegen )(eβeV − 1) . (2.30) In particular, when V < 0, the current saturates at a (small) negative value, −e(Jhgen +Jegen ), independent of the voltage. In order to obtain an estimate for that current, we need to carry out a more detailed calculation. ∗ ∗ ∗ exercise: Plot the current Eq. (2.30) as function of the bias voltage, and discuss the result. The biased p-n junction–detailed calculation. Our more elaborate treatment of the p-n junction begins with the observation that in the presence of both an electric field, E = −dφ/dx, and density gradients, the current densities obey Je = −µn nc E − Dn dnc , dx Jh = µp pv E − Dp dpv . dx (2.31) Here, µn and µp are the electron and hole mobilities, respectively, and Dn and Dp are the diffusion coefficients. The latter are related to the mobilities by the Einstein relations µn = eDn , kB T µp = eDp . kB T (2.32) ∗ ∗ ∗ exercise: Derive the Einstein relation for a degenerate electron gas, and for the non degenerate one [which is just the one appearing in Eqs. (2.32)]. ∗ ∗ ∗ exercise: Prove that the Einstein relations (2.32) ensure that no current is flowing at thermal equilibrium. 18 We can re-write the mobilities in terms of the the mean free times between collisions and the effective masses. For example, if there are only electrons of uniform density in the system, then an electric field will give rise to electrical current, je = −eJe = σE, where σ is the electrical conductivity. According to Drude theory, σ = ne2 τ col /m, and therefore in our case eτ col µn = n , mn eτpcol µp = . mp (2.33) We have distinguished here between the mean free time in between collisions of the electrons (τecol ) and of the holes (τpcol ). We have also assigned different masses, mn and mp , to the charge carriers, but assumed the same mass for all three principal directions for each of the species. At equilibrium, where Je and Jh vanish, one can determine the carrier densities by using Eqs. (2.31) and (2.32). The solutions are just Eqs. (2.13). ∗ ∗ ∗ exercise: Prove the statement above. When the junction is biased, the system is no more at equilibrium. In this case we add two phenomenological equations, ∂Je ∂nc ³ ∂nc ´ − = , ∂t ∂t g−r ∂x ∂Jh ∂pv ³ ∂pv ´ − = , ∂t ∂t g−r ∂x (2.34) which aim to reflect the fact that charge is conserved. In fact, charge is not conserved in our case, since electrons and holes are created by thermal fluctuations and disappear by recombining (namely, electrons go back to the valence band). These processes are described by the first terms on the right hand side of each of Eqs. (2.34). [In the absence of these two terms, each of Eqs. (2.34) is just the continuity equation for the electrons and the holes, respectively.] In order to determine the forms of the terms (∂nc /∂t)g−r and (∂pv /∂t)g−r , we postulate that they act as to restore thermal equilibrium. Hence ³ ∂p ´ ³ ∂n ´ nc − n0c p − p0v v c =− , =− v , ∂t g−r τn ∂t g−r τp (2.35) where n0c and p0v are the equilibrium values of the densities. Since our p-n junction is in steady state (namely, it does not evolve in time), we set the ‘true’ time-derivatives in Eqs. (2.34) to be zero, so that dJe nc − n0c + =0, dx τn 19 dJh pv − p0v + =0. dx τp (2.36) These equations have to be solved together with Eqs. (2.31). This is not a simple task in general; however, in the regions where the electric field is almost constant, we may ignore the terms including it, and combine Eqs. (2.31) and (2.36) to find Dn d2 nc nc − n0c = , dx2 τn Dp pv − p0v d2 pv = . dx2 τp (2.37) Inspection of these equations shows that each of them defines a length, the ‘diffusion length’, L2p = Dp τp . L2n = Dn τn , (2.38) For example, if the density of holes at some x0 > 0 differs from its (equilibrium) value at x → +∞, then the second of Eqs. (2.38) tells us that pv (x) = pv (∞) + e−(x−x0 )/Lp (pv (x0 ) − pv (∞)) . (2.39) In other words, at x > Lp , the hole density (on the n-side of the junction) takes almost its equilibrium value (which is very small, see below). Since holes are created by thermal fluctuations at a rate p0v /τp , and are not recombined along a distance Lp , the flow of thermally generated holes per unit area into the depletion layer is about Lp p0v /τp ' (n2i /Nd )Lp /τp . Hence the saturation current is Jhgen + Jhgen = ³ n2 ´ L i p Nd τ p + ³ n2 ´ L i n Na τn . (2.40) ∗ ∗ ∗ exercise: Discuss the temperature dependence of the saturation current, Eq. (2.40). 20 3. DIAMAGNETISM AND PARAMAGNETISM The interaction of electrons with a uniform magnetic field. A uniform magnetic field couples to the electronic motion, and to the electron spin. The coupling with the spin adds to the Hamiltonian the Zeeman interaction g 0 µB H · S , (3.1) in which H is the magnetic field. Here, S is the total spin of the electrons, i.e., S= X si , i 1 and si = σ , 2 (3.2) where σ is the vector of Pauli matrices, σx = σ = x̂σx + ŷσy + ẑσz , 1 0 −i 1 0 , σy = , σz = . 0 i 0 0 −1 0 1 (3.3) In Eq. (3.1), µB is the Bohr magneton µB = e~/2m = 0.927 × 10−20 erg/G and g0 is the g−factor (Landé factor), which is about 2. The coupling of the magnetic field to the orbital motion of the electron is described by the vector potential A, such that H=∇×A . (3.4) We shall use the gauge in which ∇ · A = 0. (One can always shift the vector potential by an arbitrary function ∇χ and make ∇ · A = 0 without changing the magnetic field, which is the physical quantity). We hence take the vector potential to be 1 A(r) = − r × H . 2 (3.5) (Note that the magnetic field is uniform.) The vector potential modifies the kinetic energy, making the momentum of the i−th electron, pi , to be pi + (e/c)A(ri ). The kinetic energy part of the Hamiltonian becomes, in the presence of a uniform magnetic field, ´2 1 X 2 1 X³ e pi → pi + A(ri ) . 2m i 2m i c 21 (3.6) It follows from Eqs. (3.1) and (3.6) that the change in the Hamiltonian of the electrons due to the magnetic field is ∆H = g0 µB H · S − ´ e X e2 X ³ 2 2 2 pi · ri × H + r H − (r · H) . i i 2mc i 8mc2 i (3.7) The terms linear in the magnetic field can be combined together. Since the total electronic angular momentum of the electrons, L, is ~L = X ri × pi , (3.8) ³ ´ µ B H L + g0 S . (3.9) i the linear terms give We can therefore write the change in the Hamiltonian in the form ´ ³ ´ X³ e2 2 2 2 xi + y i . ∆H = µB H L + g0 S + H 8mc2 i (3.10) In writing down the second term here we have assumed that the magnetic field is along the z−direction. Once we know the modifications of the Hamiltonian in the presence of a uniform magnetic field, we can find the change in the energy of the system (or the change in the free energy) and use them in order to compute the magnetic properties of our system. The magnetic susceptibility. The response of a system to a magnetic field is characterized by its magnetic susceptibility. This quantity is defined as follows. Let us consider a quantum-mechanical system at zero temperature, and calculate the change in the ground state energy, E0 , under the application of a magnetic field. Then, the magnetization density is given by M(H) = − 1 ∂E0 (H) , V ∂H (3.11) where V is the volume. The susceptibility, χ, is defined by χ= ∂M . ∂H (3.12) At finite temperatures, where the system is not in the ground state, we have to replace in the above definitions the ground state energy by the free energy. 22 Larmor diamagnetism. When a solid consists of ions whose all electronic shells are filled, the wave function of the ground state is characterized by zero angular momentum (since such ions are spherically symmetric) and zero spin. In such a case there is no contribution to the ground state energy from the term linear in H [see Eq. (3.10)], and we are left with ∆E0 = X e2 2 H hΨ | ri2 |Ψ0 i . 0 12mc2 i (3.13) The magnetic susceptibility given in Eq. (3.12) is negative, and the material is diamagnetic. This is dubbed ‘Larmor diamagnetism’. Materials in which the magnetic susceptibility is negative are called ‘diamagnetic’ since in the presence of a magnetic field their energy increases, they try to avoid it by directing the induced magnetic moment opposite to the field. ∗ ∗ ∗ exercise: Explain how Eq. (3.13) is obtained, find an explicit form for the diamagnetic Larmor susceptibility and estimate its magnitude. Partially filled shells. A partially filled ion is an ion whose all shells are either completely filled or completely empty, except for one (the ‘outer’ shell). There are two questions to be asked: (a) what is the modification of the ground state energy caused by the magnetic field, and (b) how is the ground state specified. The first question is somewhat easier. Going back to Eq. (3.10), we use perturbation theory to find the change in the energy caused by the extra term in the Hamiltonian, ∆H. The calculation of the change in the energy in perturbation theory is carried out as follows. The full Hamiltonian is H + ∆H, where ∆H is assumed to be small. The eigen functions of the part H of the Hamiltonian are denoted Ψn , and their corresponding energies are En . It is important to remember that the eigen functions form a complete orthonormal basis. In order to find the correction of the ground state energy, we write the (full) Schrödinger equation in the form ³ ´³ i´ Xh (2) H + ∆H Ψ0 + a(1) Ψ + a Ψ + . . . n n n n ³ = E0 + n (1) E0 + (2) E0 i´ ´³ Xh (2) (1) + . . . Ψ0 + an Ψn + an Ψn + . . . . (3.14) n (i) Here, n runs over all eigen values, the coefficients an give the correction of order i (i = 1, 2, . . .) of the eigen functions, (namely, the corrections to the eigen functions are expanded 23 (i) in the complete basis formed by the Ψn ) and E0 is the correction of order i of the ground state energy. The next step is to equate identical orders in Eq. (3.14). (i) (i) At order zero, ∆H = 0, and all an and E0 are zero as well. Equation (3.14) is then HΨ0 = E0 Ψ0 . (3.15) In first order in ∆H, Eq. (3.14) is H X a(1) n Ψn + ∆HΨ0 = E0 X (1) a(1) n Ψn + E0 Ψ0 . (3.16) n n When his equation is multiplied on the left by Ψ0 , it gives (1) E0 = hΨ0 |∆H|Ψ0 i , (3.17) and when it is multiplied from the left by any other eigen function Ψ` , ` 6= 0 it gives (1) a` = hΨ` |∆H|Ψ0 i , E0 − E` ` 6= 0 . (3.18) To second order in the perturbation ∆H, Eq. (3.14) gives H X a(2) n Ψn + ∆H X n a(1) n Ψn = E0 X n (1) a(2) n Ψn + E0 n X (2) a(1) n Ψn + E0 Ψ0 . (3.19) n Multiplying from the left by Ψ0 , we obtain (2) E0 = X a(1) n hΨ0 |∆H|Ψn i n = X hΨn |∆H|Ψ0 ihΨ0 |∆H|Ψn i n6=0 E0 − En = X |hΨn |∆H|Ψ0 i|2 n6=0 E0 − En . (3.20) We have used here Eq. (3.18). Obviously, we can use the second-order equation to find other coefficients in the expansion of the eigen functions, but those are not required for our purposes. ∗ ∗ ∗ exercise: Does the second-order correction to the energy have a definite sign? what is this sign? what happens to the second-order corrections of energies which are not the ground state energy? In our case, ∆H, Eq. (3.10), includes a term linear in the magnetic field, and a term which is quadratic in the magnetic field. Therefore, the correction to the ground state energy, valid 24 up to second order in the magnetic field is ∆E0 =µB H · hΨ0 |L + g0 S|Ψ0 i + + X e2 2 H hΨ | (x2i + yi2 )|Ψ0 i 0 2 8mc i X |hΨ0 |µB H · (L + g0 S|Ψn i|2 n6=0 E0 − En . (3.21) Hund’s rules. In order to find the magnetic nature of systems made of partially filled ions, we now need to (a) specify the ground state (in the absence of the magnetic field), (b) insert the result in Eq. (3.21) to find the change in the energy of the ground state, and (c) take the second derivative with respect to the magnetic field and find the magnetic susceptibility. For example, in the case of transition metals, e.g., copper, the outer shell is the d−shell, of angular momentum ` = 2. [This means that the orbital angular momentum squared of each electron–the expectation value of L2 , has the value `(` + 1).] The projection of the angular momentum vector along the z−direction, `z , can take 2` + 1 values, `z = −`, −` + 1 , . . . ` − 1 , ` . (3.22) Hence the d−shell is five-fold degenerate, namely, there are five single-electron wave functions (or orbitals) corresponding to the d−shell. Each of these orbitals can accumulate two spin directions, namely it may have sz = ±1/2, and therefore the full degeneracy of the d−shell is 10. In other words, we can put up to 10 electrons in the d−shell. Copper, for example, has 9 electrons in that shell. In general, the number of electrons in the outer shell is n, such that 0 < n < 2(2` + 1). If these electrons do not interact with each other, then there are many ways to distribute n electrons on 2(2` + 1) levels. However, the electron-electron interactions, and the spin-orbit interaction, reduce significantly the number of these different possibilities. This is achieved according to famous rules (which are in fact only approximate), called the Hund rules. We shall state these rules without their derivation, assuming that the many-electron eigen states and eigen energies of the ion are characterized by the quantum numbers corresponding to the total spin of the electrons, S, their total orbital angular momentum, L, and their total angular momentum, J. Hund’s first rule. The electronic states with the lowest energy are those with the largest value of the total spin, such that these states are still consistent with the exclusion principle. 25 This means that as long as the number of electrons, n, is such that n ≤ 2` + 1, all their spins are parallel, and S = n/2. When n > 2` + 1, the total spin is reduced. Hund’s second rule. The electronic states with the lowest energy have the largest possible P value of the angular momentum, L = | `z |, which is consistent with Hund’s first rule and with the exclusion principle. Hund’s third rule. This rule has to do with the total angular momentum, J. The total angular momentum takes integral values in the range |L − S| and L + S. Therefore, once S and L are given, there are still (2L + 1)(2S + 1) many-electron possible states. (Remember that the degeneracy of a level with a certain J is 2J + 1, since Jz = −J, −J + 1, . . . , J.) Hund’s third rule uses the spin-orbit interaction to choose the ground state(s) among these states. The spin-orbit interaction reads λL · S, where λ is the spin-orbit coupling. It turns out that λ > 0 for shells that are less than half filled and is negative for shell which are more than half filled. Hund’s third rule tells us that J = |L − S| when n ≤ 2` + 1, because then the spin-orbit interaction (with λ > 0) reduces the energy, and J = L + S, for n ≥ 2` + 1, for the same reason(with negative λ). The ground state of a d−shell ion. n 2 1 0 -1 1 ↑ 2 ↑ ↑ 3 ↑ ↑ ↑ 4 ↑ ↑ ↑ ↑ 5 ↑ ↑ ↑ ↑ 6 ↑↓ ↑ ↑ 7 ↑↓ ↑↓ 8 ↑↓ 9 10 -2 S L J 1/2 2 3/2 1 3 2 3/2 3 3/2 2 2 0 ↑ 5/2 0 5/2 ↑ ↑ 2 2 4 ↑ ↑ ↑ 3/2 3 9/2 ↑↓ ↑↓ ↑ ↑ 1 3 4 ↑↓ ↑↓ ↑↓ ↑↓ ↑ 1/2 2 5/2 ↑↓ ↑↓ ↑↓ ↑↓ ↑↓ 0 0 0 ∗ ∗ ∗ exercise: Prepare a similar table for the ions with partially filled f −shell (L = 3). 26 Hund’s three rules determine the ground state(s) of the partially-filled ion. However, that ground state is still degenerate. Take for example, the case n = 2 in the Table. After applying Hund’s first and second rules, it has total spin S = 1 and total orbital angular momentum L = 3. This means that the states with J = 2, 3, and 4 are all possible. This gives for the case of n = 2 electrons 5 + 7 + 9 = 21 options. (Note that in this case, (2L + 1)(2S + 1) = 21.) However, Hund’s third rule tells us that the lowest energy is obtained for J = |L − S| = 2, and therefore, the ground state of a partially-filled d−shell with two electrons has J = 2 and is 5−fold degenerate. ∗ ∗ ∗ exercise: Repeat this argument and find the degeneracy of all ground states corresponding to the d and f shells. Now that we have specified the ground state(s) of the ions, we turn to the calculation of the ground state energy. Here we distinguish between two possibilities: either the ground state is non degenerate, which happens when J = 0, or it is degenerate. If it is not degenerate, we may use Eq. (3.21) for the energy. It turns out that hΨ0 |L + g0 S|Ψ0 i = 0 when J = 0, but hΨn |L + g0 S|Ψ0 i 6= 0 (this will be explained below). Therefore, only the two terms in Eq. (3.21) which are quadratic in H contribute to the energy. The first one leads to diamagnetism, as we have found above, and yields the Larmor diamagnetic susceptibility. The second term quadratic in H yields positive magnetization, which means that the material is paramagnetic. In a paramagnetic material, the application of a magnetic field reduces the energy, and therefore the material does not try to ‘oppose’ the effect of the magnetic field, as is the case with a diamagnetic material. We see that partially filled band with J = 0 can be either paramagnetic or diamagnetic, depending on the competition between the two H2 −terms in Eq. (3.21). (Note that this is correct as long as one can deduce the magnetization from the ground state energy alone, namely, when the usual thermal energy is not enough to excite higher energy states.) When J 6= 0 the ground state energy is 2J + 1−fold degenerate, and Eq. (3.21) for the ground state energy cannot be used. The application of the magnetic field removes this degeneracy, but then we need to diagonalize an (2J + 1) × (2J + 1) matrix, made of the matrix elements hJLSJz |(Lz + g0 Sz |JLSJz0 i. Luckily enough, there is a theorem, called the 27 Eckart-Wigner theorem, which states that within the 2J + 1 manifold, hJLSJz |(Lz + g0 Sz |JLSJz0 i = g(JLS)Jz δJz Jz0 , (3.23) where g(JLS) is a number which depends on the values of J, L, and S. Therefore, (to first order in the magnetic field H), the ground state energy splits into a ladder-like spectrum of 2J +1 levels. However, since in the absence of the field the ground state energy is degenerate, we must take into account the entropy in calculating the magnetic susceptibility, in addition to the energy. In other words, we need to find the free energy. Curie’s law. The free energy, F, of an ion, whose relevant possible energies are given by E(Jz ) ≡ γHJz , γ = g(JLS)µB , (3.24) is given by −βF e ≡ J X e−βγHJz = Jz =−J eβγH(J+1/2) − e−βγH(J+1/2) . eβγH/2 − e−βγH/2 (3.25) The magnetization of such an ion is given by [cf. Eq. (3.11)] ∂F = γJBJ (βγJH) , ∂H ³ (2J + 1)x ´ ³x´ 1 2J + 1 coth − coth is the Brillouin function . (3.26) where BJ (x) = 2J 2J 2J 2J M≡− Note that the Brillouin function approaches 1 as x → ∞ (since then the coth approaches 1). This means that when the Zeeman energy γJH is much larger than the thermal energy, the magnetization of the ion attains its maximal value, γJ. At temperatures such that the thermal energy is larger than the Zeeman energy, we use the fact that coth(x) ' 1 x + x3 , to find BJ (x) ' (J + 1)x . 3J (3.27) It therefore follows that ¯ ¯ χ¯ single ion = (gµB )2 J(J + 1) , kB T À gµB H . 3 kB T (3.28) To obtain the susceptibility of the entire solid, we multiply this susceptibility by the density of ions in the solid. 28 Equation (3.28) is the Curie’s law. It tells us that partially-filled ions with J 6= 0 are, generally, paramagnetic, and that their inverse susceptibility is proportional to the temperature, at temperatures which are not too low. ∗ ∗ ∗ exercise: Derive in detail Eqs. (3.25), (3.26), (3.27), and (3.28). Give explicit expressions for the case J = 1/2, and compare with Eq. (3.30) below. Plot the magnetization and the susceptibility for this specific case, as function of the temperature. ∗∗∗ exercise: Consider an ion with a partially filled shell of total angular momentum J, and Z additional electrons in filled shells. Show that the ratio of the paramagnetic susceptibility to the Larmor diamagnetic susceptibility is χpar 2J(J + 1) ~2 =− , χdia ZkB T mhr2 i (3.29) and estimate its magnitude. In order to clarify the use of the free energy [see Eq. (3.25) above], let us consider the magnetization of a single spin 1/2, as function of the temperature. A spin half, in the presence of a magnetic field H, can be either aligned with the field, in which case its energy is enhanced by µB g0 H/2, or it can be anti parallel to the field, in which case its energy is reduced by µB g0 H/2. It is hence clear that at zero temperature, the spin will be anti parallel to the field, namely, it will be magnetized. However, at very high temperatures, that spin has equal probabilities to be aligned or anti aligned with the field, in which case its average magnetic approaches zero. At temperature T , the average magnetization of the spin is M = µB g0 0.5eβµB g0 H/2 − 0.5e−βµB g0 H/2 µB g0 βµB g0 H = tanh . eβµB g0 H/2 + e−βµB g0 H/2 2 2 (3.30) We can re-derive this formula using the definition of the free energy, Eq. (3.25), which in this case is simply ´ ³ F = −kB T ln eβµB g0 H/2 + e−βµB g0 H/2 . (3.31) It is easy to verify that using this free energy in Eq. (3.11) gives the result (3.30). Pauli paramagnetism. Here we consider the contribution of the conduction electrons to the magnetic moment of the crystal. Stated in other words, we consider the (para)magnetism of metals, whose conduction electrons can be considered as free electron gas. 29 The magnetic moment of the free electron gas can be obtained as follows. Each electron has spin half, and therefore its energy is enhanced when it is aligned with the field, and is reduced when its spin is anti-parallel to the field. All we have to do is to find how many of the electrons at temperature T are aligned with the field, and how many of them are anti parallel to the field, and take the difference. For simplicity, we assume in this calculation that the Landé factor g0 is 2. The number of electrons having a certain energy E at temperature T is given by the Fermi distribution, f (E) = (eβE + 1)−1 (energies are measure with respect to the chemical potential). The number of energy levels of about the same energy E is given by the density of states (per unit volume), N (E). The chemical potential of the electrons with their spin aligned with the field is decreased by µB H, and the chemical potential of those which are anti parallel to the field is increased by the same amount. Hence, the density of electrons aligned with the field is Z n+ = dEN (E) 1 eβ(E+µB H) +1 , (3.32) and the density of those which are anti parallel to the field is Z 1 n− = dEN (E) β(E−µ H) . B e +1 The magnetic moment of the electron gas is Z ³ M = µB (n− − n+ ) = µB dEN (E) 1 eβ(E−µB H) + 1 − (3.33) 1 eβ(E+µB H) + 1 Expanding in µB H ¿ EF (where EF is the Fermi energy), we find Z ³ ∂f ´ 2 M ' 2µB H dEN (E) − . ∂E ´ . (3.34) (3.35) Since minus the derivative of the Fermi energy is very close to a delta-function confining the energy to be about the Fermi energy, we see that magnetization is simply given by the density of states at the Fermi energy, M ' 2µ2B HN (EF ) . (3.36) It also follows that the paramagnetic susceptibility of the free electron gas, which is called Pauli paramagnetism, is essentially independent of the temperature. ∗ ∗ ∗ exercise: Compare Eqs. (3.30) and (3.36), and discuss the similarity and the difference between the two cases. 30 4. EXCHANGE INTERACTIONS The dipolar interaction. The direct dipolar interaction of two magnetic dipoles, m1 and m2 , separated by a distance r reads ´ 1³ U = 3 m1 · m2 − 3(m1 · r̂)(m2 · r̂) . r (4.1) In order to estimate its magnitude, we take m1 ' m2 ' gµB ' e~/mc. Then, using a0 = ~2 /me2 and α = e2 /(~c) = 1/137, ³ e~ ´2 1 ³ a ´3 e2 ³ e~ ´2 1 ³ me2 ´2 0 U' = 3 mc r r a0 mc e2 ~2 ³ a ´3 e2 ³ 1 ´2 ³ a ´3 ³ 1 ´2 0 0 = Rd . = r a0 137 r 137 (4.2) Since 1 Ry=13.6 eV and r is about several Bohr radii, this energy is about 10−4 eV (which amounts to a temperature of a few degrees), and is far too small small to explain the typical magnetic energies. ∗ ∗ ∗ exercise: What is the preferred direction of two identical magnetic dipoles interacting via the dipolar interaction? [Answer: The dipolar interaction is minimal when the two dipoles are parallel to one another and to the radius vector r. If they are perpendicular to the radius vector, then they prefer to be anti-parallel.] The exchange energy. Let us consider two electrons interacting via the Coulomb interaction alone (namely, we neglect spin-dependent interactions like the spin-orbit interaction, etc.). The Hamiltonian reads H(1, 2) = − ~2 ∂ 2 e2 ~2 ∂ 2 − + V (r ) + V (r ) + , 1 2 2m ∂r21 2m ∂r22 r12 (4.3) where V (r) is due to the ions, and r12 = |r1 − r2 |. When there are two independent orbital states, ψa (r) and ψb (r), the orbital part of the two-electron wave function can be ´ 1 ³ √ Ψ(1, 2) = ψa (1)ψb (2) + ψb (1)ψa (2) , singlet , 2 ´ 1 ³ Ψ(1, 2) = √ ψa (1)ψb (2) − ψb (1)ψa (2) , triplet . 2 Let us now calculate the total energy of the system. We have Z Z Z Z ∗ dr1 dr2 Ψ (1, 2)HΨ(1, 2) = dr1 dr2 |ψ1 (a)ψb (2)|2 H Z Z ± dr1 dr2 ψa∗ (1)ψb∗ (2)ψa (2)ψb (1)H . 31 (4.4) (4.5) (We have used here the fact that the Hamiltonian is invariant under the change 1 ↔ 2.) The integral Z Z Jab = dr1 dr2 ψa∗ (1)ψb∗ (2)ψa (2)ψb (1)H , (4.6) is called the exchange integral. When ψa is orthogonal to ψb , only the Coulomb interaction contributes to this integral. Hence Z Z e2 Jab = dr1 dr2 ψa∗ (1)ψb∗ (2)ψa (2)ψb (1) . r12 (4.7) Let us denote ρ(r) = ψa (r)ψb∗ (r) . Then the exchange integral can be written in the form Z Z e2 Jab = dr1 dr2 ρ∗ (r1 ) ρ(r2 ) . r12 (4.8) (4.9) Let us further denote Z φ(r) = dr0 e2 ρ(r0 ) . |r − r0 | (4.10) By its definition, φ(r) satisfies the Poisson equation, 4φ(r) = −4πe2 ρ(r) . (4.11) Using this in Eq. (4.9), we have 1 Jab = − 4πe2 Z dr(4φ∗ (r))φ(r) . (4.12) We can now use Green’s theorem (in other words, integrate by parts). Since the surface contribution vanishes as |r| → ∞, we obtain Z 1 dr|∇φ(r)|2 > 0 . Jab = 4πe2 (4.13) It follows that the exchange energy is positive. Returning to the expression for the energy, Eq. (4.5), we see that the spatial integration in the first term there can be written in the form Z Z Z Z 2 2 dr1 dr2 |ψa (1)ψb (2)| H = dr|ψa (r)| H0 + dr|ψb (r)|2 H0 Z Z e2 + dr1 dr2 |ψa (1)ψb (2)|2 ≡ Ea + Eb + Kab . r12 32 (4.14) The term Kab is called the Coulomb integral. Here, H0 = −(~2 /2m)4 + V (r) is the single electron part of the Hamiltonian. In summary, we have found that the energy of the two electron system, which is described by a spin-independent Hamiltonian is given by Ea + Eb + Kab + Jab when the two electrons are in the symmetric spatial wave function, and by Ea + Eb + Kab − Jab when they are in the anti-symmetric one. This property of the two electron system can be written in terms of spin operators, in the form 1 H = Ea + Eb + Kab − Jab (1 + 4sa · sb ) . 2 (4.15) In order to prove this statement, we note that 2sa · sb = (sa + sb )2 − s2a − s2b = s2 − 3 . 2 (4.16) In the singlet state, s = 0, and hence sa · sb = −3/4; in the triplet state, s = 1 (and s2 = s(s + 1) = 2), and therefore sa · sb = 1/4. The Heisenberg interaction is based on the above picture. It reads H=− X Jij si · sj . (4.17) hiji This Hamiltonian is the basis for most of the investigations in magnetism. When J is positive, the ground state of the system is ferromagnetic: the spins are aligned all in the same direction. When J is negative, the system has an antiferromagnetic order. Super-exchange. Consider an atom with a single orbital (i.e., a single energy level and a single wave function). When this level is empty, the energy is zero; if one electron is on the level, then the energy is ε; and if two electrons occupy the atom, then the energy is 2ε + U , where U represents the Coulomb repulsion. We denote the single-electron wave function by |Ri. Now consider a molecule made of two such atoms. We denote the wave function of the electron when it is on the second atom by |R0 i, and allow for quantum mechanical tunneling processes between the two atoms, such that the single-electron Hamiltonian, denoted h, has the matrix elements hR|h|R0 i = hR0 |h|Ri = −t , 33 (4.18) in addition to hR|h|Ri = hR0 |h|R0 i = ε . (4.19) ∗ ∗ ∗ exercise: Find the eigen energies and the eigen functions of the molecule when the electrons are spinless. When the two electrons are in the singlet state, their spatial (symmetric) wave functions are ´ 1 ³ 0 0 Φ0 = √ |Ri|R i + |R i|Ri , Φ1 = |Ri|Ri , Φ2 = |R0 i|R0 i . 2 (4.20) The matrix of the Hamiltonian in the singlet subspace is (assuming that hR|R0 i = 0, hR|Ri = hR0 |R0 i = 1) H H H 2ε 00 01 02 √ H10 H11 H12 = − 2t √ H20 H21 H22 − 2t √ − 2t − 2t 2ε + U 0 . 0 2ε + U √ (4.21) This Hamiltonian matrix has the eigen energies U 2ε + U , 2ε + ± 2 r U2 + 4t2 , 4 and therefore the ground state of the singlet state has the energy r U U2 4t2 sing 2 Eg = 2ε + − + 4t ' 2ε − , for t ¿ U . 2 4 U (4.22) (4.23) Namely, the electrostatic energy reduces the total energy of the two electrons. However, this forces them to have opposite spins. In other words, the electrostatic energy favors the antiferromagnetic state. ∗ ∗ ∗ exercise: Derive explicitly Eqs. (4.21) and (4.22). Find the exact ground state of the singlet. Plot the probability to find the two electrons on the same atom in this state as function of U/t and explain the result. Curie-Weiss law and ferromagnetism. Let us first re-visit the derivation leading to the Curie law, Eq. (3.28). When we suppose that each atom behaves like a small magnet of moment µ, then in the magnetic field H it acquires the energy −µ · H. We further assume that each atom is independent of its neighbors and can rotate freely under the effect of the 34 temperature. Since the atom is localized, it satisfies the Boltzmann statistics, and therefore its average moment is given by R dΩµeβ µ·H R hµi = , dΩeβ µ·H (4.24) where dΩ is the element of the solid angle for rotation. The total magnetization of a system of N atoms will be Nhµi, and its magnetic susceptibility will be in general a tensor (the derivative of µi with respect to Hj ). Confining ourselves to cubic symmetry, we find D ∂µ E N 1 Nhµ2 i χ=N ' hµ · µi = , for small enough H . (4.25) ∂H kB T 3 kB T This is the Curie law. ∗∗∗ exercise: Find the temperature dependence of the total magnetization of the system by using Eq. (4.24) in the limit of small magnetic fields, and compare with the results (3.26) and (3.27). In many cases the interaction between neighboring atoms cannot be ignored. To account for this effect approximately, one may introduce an internal magnetic field exerted on each atom by its neighbors. This field is called the Weiss field (or sometimes molecular field). It is plausible to assume that the internal field is proportional to the average of the magnetic moment, and that the proportionality constant reflects the strength of the inter-atom interactions. In other words, λ is related to the exchange coupling J, see Eq. (4.17). Hence, the internal magnetic field is HI = λNhµi . The total field acting on each atom is now H + HI , and therefore we find R dΩµeβ µ·(H+HI ) N Nhµi =N R ' hµµ(H + HI )i β µ·(H+HI ) kB T dΩe N hµ2 i ' (H + λNhµi) . kB T 3 (4.26) (4.27) Let us define an effective temperature, Θ, Θ≡ λNhµ2 i . 3kB (4.28) Then the susceptibility is χ=N D ∂µ E ∂H Nhµ2 i = . 3kB (T − Θ) 35 (4.29) This is the Curie-Weiss law for the magnetic susceptibility of a ferromagnet. In particular, the susceptibility diverges as the temperature approaches the ferromagnetic critical temperature, (the Curie temperature), which in our approximation is Θ. In order to show that Θ is indeed the critical temperature of our model, we need to return to the full calculation of the magnetic moment, as is done for example in Eq. (3.26). There we have found that the total magnetization, i.e., Nhµi, is given by the Brillouin function, ³ ´ ³ ´ (2J+1)x 1 x BJ (x) = 2J+1 coth − coth , with x = H/kB T . For the case J = 1/2 the 2J 2J 2J 2J Brillouin function reduces to tanh, and hence Nhµi = Ntanh ³H + H ´ I kB T = Ntanh ³ H + λNhµi ´ kB T . (4.30) Let us now consider this equation when the applied magnetic field H is zero. Obviously, hµi = 0 is a solution for this equation. However, there might be another solution: since tanhx ' x − x3 /3 for small x, we see that as long as the temperature is less than λN/kB (which is up to a numerical constant equal to Θ above), we can have a nonzero solution for the magnetization. In other words, the system sustains a spontaneous magnetic order below some critical temperature. Antiferromagnetism. We may view an antiferromagnet as consisting of two sublattices. At zero temperature, the average magnetizations of the two sublattices are anti parallel to one another. At finite temperatures, the internal fields acting on the two sublattices are different. The internal field acting on an atom belonging to the ‘-’ sublattice is H− = λNµ+ , (4.31) while the internal field on on an atom of the ‘+’ sublattice is H+ = λNµ− , (4.32) where now λ is negative. At sufficiently high temperatures, we can try to perform the same calculation as the one in Eqs. (4.27), (4.28), and (4.29). This will give us (note that N is the number of atoms in 36 each sub lattice) N kB T N Nhµ− i ' kB T Nhµ+ i ' hµ2 i (H + λNhµ− i) , 3 hµ2 i (H + λNhµ+ i) . 3 (4.33) Adding the two equations, we find that we have exactly the same result as before. However, in the present case, λ is negative, and therefore it is better to define ΘN ≡ − λNhµ2 i , 3kB (4.34) to obtain χ=N D ∂µ E ∂H = Nhµ2 i . 3kB (T + ΘN ) (4.35) The paramagnetic susceptibility of an antiferromagnet is smaller than the one of a ferromagnet. The temperature ΘN is called the Néel temperature. Spin waves. Let us consider the Heisenberg Hamiltonian for a ferromagnet, H=− X J``0 S` · S`0 . (4.36) h``0 i We now assume that in the ground state of our system, all spins are aligned: they have their maximal possible value of S z . S`z |Si` = S|Si` . (4.37) The ground state itself is the product, |0i = |Si1 |Si2 . . . |SiN . (4.38) We now introduce the spin-deviation operators, S ± , (dropping the subscript ` for brevity) S ± = S x ± iS y . (4.39) We can show that the application of these operators on the eigen states of S z gives another eigen state of S z , ³ ´ ³ ´ + z x y x z y y z x z S S |Si = S (S + iS )|Si = S S + iS + i[S S − iS ] |Si ³ ´ = S + S z |Si + S + |Si = (S + 1) S + |Si . 37 (4.40) In a similar fashion, ³ ´ ³ ´ S z S − |Si = (S − 1) S − |Si . (4.41) Using the spin-deviation operators, we re-write the Heisenberg Hamiltonian in the form H=− X ³ J``0 S`z S`z0 h``0 i ´ 1 + − − + + [S` S`0 + S` S`0 ] . 2 (4.42) As a result we see that H|0i = − X J``0 S 2 |0i , (4.43) h``0 i namely, only the product of S z in Eq. (4.42) contributes to the ground state energy. What are the excitations of this system? One may think that an excited state is a state in which one spin deviates from being maximally aligned. However, this is not an eigen state, because the operation of S`− S`+0 will shift the deviation to the nearest neighbor. In order to specify the excited states, we use the following description. Let us denote by |ni a state which has n deviations (n < 2S). Operating on this state with S + will reduce n by 1, and operating with S − will increase it by 1. We see that in some sense S + plays the role of an annihilation operator, denoted a, and S − plays the role of a creation operator, denoted a† . In fact, we know that [S + , S − ] = i[S y , S x ] − i[S x , S y ] = 2S z . (4.44) S+ S− a= √ , a† = √ , 2S z 2S z (4.45) So, if we define we find that the creation and annihilation operators obey the usual commutation law, [a, a† ] = 1 . (4.46) The definition (4.45) is not really proper, since S z is also an operator. However, for large enough values of S, we can replace S z by its average, S. This means that we may consider only small deviations from the ground state (and therefore this theory is not valid at all for spins 1/2). 38 Now the spin deviation states are eigen states of a† a, and we have a† a|ni = n|ni , a|ni = √ √ n|n − 1i , a† |ni n + 1|n + 1i , (4.47) and S z = S − a† a . (4.48) Putting all this back into the Hamiltonian, Eq. (4.42), we find H'− X h``0 i '− X ´ ³ J``0 [S − a†` a` ][S − a†`0 a`0 ] + S[a†` a`0 + a` a†`0 ] ³ ´ J``0 S 2 + S[a†` a`0 + a` a†`0 − a†` a` − a†`0 a`0 ] . (4.49) h``0 i The first (non operator) term here is just the ground state, so we drop it when considering the excitations. The Hamiltonian now resembles the one of lattice vibrations. As in that case, it is expedient to introduce the Fourier transforms r r 1 X 1 X † −iq·R` † iq·R` aq e , a` = a e . a` = N q N q q (4.50) (R` is the radius-vector to site ` of the lattice.) Then, taking into account that in Eq. (4.49) ` and `0 are nearest neighbors and that J``0 depends only on |R` − R`0 | we find H= X³X q ´ 2SJ(1 − e−iq·Rnn ) a†q aq , (4.51) Rnn in which Rnn is the radius-vector to the nearest neighbors. Since the form (4.51) is exactly the same as encountered in the theory of lattice vibrations, we can simply use all the know results for this problem as well. Noting that the dispersion of the spin wave excitations, ωq = X 2SJ(1 − e−iq·Rnn ) ' 2SJq 2 a2 , (4.52) Rnn we can find the average occupation number of each mode, ha†q aq i = 1 eβωq /kB T 39 −1 , (4.53) is given by the Bose distribution. It follows that the temperature dependence of the magP netization, M (T ), is given by N S − q ha†q aq i. ∗∗∗ exercise: Find an explicit form for the temperature dependence of the magnetization using the spin wave theory. Find the specific heat of the spin wave excitations. 40 5. SUPERCONDUCTIVITY The Meissner effect. An ideal superconductor permits no magnetic field in its interior. In other words, a superconductor is a perfect diamagnet. Not only a magnetic field is excluded from entering a superconductor, (which might have been explained by perfect conductivity), but a field in an originally normal (i.e., not superconductive) sample is expelled as the sample is cooled below the superconductive transition temperature, Tc . This fact cannot be explained by perfect conductivity. This is the Meissner effect. On the other hand, the Meissner effect implies that superconductivity can be destroyed once the magnetic field exceeds a certain value, the critical magnetic field, Hc . Let us denote the free energy density of a system when it is above the superconducting transition temperature by fn . As the system is cooled below Tc (in the absence of a magnetic field) it goes into the superconductive state, since in that state its free energy is lower. Let us denote that free energy density by fs . Then the critical magnetic field is given by Hc2 (T ) = fn (T ) − fs (T ) . 8π (5.1) (Remember that the free energy of a magnetic field is given by the volume integral of H 2 /8π.) Equation (5.1) implies that the critical magnetic field depends on the temperature, and it vanishes at T = Tc . In fact, Hc (T ) ' Hc (0)(1 − (T /Tc )2 ) . (5.2) The Meissner effect is described by the (phenomenological) London equations. Let us consider the motion of the electrons in a perfect conductor, i.e., when they are accelerated by an electric field without any dissipation. Let us also assume that the number density of such ‘dissipation-less’ electrons is ns , and their common velocity is vs . We can then write m dvs = −eE , dt (5.3) where E is the electric field, and n e2 dJs = s E, dt m Js = −evs ns . (5.4) On the other hand, the Faraday’s law, ∇×E=− 41 1 ∂H , c ∂t (5.5) gives ∂³ n e2 ´ ∇ × Js + s H = 0 . ∂t mc (5.6) The London equation states that in a superconducting system, not only the time derivative above vanishes, but that ns e2 ∇ × Js + H = 0 , London equation . mc (5.7) Since the Maxwell equation gives us that ∇×H= 4π J , c s (5.8) we arrive at the result that in a superconducting system ∇2 H = 1 1 H , or ∇2 Js = 2 Js , 2 λ λ (5.9) where the penetration depth λ is given by λ2 = mc2 . 4πns e2 (5.10) We note that the penetration depth diverges as the number density of the dissipation-less electrons (i.e., superconducting electrons), ns , tends to zero. In other words, the penetration depth diverges as T → Tc . Let us consider a superconducting slab of finite thickness d, placed in a parallel magnetic field, Ha . The slab is perpendicular to the x direction. According to the first equation of (5.9), the magnetic field within the slab is H(x) = Aex/λ + Be−x/λ , (5.11) where A and B are constants. They are determined from the two requirements that at x = d/2 and at x = −d/2, the magnetic field equals the applied field, Ha . Hence, H(x) = Ha cosh(x/λ) . cosh(d/2λ) The average value of the field within the slab is Z 1 d/2 cosh(x/λ) 2λ dxHa = Ha tanh(d/2λ) . hHi = d −d/2 cosh(d/2λ) d 42 (5.12) (5.13) This average value consists of the (external) applied field, Ha plus the magnetization induced in the slab, i.e., hHi = Ha + 4πM . (5.14) When d À λ, hHi tends to zero, and therefore M →− Ha , dÀλ. 4π (5.15) The susceptibility is −1/4π, which means that a bulk superconductor is a perfect diamagnet. On the other hand, when d ¿ λ, hHi tends to Ha (1 − (d2 /12λ2 )), and therefore M →− ³ H ´³ d2 ´ a . 4π 12λ2 (5.16) We can estimate from this relation the critical magnetic field in the special case where the field is parallel to the slab. The critical field in this case is not the critical field of the material from which the slab is made, but it is the critical field of the material times the ratio λ/d, Hck = √ λ 12 Hc . d (5.17) ∗ ∗ ∗ exercise: Explain intuitively the result (5.17) (by considering the current needed to screen the field). What will be the critical magnetic field perpendicular to the slab? The Ginzburg-Landau (GL) theory. Ginzburg and Landau constructed a theory of superconductivity, by introducing a complex pseudo wave function ψ as an order parameter. The local density of the superconducting electrons is given by ns = |ψ(r)|2 . (5.18) The order parameter obeys the Ginzburg-Landau equations, 1 ³ e ´2 −i∇ − A ψ + β|ψ|2 ψ = −αψ , 2m c (5.19) and the equation for the supercurrent density Js , Js = −i e ³ ∗ e2 ψ ∇ψ − ψ∇ψ ∗ ) − |ψ|2 A . 2m mc 43 (5.20) Here, α and β are (temperature-dependent) parameters, and A is the vector potential which represents the action of a constant magnetic field, H = ∇ × A. Note that if we write the complex order parameter in the form ψ = |ψ|eiφ , (5.21) then the supercurrent is related to thegradient of the phase, ³ e e ´ 2 Js = |ψ| ∇φ − A . m c (5.22) The GL theory introduces a length, ξ= 1 , |2mα| (5.23) which characterizes the distance over which ψ(r) can vary. Near Tc ξ diverges as (Tc −T )−1/2 , since α vanishes as (T − Tc ). Thus, superconductivity is described by two lengths, the coherence length ξ and the penetration length λ. The ratio of these two lengths, κ= λ , ξ (5.24) is approximately temperature-independent. Type II superconductors are those for which √ κ > 1/ 2. The GL equations can be derived from the free energy density, f , which takes the form β 4 1 ¯¯³ e ´ ¯¯2 H 2 f = fn0 + α|ψ| + |ψ| + , ¯ −i∇ − A ψ ¯ + 2 2m c 8π 2 (5.25) by minimizing the free energy with respect to the complex order parameter. In a bulk superconductor, and in the absence of the magnetic field (A = 0) we can take the order parameterψ to be real, since in this case all the coefficient of the differential equation are real. In a homogenous bulk superconductor we do not expect any spatial variation, and therefore Eq. (5.19) takes the form αψ + βψ 3 = 0 . (5.26) This has two solutions: either the bulk is simply not a superconductor, i.e., ψ = 0, or 2 =− ψ 2 ≡ ψ∞ 44 α . β (5.27) This result leads to the identification of the temperature dependence of the coefficient α, α ∝ −Tc + T . (5.28) Namely, for temperatures below the transition temperature Tc where α is negative, the superconducting order parameter is non zero. It vanishes (continuously) at T = Tc , and then system phase transforms into the normal state. Since this happens continuously (by construction) the GL equation describes properly the second order phase transition. Note that this argument ignores the temperature dependence of the other coefficient β. It assumes that β depends only weakly on the temperature. Note that the superconducting free energy density, α|ψ|2 + β2 |ψ|4 is zero when ψ is zero, and is −|α|2 /(2β when |ψ|2 = −α/β, namely, it is lower in the superconducting state. Let us now assume that the superconductor occupies only half of the space, x > 0 (no fields are applied). Then the GL equation becomes αψ + βψ 3 − 1 ∂ 2ψ =0. 2m ∂x2 (5.29) Denoting ψ = ψ∞ f = ( p |α|/β)f , (5.30) (ψ∞ is the full value of the order parameter in the bulk superconductor), this equation becomes 2 1 d2 f 3 2d f + f − f ≡ ξ + f − f3 = 0 , 2 2 2m|α| dx dx (5.31) where we have used the definition (5.23) for the superconducting coherence length ξ. It is quite straightforward to solve Eq. (5.31). Denoting f 0 ≡ df /dx, and f 00 ≡ d2 f /dx2 , we multiply Eq. (5.31) by f 0 . Then, f f 0 = (1/2)df 2 /dx, f 3 f 0 = (1/4)df 4 /dx, and f 0 f 00 = (1/2)df 02 /dx, and hence 1 ´ d ³ 2 02 ξ f + f2 − f4 = 0 . dx 2 (5.32) This implies that the combination of terms within the brackets do not depend on x, and consequently their value is the same as for x → ∞, i.e., f = 1 [see Eq. (5.30)]. Namely, 1 1 1 ξ 2 f 02 + f 2 − f 4 = V (1 − f 2 (x))2 = ξ 2 f 02 (x) 2 2 2 df dx x √ √ V = V f (x) = tanh( ). 1 − f2 2ξ 2ξ 45 (5.33) We see that near the boundary, the order parameter decays to zero over a scale length of order ξ. ∗ ∗ ∗ exercise: Find and discuss the order parameter of a superconducting slab of width d, placed normal to the x axis (the slab is infinite along the y and the z directions). Discuss in particular the cases d ¿ ξ and d À ξ. The critical current. There are certain cases in which one can assume that the absolute value of the order parameter, |ψ|, does not vary spatially, however, its phase φ, does [see Eq. (5.21)]. This occurs when the spatial change in |ψ| has to occur over distances far smaller than ξ, and hence will cost too much kinetic energy. For example, if |ψ| has to change over the width d of a thin film, its change will be of order x/ξ ' d/ξ ¿ 1. In such cases the GL equations, [see Eqs. (5.22)) and (5.25)] take the form e e |ψ|2 (∇φ − A) ≡ e|ψ|2 vs , m c β mv 2 H 2 f = fn0 + α|ψ|2 + |ψ|4 + |ψ|2 s + . 2 2 8π Js = (5.34) In a very thin film or wire, of thickness d ¿ λ, one may neglect the magnetic energy density H 2 /(8π) as compared to the kinetic energy (the latter is of order λ2 , and the former of order d2 ). The super conducting free energy density is then fs = α|ψ|2 + mv 2 β 4 |ψ| + |ψ|2 s . 2 2 (5.35) minimizing it with respect to |ψ|2 , we find ³ ´ α + mvs2 /2 |α| ³ mvs2 ´ 2 2 |ψ| = − 1− = ψ∞ 1 − (ξmvs ) . = β β 2|α| 2 (5.36) The supercurrent is then Js = 2 eψ∞ vs ³ 2 1 − (ξmvs ) ´ . (5.37) We see that the supercurrent vanishes when vs = 0 and vs = 1/(mξ). Its maximal value (found below) is the maximal supercurrent that the system can carry. The value of vs at the maximum is obtained by minimizing Js with respect to vs , yielding mvs2 /2 = |α|/3. Inserting this value into Eq. (5.36), we find 2 2 |ψ|2 = ψ∞ , 3 46 (5.38) namely, in the presence of current, the superconducting order parameter is reduced as compared to its value in the bulk (and in the absence of a current). The critical current, Jc , is given by inserting the results of the minimization into the first of Eqs. (5.34), Jc = eΨ2∞ 2 ³ 2 |α| ´1/2 . 3 3m (5.39) Note that the critical current vanishes at the transition temperature Tc , where α = 0. The upper critical field, Hc2 . Let now consider how superconductivity is nucleated in the bulk, in the presence of a magnetic field H (along the z direction). We use the gauge ay = Hx , (5.40) and ignore in the GL differential equation the cubic term, since the system is only barely superconducting and the order parameter is therefore small. Then ³ ³ 2πH ´2 ´ 4πi ∂ 1 −∇2 + Hx + x2 ψ = 2 ψ , Φ0 ∂y Φ0 ξ (5.41) where the superconducting flux quantum is Φ0 = πc . e (5.42) By substituting ψ(r) = eiky y+ikz z f (x) , (5.43) we find ³1 ´ ∂ 2 f ³ 2πH ´2 2 2 − 2+ (x − x0 ) f = 2 − kz f , ∂x Φ0 ξ x0 = ky Φ 0 . 2πH (5.44) The problem now is the same as the Schrödinger equation for an harmonic oscillator, with the eigenvalues 1 1 ³ eH ´ ²n = (n + )ωc = (n + ) . 2 2 mc (5.45) In our case, however, the energy is given by the term on the right hand side of Eq. (5.44). The maximal magnetic field (corresponding to the lowest value of n, i.e., n = 0) which the system can support is therefore ´ Φ0 ³ 1 2 − kz . H= 2π ξ 2 47 (5.46) The truly maximal field is obtained for kz = 0 (i.e., no variation of the order parameter along the axis parallel to the field). Hence Hc2 = Φ0 . 2πξ 2 (5.47) Note that this field vanishes at the superconducting transition temperature. The BCS Hamiltonian. The ‘reduced’ Hamiltonian, which is assumed to include all interactions important for superconductivity, reads H= X ζk nkσ + X Vkk0 c†k↑ c†−k↓ c−k0 ↓ ck0 ↑ . (5.48) kk0 kσ Here, c†kσ is the operator that creates an electron at state of wave vector k and spin σ, ckσ is the operator that destroys such a state (these operators obey the anti commutation fermionic relations), nkσ is the number operator of electrons in the state k with spin σ nkσ = c†kσ ckσ , (5.49) and Vkk0 is the pairing interaction: it destroys a pair of electrons of opposite spins and momenta, and creates another pair of opposite spins and momenta. Finally, ζk is the single electron energy, measured from the Fermi energy. The BCS Hamiltonian may be solved using the mean-field approximation, which we have already encountered in the discussion of the Heisenberg exchange interaction. We replace c†k↑ c†−k↓ c−k0 ↓ ck0 ↑ =⇒ hc−k0 ↓ ck0 ↑ ic†k↑ c†−k↓ + hc†k↑ c†−k↓ ic−k0 ↓ ck0 ↑ − hc−k0 ↓ ck0 ↑ ihc†k↑ c†−k↓ i , (5.50) and denote bk = hc−k↓ ck↑ i , b∗k = hc†k↑ c†−k↓ i . (5.51) The model Hamiltonian, HM , that results reads HM = X ζk nkσ − X (∆k c†k↑ c†−k↓ + ∆∗k c−k↓ ck↑ − ∆k b∗k ) , kσ (5.52) k where we have defined ∆k = − X Vkk0 bk0 ≡ k0 X k0 48 Vkk0 hc−k↓ ck0 ↑ i . (5.53) This model Hamiltonian is diagonalized by the Bogoliubov transformation: † † ck↑ = u∗k γk0 + vk γk1 , c†−k↓ = −vk∗ γk0 + uk γk1 . (5.54) ∗ ∗ ∗ exercise: Check that in order for the new operators γk0 and γk1 to be fermions (that is, to obey the anti commutation relations), it suffices that |uk |2 + |vk |2 = 1. ∗ ∗ ∗ exercise: Show that † γk0 = uk ck↑ − vk c†−k↓ γk1 = u∗k c†−k↓ + vk∗ ck↑ . (5.55) Explain the meaning of these operators. The next step is to insert Eqs. (5.54) into the model Hamiltonian (5.52). This procedure gives HM = X h i † † † † ζk (|uk |2 − |vk |2 )(γk0 γk0 + γk1 γk1 ) + 2|vk |2 + 2u∗k vk∗ γk1 γk0 + 2uk vk γk0 γk1 k Xh † † (∆k uk vk∗ + ∆∗k u∗k vk )(γk0 γk0 + γk1 γk1 − 1) + (∆k vk∗ 2 − ∆∗k u∗k 2 )γk1 γk0 + k i † † + (∆∗k vk2 − ∆k u2k )γk0 γk1 + ∆k b∗k . (5.56) Now we see that when the condition 2ζk uk vk + ∆∗k vk2 − ∆k u2k = 0 (5.57) is satisfied, the Hamiltonian becomes diagonal. Moreover, upon multiplying this condition by ∆∗k /u2k we obtain ∆∗k vk = (ζk2 + |∆k |2 )1/2 − ζk ≡ Ek − ζk = real . uk (5.58) ¯v ¯ E − ζ 1³ ζ ´ ¯ k¯ k =⇒ |vk |2 = 1 − |uk |2 = 1− k . ¯ ¯= k uk |∆k | 2 Ek (5.59) This means that Inserting all these results into the model Hamiltonian (5.56), we finally obtain HM = X X † † (ζk − Ek + ∆k b∗k ) + Ek (γk0 γk0 + γk1 γk1 ) . k k 49 (5.60) The first term here is a constant, and the second is just the Hamiltonian of free fermions, with excitation energy Ek . We see from Eq. (5.58) that the excitation spectrum has a gap, of magnitude |∆k |. Another important point is that ∆k itself, as given in Eq. (5.53), becomes ∆k = − X Vkk0 hc−k0 ↓ ck↑ i = − k0 X Vkk0 u∗k0 vk0 h1 − γk∗0 0 γk0 0 − γk† 0 1 γk0 1 i , (5.61) k0 where, using Eq. (5.58) u∗k vk = ∆k . 2Ek (5.62) Since the model Hamiltonian is diagonal, we know that † † hγk0 γk0 i = hγk1 γk1 i = 1 e βEk +1 ≡ f (Ek ) . (5.63) Using this in Eqs. (5.61) and (5.62) yields the gap equation, ∆k = − X Vkk0 k0 ∆k0 βE 0 tanh k . 2Ek0 2 (5.64) In particular, with the simplifying BCS assumption, Vkk0 = −V , we find 1 1 X tanh(βEk /2) = . V 2 k Ek (5.65) One should note that the BCS assumption about the pairing potential V cannot hold over the entire Brillouin zone; it is expected to be valid in a narrow range of energies about the Fermi energy. That narrow energy range is limited by a certain energy, ωc . When the pairing potential is due to the electron-phonon interaction, then ωc is a typical phonon energy. Taking all this into account, we may convert Eq. (5.65) into the famous BCS form, p Z ωc tanh 12 (β ζ 2 + ∆2 ) 1 p = dζ , (5.66) N (0)V ζ 2 + ∆2 0 where N (0) is the density of states at the Fermi level. This equation determines the temperature dependence of the gap ∆, and in particular, it yields the transition temperature into the superconducting state. At the transition temperature Tc , ∆ vanishes, and therefore we have 1 = N (0)V Z ωc 0 dζ tanh 12 (βc ζ) ζ 50 =⇒ kB Tc = 1.13ωc e−1/N (0)V . (5.67) Bibliography 1. N. W. Ashcroft and N. D. Mermin, Solid State Physics, Saunders College, 1975. 51