Download ph507weeks1

Document related concepts

Outer space wikipedia , lookup

Dialogue Concerning the Two Chief World Systems wikipedia , lookup

Corona Australis wikipedia , lookup

Circumstellar habitable zone wikipedia , lookup

Corona wikipedia , lookup

Astrobiology wikipedia , lookup

Nebular hypothesis wikipedia , lookup

International Ultraviolet Explorer wikipedia , lookup

Planets beyond Neptune wikipedia , lookup

Perseus (constellation) wikipedia , lookup

Cygnus (constellation) wikipedia , lookup

Boötes wikipedia , lookup

Dyson sphere wikipedia , lookup

Lyra wikipedia , lookup

Hipparcos wikipedia , lookup

Astronomical unit wikipedia , lookup

IAU definition of planet wikipedia , lookup

Rare Earth hypothesis wikipedia , lookup

Cygnus X-1 wikipedia , lookup

Solar System wikipedia , lookup

Orrery wikipedia , lookup

Future of an expanding universe wikipedia , lookup

Observational astronomy wikipedia , lookup

Stellar kinematics wikipedia , lookup

Extraterrestrial life wikipedia , lookup

Exoplanetology wikipedia , lookup

Definition of planet wikipedia , lookup

Stellar evolution wikipedia , lookup

History of Solar System formation and evolution hypotheses wikipedia , lookup

Formation and evolution of the Solar System wikipedia , lookup

Planetary system wikipedia , lookup

CoRoT wikipedia , lookup

Ursa Minor wikipedia , lookup

Corvus (constellation) wikipedia , lookup

Star formation wikipedia , lookup

Aquarius (constellation) wikipedia , lookup

Planetary habitability wikipedia , lookup

Timeline of astronomy wikipedia , lookup

Transcript
PH507
Multi-wavelength
Professor Michael Smith
THE MULTIWAVELENGTH UNIVERSE AND EXOPLANETS
School of Physical Sciences Convenor Prof. Michael Smith
Taught in Term 2
Teaching Provision:
1
PH507
ECTS Credits 7.5
Kent Credits 15 at Level I
30 lectures + 4 workshops + 2 class tests
Prerequisites:
PH300, PH301, PH304
Aims: To provide a basic but rigorous grounding in observational, computational and
theoretical aspects of astrophysics to build on the descriptive course in Part I, and to
consider evidence for the existence of exoplanets in other Solar Systems.
Learning Outcomes:
1. An understanding of the fundamentals of making astronomical observations across the
whole electromagnetic spectrum, including discussion of photometry and spectroscopy,
and the physics of the astrophysical radiation mechanisms.
2. An understanding of the motions of objects in extrasolar systems and the basic techniques
required to solve the 2-body problem to measure their properties.
3. An understanding of observational characteristics of stars, and how their physical
structures are derived from observation and using simple physical models.
4. To be able to discuss coherently the origin and evolution of Solar Systems and be able to
evaluate claims for evidence of Solar Systems other than our own.
SYLLABUS:
•
•
•
•
•
•
Part 1: measurements
Part 2: radiation
Part 3: dynamics
Part 4: star and planet formation
Part 5: telescopes/instruments
Part 6: stars and stellar structure
Assessment Methods:
Examination 70%, Homework 10%, 1st class test 10%, 2nd class test 10%.
Recommended Texts:
Carroll & Ostlie, An Introduction to Modern Astrophysics, Addison-Wesley, [QB461]
Stuart Clark. Extrasolar Planets, Wiley Press
C.R.Kitchin. Astrophysical Techniques, Adam Hilger Press.
[Note: Changes may occur to the syllabus during the year]







Prof. Michael Smith:
101 Ingram, x7654, [email protected]
Office hours: 10-12am Wed
Bad weather
Numbers, names
Locations, times of lectures
Lecturers
PH507
Multi-wavelength
Professor Michael Smith
2
PART 1: Measurement
LECTURE 1
Distance: Distance is an easy concept to understand: it is just a length in some
units such as in feet, km, light years, parsecs etc. It has been excrutiatingly
difficult to measure astronomical distances until this century.
Unfortunately most stars are so far away that it is impossible to directly
measure the distance using the classic technique of triangulation.
Trignometric parallax: based on triangulation – need three parameters to fully
define any triangle e.g. two angles and one baseline.
To triangulate to even the closest stars we would need to use a very large
baseline. In fact we do have a long baseline, because every 6 months the earth
is on opposite sides of the sun. So we can use as a baseline the major axis of the
earth's orbit around the sun.
BASELINE: 2 x earth-sun distance = 2 Astronomical Units (AU)
(The average distance from the earth to the sun is called the Astronomical Unit.)
The parallactic displacement of a star on the sky as a result of the Earth’s orbital
motion permits us to determine the distance from the Sun to the star by the
method of trigonometric (heliocentric) parallax. We define the trigonometric
parallax of the star as the angle  subtended, as seen from the star, by the
Earth’s orbit of radius 1 AU. If the star is at rest with respect to the Sun, the
parallax is half the maximum apparent annual angular displacement of the star
as seen from the Earth.
PH507
Multi-wavelength
Professor Michael Smith
3
PH507
Multi-wavelength
Professor Michael Smith
4
PH507
Multi-wavelength
Professor Michael Smith
5
1 radian is defined as:
360
 57.3 degrees = 206265 arc seconds, approximately. There are 2
2
rad in a circle (360˚), so that 1 radian equals 57˚17´44.81” (206, 264.81”).
1 radian =
Independent distance unit is the light year:
c  t ( year )  9.47 1015 m
The light year is not used much by professional astronomers, who work instead
with a unit of similar size called the parsec, where 1 parsec = 1 pc = 206265
AU = 3.086 x 1016 m = 3.26 light years.

The measurement and interpretation of stellar parallaxes are a branch of
astrometry, and the work is exacting and time-consuming. Consider that
the nearest star, Proxima/Alpha Centauri (Rigil Kent), at a distance of 1.3
pc, has a parallax of only 0.76”; all other stars have smaller parallaxes.
PH507
Multi-wavelength
Professor Michael Smith
6
Formula:
tan p 
1AU
d
or
d 
1
AU
p
where p is in radians
for small angles.
To convert to arcseconds:
2.063 105
d 
AU
p ''
or
d 
1
pc .
p"
Technological advances (including the Hubble Space Telescope) have improved
parallax accuracy to 0.001” within a few years. Before 1990, fewer than 10,000
stellar parallaxes had been measured (and only 500 known well), but there are
about 1012 stars in our Galaxy. Space observations made by the European Space
Agency with the Hipparcos mission (1989-1993) accurately determined the
parallaxes of many more stars. Though a poor orbit limited its usefulness,
Hipparcos was expected to achieve a precision of about 0.002”. It actually
achieved 0.001” for 118,000 stars. The method of trigonometric parallax is
important because it is our only direct distance technique for stars.
The ground-based trigonometric parallax of a star is determined by
photographing a given star field from a number (about 20) of selected points in
the Earth’s orbit. The comparison stars selected are distant background stars of
nearly the same apparent brightness as the star whose parallax is being
measured. Corrections are made for atmospheric refraction and dispersion and
for detectable motions of the background stars; any motion of the star relative
to the Sun is then extracted. What remains is the smaller annual parallactic
motion; it is recognised because it cycles annually.
Because a seeing resolution of 0.25” is considered exceptional (more typical it is
1”), it may seem strange that a stellar position can be determined to ±0.01” in
one measurement; this accuracy is possible because we are determining the
centre of the fuzzy stellar image.
PH507
Multi-wavelength
Professor Michael Smith
7
In 2011 – 2013, Gaia will be set into orbit with a Soyuz rocket (and SIM Space
Interferometric Mission from the US). It will be able to measure parallaxes
of 10 micro-arcseconds. It consists of a rotating frame holding three
telescopes. Some aims:
…….Accurate distances even to the Galactic centre, 8000 parsecs away.
……..Photometry: accurate magnitudes.
……..Planet quest
……..Reference frame from distant quasars (3C273 is 800 Mpc away)
In the meantime, to go further, we construct the COSMIC LADDER.
If we can estimate the luminosity of a star from other properties, they can be
used as STANDARD CANDLES.
2 LUMINOSITY.
We can actually only measure the radiant flux of a flame and need to make a
few assumptions to find the true luminosity. Luminosity depends on the
distance and extinction (as well as relativistic effects).
The measured flux f is in units of W/m2 , the flow of energy per unit area. The
radiated power L, ignoring extinction, is given by:
f 
d2 
L
4d 2
L
4f
’
showing that a standard candle can yield the distance.
The Stellar Magnitude Scale
The first stellar brightness scale - the magnitude scale - was defined by
Hipparchus of Nicea and refined by Ptolemy almost 2000 years ago. In this
qualitative scheme, naked-eye stars fall into six categories: the brightest are of
first magnitude, and the faintest of sixth magnitude. Note that the brighter the star,
the smaller the value of the magnitude. In 1856, N. R. Pogson verified William
Herschel’s finding that a first-magnitude star is 100 times brighter than a sixthmagnitude star and the scale was quantified. Because an interval of five
magnitudes corresponds to a factor of 100 in brightness, a one-magnitude
difference corresponds to a factor of 1001/5 = 2.512. (This definition reflects the
PH507
Multi-wavelength
Professor Michael Smith
8
operation of human vision, which converts equal ratios of actual intensity to
equal intervals of perceived intensity. In other words, the eye is a logarithmic
detector). The magnitude scale has been extended to positive magnitudes
larger than +6.0 to include faint stars (the 5-m telescope on Mount Palomar can
reach to magnitude +23.5) and to negative magnitudes for very bright objects
(the star Sirius is magnitude -1.4). The limiting magnitude of the Hubble Space
Telescope is about +30.
Astronomers find it convenient to work with logarithms to base 10 rather than
with exponents in making the conversions from brightness ratios to magnitudes
and vice versa. Consider two stars of magnitude m and n with respective
apparent brightnesses (fluxes) lm and ln. The ratio of their fluxes fn / fm
corresponds to the magnitude difference m - n. Because a one-magnitude
difference means a brightness ratio of 1001/5, (m - n) magnitudes refer to a ratio
of (1001/5)m-n = 100(m-n)/5, or
fn / fm = 100(m-n)/5
Taking the log10 of both sides (because log xa = a log x and log 10a = a log 10 =
a),
log (fn / fm) = [(m - n)/5] log 100 = 0.4(m - n)
or
m - n = 2.5 log (fn / fm)
This last equation defines the apparent magnitude; note that m > n when fn >
fm, that is: brighter objects have numerically smaller magnitudes. Also note that
when the brightnesses are those observed at the Earth, physically they are
fluxes. Apparent magnitude is the astronomically peculiar way of talking
about fluxes.
Here are a few worked examples:
PH507
Multi-wavelength
Professor Michael Smith
9
(a) The apparent magnitude of the variable star RR Lyrae ranges from 7.1 to 7.8
- a magnitude amplitude of 0.7. To find the relative increase in brightness from
mini-mum to maximum, we use
log (fmax / fmin) = 0.4 x 0.7 = 0.28
so that
fmax / fmin = 100.28 = 1.91
This star is almost twice as bright at maximum light than at minimum.
(b) A binary system consists of two stars a and b, with a brightness ratio of 2;
however, we see them unresolved as a point of magnitude +5.0. We would like
to find the magnitude of each star. The magnitude difference is
mb - ma = 2.5 log (fa / fb) = 2.5 log 2 = 0.75
Since we are dealing with brightness ratios, it is not right to put ma + mb = +5.0.
The sum of the luminosities (fa + fb) corresponds to a fifth-magnitude star.
Compare this to a 100-fold brighter star, of magnitude 0.0 and luminosity l0:
PH507
Multi-wavelength
Professor Michael Smith
10
ma+ b - m0 = 2.5 log [l0 / (fa + fb)]
or
5.0 - 0.0 = 2.5 log 100 = 5.
But
fa = 2 fb, so that fb = (fa + fb)/3.
Therefore
(mb - m0) = 2.5 log (f0 / fb) = 2.5 log 300 = 2.5 x 2.477 = 6.19.
The magnitude of the fainter star is 6.19, and from our earlier result on the
magnitude difference, that of the brighter star is 5.44.
What units are used in astronomical photometry?
The well-known magnitude scale of course, which has been calibrated using standard stars whic
vary in brightness.
But how does the astronomical magnitude scale relate to other photometric units? Here we assum
unless otherwise noted, which are at least approximately convertible to lumes, candelas, and lux
1 mv=0 star outside Earth's atmosphere = 2.54 10-6 lux = 2.54 10-10 phot
Luminance: ( 1 nit =1 candela per square metre)
1 mv=0 star per sq degree outside Earth's atmosphere = 0.84E-2 nit
= 8.4 10-7 stilb
1 mv=0 star per sq degree inside clear unit airmass
= 6.9 10-7 stilb
= 0.69E-2 nit
(1 clear unit airmass transmits 82% in the visual, i.e. it dims 0.2 magnitudes)
One star, Mv=0 outside Earth's atmosphere = 2.451029 cd
Apparent magnitude is thus an irradiance or illuminance, i.e. incident flux per unit
area, from all directions. Of course a star is a point light source, and the incident
light is only from one direction.
Apparent magnitude per square degree is a radiance, luminance, intensity, or
"specific intensity". This is sometimes also called "surface brightness".
Still another unit for intensity is magnitudes per square arcsec, which is the
magnitude at which each square arcsec of an extended light source shines.
Only visual magnitudes can be converted to photometric
units. U, B, R or I magnitudes are not easily convertible
to luxes, lumens and friends, because of the different
wavelengths intervals used. The conversion factors would be
strongly dependent on e.g. the temperature of the blackbody
PH507
Multi-wavelength
Professor Michael Smith
11
radiation or, more generally, the spectral distribution of
the radiation. The conversion factors between V magnitudes
and photometric units are only slightly dependent on the
spectral distribution of the radiation.
What units are used in radiometry/infrared astronomy?
Here we're not interested in the photometric response of some detector with a
well-known passband (e.g. the human eye, or some astronomical photometer).
Instead we want to know the strength of the radiation in absolute units: watts
etc. Thus we have:
Radiance, intensity or specific intensity:
W m-2 ster-1 [Å-1]
SI unit
erg cm-2 s-1 ster-1 [Å-1]
CGS unit
photons cm-2 s-1 ster-1 [Å-1] Photon flux, CGS units
Irradiance/emittance, or flux:
W m-2 [Å-1]
SI unit
-1
erg cm-2 s-1 [Å ]
CGS unit
photons cm-2 s-1 ster-1 [Å-1] Photon flux, CGS units
Note the [A-1] within brackets. Fluxes and intensities can
be total (summed over all wavelengths) or monochromatic
("per Angstrom Å" or "per nanometer").
In Radio/Infrared Astronomy, the unit Jansky is often used as a measure of
irradiance at a specific wavelength, and is the radio astronomer's equivalence to
stellar magnitudes. The Jansky is defined as: 1 Jansky = 10-26 W m-2 Hz-1
Absolute magnitude represents a total flux, expressed in e.g. candela, or lumens.
Absolute Magnitude and Distance Modulus
So far we have dealt with stars as we see them, that is, their fluxes or apparent
magnitudes, but we want to know the luminosity of a star. A very luminous
star will appear dim if it is far enough away, and a low-luminosity star may
look bright if it is close enough. Our Sun is a case in point: if it were at the
distance of the closest star (Alpha Centauri), the Sun would appear slightly
fainter to us than Alpha Centauri does. Hence, distance links fluxes and
luminosities.
PH507
Multi-wavelength
Professor Michael Smith
12
The luminosity of a star relates to its absolute magnitude, which is the
magnitude that would be observed if the star were placed at a distance of 10 pc
from the Sun. (Note that absolute magnitude is the way of talking about
luminosity peculiar to astronomy). By convention, absolute magnitude is
capitalised (M) and apparent magnitude is written lowercase (m). The inversesquare law of radiative flux links the flux f of a star at a distance d to the
luminosity F it would have it if were at a distance D = 10 pc:
F / f = (d / D)2 = (d / 10) 2.
If M corresponds to L and m corresponds to luminosity l, then
m - M = 2.5 log (F / f ) = 2.5 log (d/10)2 = 5 log (d / 10)
Expanding this expression, we have useful alternative forms: since
m1  m2  2.5 log
d1
 5 log d1  5 log d2 ,
d2
defining the absolute magnitude m2 = M at d2 = 10 pc, so m1 = m and d2 = d,
m - M = 5 log d - 5
M = m + 5 - 5 log d
In terms of the parallax,
M = m + 5 + 5 log p”
Here d is in parsecs and p” is the parallax angle in arc seconds.
PH507
Multi-wavelength
Professor Michael Smith
13
The quantity m - M is called the distance modulus, for it is directly related to
the star’s distance. In many applications, we refer only to the distance moduli
of different objects rather than converting back to distances in parsecs or lightyears.
Magnitudes at Different Wavelengths
The kind of magnitude that we measure depends on how the light is filtered
anywhere along the path of the detector and on the response function of the
detector itself. So that problem comes down to how to define standard
magnitude systems.
PH507
Multi-wavelength
Professor Michael Smith
14
Magnitude Systems
Detectors of electromagnetic radiation (such as the photographic plate, the
photoelectric photometer, and the human eye) are sensitive only over given
wavelength bands. So a given measurement samples but part of the radiation
arriving from a star.
Four images of the Sun, made using (a) visible light, (b) ultraviolet light, (c) X rays,
and (d) radio waves. By studying the similarities and differences among these views of
the same object, important clues to its structure and composition can be found.
Because the flux of starlight varies with wavelength, the magnitude of a star
depends upon the wavelength at which we observe. Originally, photographic
plates were sensitive only to blue light, and the term photographic magnitude
(mpg) still refers to magnitudes centred around 420 nm (in the blue region of
the spectrum). Similarly, because the human eye is most sensitive to green and
yellow, visual magnitude (mv) or the photographic equivalent photo visual
magnitude (mpv) pertains to the wave-length region around 540 nm.
Today we can measure magnitudes in the infrared, as well as in the ultraviolet,
by using filters in conjunction with the wide spectral sensitivity of photoelectric
photometers. So systems of many different magnitudes (colour combinations)
are possible. In general, a photometric system requires a detector, filters, and a
PH507
Multi-wavelength
Professor Michael Smith
15
calibration (in energy units). The properties of the filters are typified by their
effective wavelength, 0, and bandpass, ∆ which is defined as the full width at
half maximum in the transmission profile. The three main filter types are wide
(∆≈ 100 nm), intermediate (∆≈ 10 nm), and narrow (∆≈1 nm). There is a
trade-off for the bandwidth choice: a smaller ∆ provides more spectral
information but admits less flux into the detector, resulting in longer
integration times. For a given range of the spectrum, the design of the filters
makes the greatest difference in photometric magnitude systems.
A commonly used wide-band magnitude system is the UBV system: a
combination of ultraviolet (U), blue (B), and visual (V) magnitudes, developed
by H. L. Johnson. These three bands are centred at 365, 440, and 550 nm; each
wavelength band is roughly 100 nm wide. In this system, apparent magnitudes are
denoted by B or V and the corresponding absolute magnitudes are sub-scripted:
MB or MV.
To be useful in measuring fluxes, the photometric system must be calibrated in
energy units for each of its bandpasses. This calibration turns out to be the
hardest part of the job. In general, it relies first on a set of standard stars that
define the magnitudes, for a particular filter set and detector; that is, these stars
define the standard magnitudes for the photometric system to the precision with
which they can be measured.
Infrared Windows
The UBV system has been extended into the red and infrared (in part because of
the development of new detectors, such as CCDs, sensitive to this region of the
spectrum). The extensions are not as well standardised as that for the Johnson
UBV system, but they tend to include R and I in the far red and J, H, K, L, and
PH507
Multi-wavelength
Professor Michael Smith
16
M in the infrared.
As well as measuring the properties of individual stars at different
wavelengths, observing at loner wavelengths, particularly in the infrared,
allows us to probe through clouds of small solid dust particles, as seen below
A visible-light (left) vs. 2MASS infrared-light (right) view of the central regions of the
Milky Way galaxy graphically illustrating the ability of infrared light to penetrate
the obscuring dust. The field-of-view is 10x10 degrees
Infrared passbands which allow transmission (low absorption):
J Band: 1.3 microns
H Band: 1.6 microns
K band: 2.2 microns
L band 3.4 microns
M band 5 microns
N band 10.2 microns
Q band 21 microns
Bolometric magnitudes can be converted to total radiant energy flux: One star
of Mbol = 0 radiates 2.97 1028 Watts.
System is defined by Vega at 7.76 parsecs from the Sun with an apparent
magnitude defined as zero.
With Lbol = 50.1 Lsolar and Mbol = 0.58.
Sun: mbol = -26.8
Full moon: -12.6
Venus: -4.4
Sirius: -1.55
Brightest quasar: 12.8
For Vega: mb = mv = 0. mk = +0.02
Sun: Mb = 5.48, Mv = 4.83, Mk = 3.28
Colour Index: B-V, J-H, H-K are differences in magnitude….flux ratios.
PH507
Multi-wavelength
Professor Michael Smith
17
But cooler, redder objects possess higher values.
Extinction
Interstellar Medium modifies the radiation. Dust particles with size of order of
the wavelength of the radiation.
Blue radiation is strongly scattered compared to red: blue reflection nebulae
and reddened stars.
Colour Excess:
measures the reddening.
E(B-V) = B-V - (B-V)o
Modified distance modulus:
m() = M() + 5 log d – 5 + A()
where A () is the extinction due to both scattering and absorption, strongly
wavelength dependent. The optical depth is given by
exp(  ) 
I
Io
.
Therefore A() = 1.086 

The optical depth is

where N is the total column density of dust (m-2) between the star and the
observer and is the scattering/absorption cross-section (m2).
ISM Law related extinction to reddening:
Av / E(B-V) = 3.2 + - 0.2
Spectroscopic Parallax
Hertzsprung-Russell deduced the main-sequence stars for nearby objects,
relating their luminosity to their colour. Groups of distant stars should
also\line along the same main-sequence strip. However they appear very
dim,\of course due to their distance. On comparison of fluxes, we determine
the distance. This works out to about 100,000 pc, beyond which main-sequence
stars are too \dim.
Cepheids as Standard Candles: The Period-Luminosity
Relationship
Cepheids show an important connection between period and luminosity: the
pulsation period of a Cepheid variable is directly related to its median
PH507
Multi-wavelength
Professor Michael Smith
18
luminosity. This relationship was first discovered from a study of the variables
in the Magellanic Clouds, two small nearby companion galaxies to our Galaxy
that are visible in the night sky of the southern hemisphere. To a good
approximation, you can consider all stars in each Magellanic Cloud to be at the
same distance. Henrietta Leavitt, working at Harvard in 1912, found that the
brighter the median apparent magnitude (and so the luminosity, since the stars
are the same distance), the longer the period of the Cepheid variable. A linear
relationship was found.
Harlow Shapley recognised the importance of this period-luminosity (P-L)
relation-ship and attempted to find the zero point, for then a knowledge of the
period of Cepheid would immediately indicate its luminosity (absolute
magnitude).
This calibration was difficult to perform because of the relative scarcity of
Cepheids and their large distances. None are sufficiently near to allow a
trigonometric parallax to be determined, so Shapley had to depend upon the
relatively inaccurate method of statistical parallaxes. His zero point was then
used to find the distances to many other galaxies. These distances are revised
as new and accurate data become available. Right now, some 20 stars whose
distances are known reasonably well (because they are in open clusters) serve
as the calibrators for the P-L relationship.
Further work showed that there are two types of Cepheids, each with its own
separate, almost parallel P-L relationship.
PH507
Multi-wavelength
Professor Michael Smith
19
The classical Cepheids are the more luminous, of Population I, and found in
spiral arms. Population II Cepheids, also known as W Virginis stars after their
prototype, are found in globular clusters and other Population II systems.
Classical Cepheids have periods ranging from one to 50 days (typically five to
ten days) and range from F6 to K2 in spectral class.
Population II Cepheids vary in period from two to 45 days (typically 12 to 20
days) and range from F2 to G6 in spectral class.
Population I and II Cepheids are both regular, or periodic, variables; their
change in luminosity with time follows a regular cycle.
Cepheids are bright and distinct. They can be used to determine distances to
quite distant galaxies, to about 5 Mpc. HST stretched this to 18 Mpc (Virgo
cluster).
Tully-Fisher Relation
In a spiral galaxy, the centripetal force of gas and stars balances the
gravitational force:
PH507
Multi-wavelength
Professor Michael Smith
20
mV2/R = GmM/R2.
If they have the same surface brightness ( L/R2 is constant) and the same massto-light ratio (M/L is constant), then
L ~ V4. So, provided we can measure V, certain galaxies can be used as standard
candles. (determine V through the 21 cm line of atomic hydrogen in the galaxy).
Type 1a Supernovae.
The peak light output from these supernovae is always about M b = -19.33 +0.25. Therefore we can infer the distance from the inverse square law. Being so
bright , they act as standard cadles to large distances: to 1000 Mpc.
Why are they standard candles? White dwarfs I binary systems. Material from\
a companion red giant is dumped on the white dwarf surface until the WD
reaches a critical mass (Chandrasekhar mass) of 1.4 solar masses. Explosion
occurs with fixed rise and fall of luminosity.
Other methods: time delay of light rays due to gravitational lensing, cluster size
influences Compton scattering of CMB radiation and bremsstrahlung emission
(X-rays). Combining, yields the size estimate (Sunyaev-Zeldovich effect). Or,
rotational properties of stars with starspots…….
New Method?
Reverse argument: knowing the Hubble constant is 72 km/s/Mpc, (WMAP
result), distances can be found directly from the redshift!
PH507
Multi-wavelength
Professor Michael Smith
Questions
How do we scale the solar system?
How do we find the distance to gas clouds?
PLANET REVIEW
The Terrestrial Solar System
21
PH507
Multi-wavelength
Professor Michael Smith
22
In the picture above we see the positions of the asteroid belt (green) and other nearearth objects
The material in the plane of the Solar System is known as the Kuiper Belt. Surrounding
PH507
Multi-wavelength
Professor Michael Smith
23
this is a much larger region known as the Oort Cloud, that contains material that
occasionally falls in, under the influence of gravity, towards the Sun as comets.
The Sun
At over 1.4 million kilometers (869,919 miles) wide, the Sun contains 99.86
percent of the mass of the entire solar system: well over a million Earths
could fit inside its bulk. The total energy radiated by the Sun averages 383
billion trillion kilowatts, the equivalent of the energy generated by 100 billion
tons of TNT exploding each and every second.
Planetary configurations
• Some of the definitions below make the assumption of coplanar circular orbits. True
planetary orbits are ellipses with low eccentricity and inclinations are small so the
concepts are applicable in real cases.
• Copernicus correctly stated that the farther a planet lies from the Sun, the slower it
moves around the Sun. When the Earth and another planet pass each other on the
same side of the Sun, the apparent retrograde loop occurs from the relative motions
of the other planet and the Earth.
PH507
Multi-wavelength
Professor Michael Smith
24

As we view the planet from the moving Earth, our line of sight reverses its
angular motion twice, and the three-dimensional aspect of the loop comes about
because the orbits of the two planets are not coplanar. This passing situation is
the same for inferior or superior planets.

A Retrograde loop occurs when a superior planet moves through opposition, and
occurs as the earth's motion about its orbit causes it to overtake the slower
moving superior planet. Thus close to opposition, the planet's motion relative to
PH507
Multi-wavelength
Professor Michael Smith
fixed background stars, follows a small loop.
25
PH507
Multi-wavelength
Professor Michael Smith
26
Lecture 4
Mass can be measured in two ways. We could count up the atoms, or count up
the molecules and grains of dust and infer the number of atoms. This method
can be used if the object is optically thin and we have good tracer: a radiation
or scattering mechanism in which the number of photons is related to the
number of particles.
Otherwise, measuring the mass of an object relies upon its gravitational
influence….on nearby bodies or on itself (self-gravity).
Newton’s second law states: F = m a, while the first law relates the acceleration
to a change is speed or direction.
Kepler’s empirical laws for orbital motion thus describe the nature of the
acceleration from which masses can be derived.
Kepler's Laws
First Law: The orbit of each planet is an ellipse with the Sun at one focus
p
b
F
r
S f q
a
C
Q
S = Sun, F = other focus, p = planet.
r = HELIOCENTRIC DISTANCE.
f = TRUE ANOMALY
a = SEMI-MAJOR AXIS = mean heliocentric distance),
size of the orbit.
b = SEMI-MINOR AXIS.
e = ECCENTRICITY,
defines shape of orbit.
Ellipse:
SP + PF = 2a
(1)
e = CS / a
(2)
Therefore
b2 = a2(1-e2)
(3)
which defines the
PH507
Multi-wavelength
•When CS = 0,
When CS = ,
e = 0,
e = 1,
Professor Michael Smith
27
b = a, the orbit is a circle.
the orbit is a parabola.
• q = PERIHELION DISTANCE = a - CS = a – ae
q = a(1-e)
(4)
• Q = APHELION DISTANCE = a + CS = a + ae
Q = a(1+e)
(5)
Second Law: For any planet, the radius vector sweeps out equal areas in
equal times
• Time interval t for planet to travel from p to p1 is the same as time taken for
planet to get from p2 to p3. Shaded areas are equal.
• Let the time interval t be very small. Then the arc from p to p1 can be
regarded as a straight line and the area swept out is the area of the triangle
S p p1. If f1 is the angle to p1, and f is the angle to p:
p1
p2
p3
r1
p
r
f
S
i.e
Area = 1/2 r r1 Sin (f1-f).
Since t is very small, r ~ r1 and Sin (f1-f) ~ (f1-f) = f
Area = 1/2 r2 f
The rate this area is swept out is constant according to Kepler's second
law, so
r2 df/dt = h
(6)
where h, a constant, is twice the rate of description of area by the radius
vector. It is the orbital angular momentum (per unit mass.
The total area of the ellipse is πab which is swept out in the orbital period
PH507
Multiwavelength
28
P, so using eqn (7)
2ab/P = h.
The average angular rate of motion is n = 2/P, so
n a2(1-e2)1/2 = h
(7)
Kepler’s Third Law
Kepler's third law took another ten years to develop after the first two. This law
relates the period a planet takes to travel around the sun to its average distance and
the Sun. This is sometimes called the semi major axis of an elliptical orbit.
P 2 = ka3
where P is the period and a is the average distance from the Sun.
Kepler’s Third Law follows from the central inverse square nature of the law of
gravitation. First look at Newton's law of gravitation - stated
mathematically this is
F
Gm1 m2
r2
Newton actually found that the focus of the elliptical orbits for two bodies of masses m1 and
m2 is at the centre of mass. The centripetal forces of a circular orbit are
r1
F1
v2
X
Centre of M ass
m1
m2
v1
F2
r2
2
F1
and
m1 v1
4 2 m1 r1


r1
P2
PH507
Multiwavelength
29
2
F2
m2 v2
4 2 m2 r2


r2
P2
where
v
2r
P
and since they are orbiting each other (Newton’s 2nd law)
r1
m2

r2
m1
Let's call the separation a = r1 + r2. Then;
a  r1 
m1 r1
 m1

 r1 
 1 and multiplying both sides by m2 , am2  m1 r1  m2 r1
m2
m2
or, solving for r1 ,
r1 
am2
m1  m2 
Now, since we know that the mutual gravitational force;
Fgrav  F1  F2 
then substituting for r1,
3
a 
Gm1 m2
2
a
G m1  m2  P2
42
Solving for P:
P  2
a3
G M1  M2 
Third Law is therefore: The cubes of the semi-major axes of the planetary
orbits are proportional to the squares of the planets' periods of
revolution.
Example
Europa, one of the Jovian moons, orbits at a distance of 671,000 km from the centre
of Jupiter, and has an orbital period of 3.55 days. Assuming that the mass of Jupiter
is very much greater than that of Europa, use Kepler's third law to estimate the
mass of Jupiter.
Using Kepler's third law:
PH507
Multiwavelength
30
m jupiter  meuropa
4 2 a 3

GP 2
The semi-major axis, a = 6.71 x 105 km = 6.71 x 108 m, and
the period, P = 3.55 x 3600 x 24 = 3.07 x 105 seconds
Thus:
m jupiter  meuropa 
c
4 2 6.71  108
h
3
 19
.  1027 kg
c6.67  10 hc3.07  10 h
 11
5 2
and since mjupiter >> meuropa, then mjupiter ~ 1.9 x 1027 kg.
Summary of Kepler’s Laws
Summary: Measuring the mass of a planet
• Kepler’s third law gives G(M+m) =  a3/P2
Since M >> m for all planets, it isn't possible to make precise enough
determinations of P and a to determine the masses m of the planets.
However, if satellites of planets are observed, then Kepler's law can be used.
• Let mp = mass of planet
ms = mass of satellite
Ps = orbital period of satellite
as = semi-major axis of satellite's orbit
about the planet.
PH507
Then:
Multiwavelength
31
G(mp+ms) = 42 as3/Ps2
If the mass of the satellite is small compared with the mass of the planet then
mp = 42 as3/(G Ps2)
• All the major planets have satellites except Mercury and Venus. Their
masses were determined from orbital perturbations on other bodies and
later, more accurately from changes in the orbits of spacecraft.
So: we can determine the masses of massive objects if we can detect and follow the
motion of very low mass satellites. That doesn’t lead very far. How can we determine
the masses of distant stars and exoplanets?
BASIC STELLAR PROPERTIES - BINARY STARS
• For solar type stars, single:double:triple:quadruple system ratios are
45:46:8:1.
• Binary nature of stars deduced in a number of ways:
VISUAL BINARIES:
- Resolvable, generally nearby stars (parallax likely to be available)
- Relative orbital motion detectable over a number of years
ASTROMETRIC BINARY: only one component detected
SPECTROSCOPIC BINARIES:
- Unresolved
- Periodic oscillations of spectral lines (due to Doppler shift)
- In some cases only one spectrum seen
SPECTRUM BINARY: 2 sets of lines but no apparent orbital motion but
spectrum is clearly combined from stars of differing spectral class.
ECLIPSING BINARY:
- Unresolved
- Stars are orbiting in plane close to line of sight giving eclipses
observable as a change in the combined brightness with time (‘’light
curves).
Some stars may be a combination of these.
Visual Binaries
• Angular separation ≥ 0.5 arcsec (close to Sun, long orbital periods - years) –
example Sirius:
PH507
Multiwavelength
32
• Observations:
Relative positions:
 = angular
 = position
Absolute positions: Harder to measure orbits of more massive star A and
separation less massive star B about centre
of mass C which has proper motion µ.
Declination
N
M otion of centre of mass
= proper motion µ
Secondary

E

Right Ascension
B
Primary
C
A
NB parallax and aberration must also be accounted for.
• RELATIVE ORBITS:
- TRUE orbit:
q = peri-astron distance (arcsec or km)
Q = apo-astron distance (arcsec or km)
a = semi-major axis (arcsec or km)
a = (q + Q)/2
- APPARENT orbits are projected on the celestial sphere
Inclination i to plane of sky defines relation between true orbit and apparent
orbit. If i≠0° then the centre of mass (e.g. primary) is not at the focus of the
elliptical orbit.
Measurement of the displacement of the primary gives inclination and true
semi-major axis in arcseconds a".
PH507
Multiwavelength
33
i
i
Incline by 45°
Apparent orbit
True orbit
• If the parallax p in arcseconds is observable then a can be derived from a".
Earth
B
radius
of Earth's
orbit
a
a"
Sun
For i=0°
p
r = distance of binary star
a = 1 AU . a"/p"
A
(In general correction for i≠0 required).
Now lets go back to Kepler’s Law …
• From Kepler's Law, the Period P is given by
2 3
2
P =
4 a
G (mA + mB)
(26)
For the Earth-Sun system P=1 year, a=1 A.U., mA+mB~msun so 4π2/G = 1
3
a
P =
(mA + mB)
2
P in years, a in AU, mA,mB in solar masses.
From (25) and (26),
a" 3 1
mA + mB = ( )
p P2
Sum of masses is determined
• ABSOLUTE ORBITS:
d
c
rA
A
*
B
rB
e
f
(27)
B
q
A
Q
PH507
Multiwavelength
34
Semi-major axes aA = (c+e)/2 Minimum separation = q = d + e
aB = (d+f)/2
Maximum separation = Q = c + f
So aA + aB = (c+d+e+f)/2 = (q + Q)/2 = a
a = aA + a B
(28)
(and clearly r =rA + rB)
From the definition of centre of mass, mA rA = mB rB ( mA aA = mB aB)
mA/mB = aB/aA = rB/rA
(29)
So from Kepler’s Third Law, which gives the sum of the masses, and Equation
(29) above, we get the ratio of masses, ==> mA, mB. Therefore, with both, we
can solve for the individual masses of the two stars.
Spectroscopic Binaries
PH507
Multiwavelength
35
• Orbital period relatively short (hours - months) and i≠0°.
• Doppler shift of spectral lines by component of orbital velocity in line of sight
(nominal position is radial velocity of system):
wavelength
wavelength
Time
Time
2 Stars observable
1 Star observable
PH507
Multiwavelength
36
• Data plotted as RADIAL VELOCITY CURVE:
recession
+
v
(km s-1)
recession
+
0
time
approach
approach
3
2
4
1
3
4
1
3
Observer
4
2
time
-
ABSOLUTE
Relative Orbit
2
v
(km s-1) 0
RELATIVE
Relative •radial
velocity
If the
orbit iscurve
tilted to the line of
2sight (i<90°), the shape is unchanged
but velocities are reduced by a factor
1
sin i.
v
1 • Take3 a circular orbit with i=90°
a = rA + rB
v = v A + vB
4
Orbital velocities:
2
vA = 2π rA / P
1
v
v
=
2π
r
/
P
B
B
3
1
Since mA rA = mB rB
4
2
mA/mB = rB/rA = vB/vA (31)
1
rB
v
1
3
vA
r4
1
v
=
rA
v
B
• Shape of radial velocity curves
depends on orbital eccentricity
and orientation.
• In general, measured velocities are vB sin i and vA sin i, so sin i terms cancel.
• From Kepler's law
mA + mB = a3/P2 (in solar units).
Observed quantities: vA sin i => rA sin i
} a sin i
vB sin i => rB sin i
So can only deduce (mA + mB) sin3 i = (a sin i)3/P2
(32)
For a spectroscopic binary, only lower limits to each mass can be derived,
unless i is known independently.
Eclipsing Binaries
• Since stars eclipse i ~ 90°
PH507
Astrophysics
Professor Glenn White
37
• For a circular orbit:
1, 1' FIRST CONTACT
2, 2' SECOND CONTACT
3, 3' THIRD CONTACT
4, 4' FOURTH CONTACT
4' 3'
2' 1'
v
1 2
3 4
Observer in plane
• Variation in brightness with time is LIGHTCURVE.
• Timing of events gives information on sizes of stars and orbital elements.
• Shape of events gives information on properties of stars and relative
temperatures.
PH507
Multiwavelength
38
Case 1
Smaller star is hotter
Case 2
Larger star is hotter
F
or
magnitude
Secondary minimum
Primary minimum
time
Case 1 t'1
Case 2 t 1
t'2
t2
t'3
t3
t'
4
t4
t1
t'1
t2
t'2
t3
t'3
t4
t'4
PH507
Multiwavelength
39
• If orbits are circular: minima are symmetrical ie t2-t1 = t4-t3 = t2'-t1' = t4'-t3';
minima are half a period apart; eclipses are of same duration.
Assymetrical and/or unevenly spaced minima indicate eccentricity and
orientation of orbit.
• For a circular orbit:
t1 t2
t3 t 4
Distance = velocity x time
2RS = v (t2 - t1) (33)
2RL
and
2RS + 2RL = v (t4 - t1) => 2RL =
v(t4 - t2)
(34)
2RS
and ratio of radii
RS/RL = (t2 - t1) / (t4 - t2)
• Lightcurves are also affected by:
Non-total eclises
No flat minimum
Limb darkening
(non-uniform
brightness)
"rounds off"
eclipses
Ellipsoidal stars
(due to
proximity)
"rounds off"
maxima
Reflection effect
(if one star is
very bright)
(35)
PH507
Multiwavelength
40
Eclipsing-Spectroscopic Binaries
• For eclipsing binaries i ≥ 70°
(sin3i > 0.9)
• If stars are spectroscopic binaries then radial velocities are known.
So from eqns (31) and (32) masses are derived,
from eqns (33) and (34) radii are derived,
from (36) ratio of temperatures is derived
Examine spectra and lightcurve to determine which radius corresponds with which
mass and temperature:
+
v
-
B
From radial velocity curve
star A is more massive
A
Initially A is approaching (blue shift)
so first eclipse is A in front of B
Since first eclipse is primary eclipse
B is hotter than A
F
If 2 sets of lines are seen then B is larger
If 1 set of lines is seen then A is larger
time
• Since Luminosity L = 4 R2 T4, the ratio of Luminosities is derived from
(TO BE DISCUSSED LATER IN COURSE!)
LA
LB
=
Summary
Type
Visual
Spectroscopic
Eclipsing
Eclipsing/
LA/LB
spectroscopic
RA
RB
2
TA
4
TB
Observed
p, motion on sky
Apparent magnitudes
Derived
a, e, i, mA, mB
LA, LB
velocity curves
lightcurves
light + velocity curves
MA/MB, (MA+MB)sin3i, a sin i
e, i, RS/RL
MA, MB, RA, RB, TA/TB, a, e, i,
distance
LA, LB, TA, T
164PH507
Multiwavelength
41
Lecture 6: Extrasolar Planets
134 other stars are now known to possess planetary systems. 157 planets have
been discovered. Although none of the stars has been directly imaged, the
effects of the gravity tugging at the stars, as well as the way that gravitation
affects can affect material close to the stars, has been clearly seen.
Disc of material around the star Beta Pictoris – the image of the bright central
star has been artificially blocked out by astronomers using a ‘Coronograph’
• How can we discover extrasolar planets?
• Characteristics of the exoplanet population
• Planet formation
• Explaining the properties of exoplanets
Rapidly developing subject - first extrasolar planet around an ordinary star only
discovered in 1995 by Mayor & Queloz.
Observations thought to be secure, but theory still preliminary...
Resources. For observations, a good starting point is Berkeley extrasolar planets search
homepage
http://exoplanets.org/
Theory: Annual Reviews article by Lissauer (1993) is a good summary of the state of
theory prior to the discovery of extrasolar planets
Definition of a planet
Simplest definition is based solely on mass
164PH507
Multiwavelength
42
• Stars: burn hydrogen (M > 0.075 Msun)
• Brown dwarfs: burn deuterium
• Planets: do not burn deuterium (M < 0.013 Msun)
Deuterium burning limit occurs at around 13 Jupiter masses (1 MJ = 1.9 x 1027 kg ~
0.001 Msun It is important to realise that for young objects, there is no large change in
properties at the deuterium burning limit. ALL young stars / brown dwarfs / planets
liberate gravitational potential energy as they contract
Types of planet
Giant planets (gas giants, `massive’ planets)
• Solar System prototypes: Jupiter, Saturn, Uranus...
• Substantial gaseous envelopes
• Masses of the order of Jupiter mass
• In the Solar System, NOT same composition as Sun
• Presence of gas implies formation while gas was still prevelant
Terrestrial planets
• Prototypes: Earth, Venus, Mars
• Primarily composed of rocks
• In the Solar System (ONLY) orbital radii less than giant planets
Much more massive terrestrial planets could exist (>10 Earth masses), though none are
present in the Solar System. The Solar system also has asteroids, comets, planetary
satellites and rings - we won’t discuss those in this course.
Detecting extrasolar planets
(1) Direct imaging - difficult due to enormous star / planet flux ratio
(2) Radial velocity
• Observable: line of sight velocity of star orbiting centre of mass of star - planet
binary system
• Most successful method so far - all detections to date
(3) Astrometry
164PH507
Multiwavelength
43
• Observable: stellar motion in plane of sky
• Very promising future method: Keck interferometer, GAIA, SIM
(4) Transits
• Observable: tiny drop in stellar flux as planet transits stellar disc
• Requires favourable orbital inclination
• Jupiter mass exoplanet observed from ground HD209458b
• Earth mass planets detectable from space (Kepler (2007 launch. NASA
Discovery mission), Eddington)
(5) Gravitational lensing
• Observable: light curve of a background star lensed by the gravitational
influence of a foreground star. The light curve shape is sensitive to whether the
lensing star is a single star or a binary (star + planet is a special case of the
binary)
• Rare - requires monitoring millions of background stars, and also unrepeatable
• Some sensitivity to Earth mass planets
Each method has different sensitivity to planets at various orbital radii - complete
census of planets requires use of several different techniques
164PH507
Multiwavelength
44
Planet detection method : Radial velocity technique
A planet in a circular orbit around star with semi-major axis a
Assume that the star and planet both rotate around the centre of mass with an angular
velocity:
Using a1 M* = a2 mp and a = a1 + a2, then the stellar speed (v* = a ) in an inertial
frame is:
(assuming mp << M*). i.e. the stellar orbital speed is small.
For a circular orbit, observe a sin-wave variation of the stellar radial velocity, with an
amplitude that depends upon the inclination of the orbit to the line of sight:
Hence, measurement of the radial velocity amplitude produces a constraint on:
mp sin(i)
(assuming stellar mass is well-known, as it will be since to measure radial velocity we
need exceptionally high S/N spectra of the star).
Observable is a measure of mp sin(i).
-> given vobs, we can obtain a lower limit to the planetary mass
In the absence of other constraints on the inclination, radial velocity searches
provide lower limits on planetary masses
Magnitude of radial velocity:
Sun due to Jupiter:
Sun due to Earth:
12.5 m/s
0.1 m/s
164PH507
Multiwavelength
45
i.e. extremely small -
10 m/s is Olympic 100m running pace
Spectrograph with a resolving power of 105 will have a pixel scale ~ 10-5 c ~ few km/s
Therefore, specialized techniques that can measure radial velocity shifts of ~10-3 of
a pixel stably over many years are required
High sensitivity to small radial velocity shifts is achieved by:
• comparing high S/N = 200 - 500 spectra with template stellar spectra
• using a large number of lines in the spectrum to allow shifts of much less than
one pixel to be determined.
Absolute wavelength calibration and stability over long timescales is achieved by:
• passing stellar light through a cell containing iodine, imprinting large number
of additional lines of known wavelength into the spectrum
• with the calibrating data suffering identical instrumental distortions as the data
Error sources:
(1) Theoretical: photon noise limit
• flux in a pixel that receives N photons uncertain by ~ N1/2
• implies absolute limit to measurement of radial velocity
• depends upon spectral type - more lines improve signal
• around 1 m/s for a G-type main sequence star with spectrum recorded at
S/N=200
• practically, S/N=200 can be achieved for V=8 stars on a 3m class
telescope in survey mode
(2) Practical:
• stellar activity - young or otherwise active stars are not stable at the m/s
level and cannot be monitored with this technique
• remaining systematic errors in the observations
Currently, the best observations achieve:
 ~ 3 m/s
...in a single measurement. Thought that this error can be reduced to around 1 m/s with
further refinements, but not substantially further. The very highest Doppler precisions
of 1 m/s are capable\of detecting planets down to about 5 earth masses.
Radial velocity monitoring detects massive planets, especially those at small a, but
is not sensitive enough to detect Earth-like planets at ~ 1 AU.
164PH507
Multiwavelength
46
Examples of radial velocity data
51 Peg b was the first known exoplanet with a 4 day, circular orbit: a hot Jupiter, \lying
close to the central star.
Example of a planet with an eccentric orbit: e=0.67
164PH507
Multiwavelength
47
Summary: observables
(1) Planet mass, up to an uncertainty from the normally unknown inclination of
the orbit. Measure mp sin(i)
(2) Orbital period -> radius of the orbit given the stellar mass
(3) Eccentricity of the orbit
Summary: selection function
Need to observe full orbit of the planet: zero sensitivity to planets with P > Psurvey
For P < Psurvey, minimum mass planet detectable is one that produces a radial velocity
signature of a few times the sensitivity of the experiment (this is a practical detection
threshold)
Which planets are detectable?
m p sin i  a
1
2
164PH507
Multiwavelength
48
Current limits:
• Maximum a ~ 3.5 AU (ie orbital period ~ 7 years)
• Minimum mass ~ 0.5 Jupiter masses at 1 AU, scaling with square root of semimajor axis
• No strong selection bias in favour / against detecting planets with different
eccentricities
Of the first 100 stars found to harbor planets, more than 30 stars host a Jupiter-sized
world in an orbit smaller than Mercury's, whizzing around its star in a matter of days.
Planet formation is a contest, where a growing planet must fight for survival lest it be
swallowed by the star that initially nurtured it.
Planet detection method : Astrometry
Conceptually identical to radial velocity searches. Light from a planet-star binary is
dominated by star. Measure stellar motion in the plane of the sky due to presence
of orbiting planet. Must account for parallax and proper motion of star.
Magnitude of effect: amplitude of stellar wobble (half peak displacement) for an orbit in
the plane of the sky is
 mp 
  a
a1  
 M* 
In terms of the angle:
 m p  a 
 
  
M
 *  d 
for a star at distance d. Note we have again used mp << M*
Writing the mass ratio q = mp / M*, this gives:
164PH507
Multiwavelength
49
Note:
• Units here are milliarcseconds - very small effect
• Different dependence on a than radial velocity method - astrometric planet
searches are more sensitive at large a
• Explicit dependence on d (radial velocity measurements also less sensitive for
distant stars due to lower S/N spectra)
• Detection of planets at large orbital radii still requires a search time comparable
to the orbital period
Detection threshold as function of semi-major axis
• Lack of units deliberate! Astrometric detection not yet achieved
• As with radial velocity, dependence on orbital inclination, eccentricity
• Very promising future: Keck interferometer, Space Interferometry Mission
(SIM), ESA mission GAIA, and others
• Planned astrometric errors at the ~10 microarcsecond level – good enough to
detect planets of a few Earth masses at 1 AU around nearby stars
Lecture 7:
Planet detection method : Transits
Simplest method: look for drop in stellar flux due to a planet
transiting across the stellar disc
Needs luck or wide-area surveys - transits only occur if the orbit is almost edge-on
For a planet with radius rp << R*, probability of transits is:
164PH507
Multiwavelength
50
Close-in planets are more likely to be detected. P = 0.5 % at 1AU, P = 0.1 % at the
orbital radius of Jupiter
What can we measure from the light curve?
(1)
Depth of transit = fraction of stellar light blocked
This is a measure of planetary radius!
In practice, isolated planets with masses between ~ 0.1 MJ and 10 MJ, where MJ
is the mass of Jupiter, should have almost the same radii (i.e. a flat mass-radius
relation).
-> Giant extrasolar planets transiting solar-type stars produce transits
with a depth of around 1%.
Close-in planets are strongly irradiated, so their radii can be (detectably) larger.
But this heating-expansion effect is not generally observed for short-period
planets.
(2)
(3)
(4)
Duration of transit plus duration of ingress, gives measure of the orbital radius
and inclination
Bottom of light curve is not actually flat, providing a measure of stellar limbdarkening
Deviations from profile expected from a perfectly opaque disc could provide
evidence for satellites, rings etc
Photometry at better than 1% precision is possible (not easy!) from the ground.
HST reached a photometric precision of 0.0001.
Potential for efficient searches for close-in giant planets
Transit depth for an Earth-like planet is:
164PH507
Multiwavelength
51
Photometric precision of ~ 10-5 seems achievable from space
May provide first detection of habitable Earth-like planets
NASA’s Kepler mission, ESA version Eddington
A reflected light signature must also exist, modulated on the orbital period,
even for non-transiting planets. No detections yet.
Planet detection method : Gravitational microlensing
Light is deflected by gravitational field of stars, compact objects, clusters of galaxies,
large-scale structure etc
Simplest case to consider: a point mass M (the lens) lies along the line of sight to a
more distant source
Define:
• Observer-lens distance
• Observer-source distance
• Lens-source distance
Azimuthal symmetry -> light from the source appears as a ring
...with radius R0 - the Einstein ring radius - in the lens plane
Gravitational lensing conserves surface brightness, so the distortion of the image of
the source across a larger area of sky implies magnification.
The Einstein ring radius is given by:
Dl
Ds
Dls
164PH507
Multiwavelength
52
Suppose now that the lens is moving with a velocity v. At time t, the apparent distance
(in the absence of lensing) in the lens plane between the source and lens is r0.
Defining u = r0 / R0, the amplification is:
Note: for u > 0, there is no symmetry, so the pattern of images is not a ring and is
generally complicated. In microlensing we normally only observe the magnification A,
so we ignore this complication...
Notes:
(1) The peak amplification depends upon the impact parameter, small impact
parameter implies a large amplification of the flux from the source star
(2) For u = 0, apparently infinite magnification! In reality, finite size of source
limits the peak amplification
(3) Geometric effect: affects all wavelengths equally
(4) Rule of thumb: significant magnification requires an impact parameter
smaller than the Einstein ring radius
(5) Characteristic timescale is the time required to cross the Einstein ring radius:
Optical depth to microlensing
Define the optical depth to microlensing as:
164PH507
Multiwavelength
53
This is just the integral of the area of the Einstein ring along the line of sight to the
source. For a uniform density of lenses, can easily show that the maximum contribution
comes from lenses halfway to the source.
Several groups have monitored stars in the Galactic bulge and the Magellanic clouds to
detect lensing of these stars by foreground objects (MACHO, Eros, OGLE projects).
Original motivation for these projects was to search for dark matter in the form of
compact objects in the halo.
Timescales for sources in the Galactic bulge, lenses ~ halfway along the line of sight:
• Solar mass star ~ 1 month
• Jupiter mass planet ~ 1 day
• Earth mass planet ~ 1 hour
The dependence on M1/2 means that all these timescales are observationally feasible.
However, lensing is a very rare event, all of the projects monitor millions of source
stars to detect a handful of lensing events.
Lensing by a single star
Lensing by a star and a planet
164PH507
Multiwavelength
54
What has this to do with planets?
Binaries can also act as lenses
Light curve for a binary lens is more complicated, but a characteristic is the presence of
sharp spikes or caustics. With good enough monitoring, the parameters of the binary
doing the lensing can be recovered.
Orbiting planet is just a binary with mass ratio q << 1
Planet search strategy:
• Monitor known lensing events in realtime with dense, high precision
photometry from several sites
• Look for deviations from single star lightcurve due to planets
• Timescales ~ a day for Jupiter mass planets, ~ hour for Earths
• Most sensitive to planets at a ~ R0, the Einstein ring radius
• Around 3-5 AU for typical parameters
Sensitivity to planets
Complementary to other methods:
164PH507
Multiwavelength
55
Actual sensitivity is hard to evaluate: depends upon frequency of photometric
monitoring (high frequency needed for lower masses), accuracy of photometry (planets
produce weak deviations more often than strong ones)
Very roughly: observations with percent level accuracy, several times per night, detect
Jupiter, if present, with 10% efficiency
Many complicated light curves observed:
...but no strong evidence for planets seen yet
RV, Doppler technique (v = 3m/s)
164PH507
Multiwavelength
Astrometry: angular oscillation
Photometry: transits - close-in planets
Microlensing:
56
164PH507
Multiwavelength
57
Direct detection! Photometric :
2005 image of 2M1207 (blue) and its planetary companion, 2M1207b, one of the
first exoplanets to be directly imaged, in this case from the Very Large
Telescope array in Chile
Spectroscopic? The starlight scattered from the planet can be distinguished
from the direct starlight because the scattered light is Doppler shifted by virtue
of the close-in planet's relatively fast orbital velocity (~ 150 km/sec).
Superimposed on the pattern given by the planet's albedo changing slowly with
wavelength, the spectrum of the planet's light will retain the same pattern of
photospheric absorption lines as in the direct starlight.
Lecture 8 The extrasolar planet population
Current status of exoplanet searches:
Radial Velocity Method (Doppler technique, gravitational wobble)
• 156 exoplanets hosted by134 stars discovered, with masses M.sin(i) as low as 6
Earth masses. Generally:~ 0.06 MJ and 10MJ…
164PH507
Multiwavelength
58
164PH507
Multiwavelength
59
and orbital radii from 0.02 AU to 6 AU.
• Planet fraction among ~ solar-type stars exceeds 7%
• Most are beyond 1 AU
• Around 1% of stars have hot Jupiters - massive planets at orbital radii a < 0.1
AU
• Four very low mass planets have been detected ….20 earth masses.
• Planet occurrence rises rapidly with stellar metallicity
• Multiple planets are common, often in resonant orbits
Microlensing: two strong detections, low detection rate imply upper limit of ~1/3 on the
fraction of lensing stars (~ 0.3 Msun) with Jupiter mass planets at radii to which lensing
is most sensitive (1.5 - 4 AU)
Transits: 7 known planets (5 found with OGLE photometrically – dimming).
Interesting upper limit from non-detection of transits in globular cluster 47 Tuc
Transits + Doppler yields mass and size, hence the density of the planet: 0.2 – 1.4
gm/cm3 : mainly gaseous. In addition, sodium and nitrogen found in their atmospheres.
Direct Imaging: reports of detections with HST and VLT.
Eccentricity: • Except at very small radii, typical planet orbit has significant
eccentricity
164PH507
Multiwavelength
60
164PH507
Multiwavelength
61
Eccentricity:
Eccentricity vs planet mass
164PH507
Multiwavelength
62
164PH507
Multiwavelength
Nothing very striking in these plots:
63
164PH507
Multiwavelength
64
• Accessible region of mp - a space is fully occupied by detected planets
• Ignoring the hot Jupiters, no obvious correlation between planet mass and
eccentricity...
Results from radial velocity searches
(1) Massive planets exist at small orbital radii. Closest in planet is at a = 0.035 AU,
cf Mercury at ~ 0.4 AU. Less than 10 Solar radii.
(2) Hot Jupiters have close to circular orbits. All detected planets with semi-major
axis < 0.07 AU have low e. This is similar to binary stars, and is likely due to
tidal circularization.
(3) Remaining planets have a wide scatter in e, including some planets with large e.
Note that the distance of closest approach is a(1-e), and that the effect of
tidal torques scales as separation d-6. The very eccentric planet around
HD80606 (a = 0.438 AU, e = 0.93, a(1-e) = 0.03 AU) may pose some
problems for tidal circularization theory.
164PH507
Multiwavelength
65
Account for this by considering only planets with masses large enough to be detectable
at any a < 2.7 AU.
-> dN / dlog(a) rises steeply with orbital radius
Implies that the currently detected planet fraction ~7% is likely to be a substantial
underestimate of the actual fraction of stars with massive planets.
Models suggest 15-25% of solar-type stars may have planets with masses 0.2 MJ < mp <
10 MJ.
Strong selection effect in favour of detecting planets at small orbital radii, arising from:
- lower mass planets can be detected there
- mass function rises to smaller masses
Observed mass function increases to smaller Mp:
164PH507
Multiwavelength
Note: the brown dwarf desert!
66
164PH507
Multiwavelength
67
Constraint from monitoring of 43 microlensing events. Typically, the lenses are
low mass stars.
At most 1/3 of 0.3 Solar mass stars have Jupiter mass planets between 1.5
AU and 4 AU.
Currently consistent with the numbers seen in radial velocity searches - not yet
known whether there is a difference in the planet fraction between 0.3 - 1 Solar
mass stars.
Transit lightcurve from Brown et al. (2001)
Consistent with expectations - the probability of a transiting system is ~10%.
Measured planetary radius rp = 1.35 J:
• Proves we are dealing with a gas giant.
• Somewhat larger than models for isolated (non-irradiated) planets effect of environment on structure.
• In detail, suggests planet reached its current orbit within a few x 10 Myr
after its formation.
Precision of photometry with HST / STIS impressive...
Metallicity distribution of stars with and without planets
164PH507
Multiwavelength
68
Left plot: metallicity of stars with planets (shaded histogram) compared to a sample of
stars with no evidence for planets (open histogram) (data from Santos, Israelian &
Mayor, 2001)
Host star metallicity
Planets are preferentially found around stars with enhanced metal abundance.
Cause or effect? High metal abundance could:
(a) Reflect a higher abundance in the material which formed the star +
protoplanetary disc, making planet formation more likely.
(b) Result from the star swallowing planets or planetesimals subsequent to
planets forming. If the convection zone is fairly shallow, this can apparently
enrich the star with metals even if the primordial material had Solar abundance.
Detailed pattern of abundances can distinguish these possibilities, but results currently
still controversial.
Lack of transits in 47 Tuc
A long HST observation monitored ~34,000 stars in the globular cluster 47 Tuc looking
for planetary transits.
Locally: 1% of stars have hot Jupiters
~ 10% of those show transits
 Expect 10 -3 x 34,000 ~ few tens of planets
None were detected. Possible explanations:
164PH507
Multiwavelength
69
• Low metallicity in cluster prevented planet formation
• Cluster environment destroyed discs before planets formed
• Stellar fly-bys ejected planets from bound orbits
All of these seem plausible - make different predictions for other clusters.
164PH507
Multiwavelength
70
Lecture 9: Radiation processes
Almost all astronomical information from beyond the Solar
System comes to us from some form of electromagnetic
radiation (EMR). We can now detect and study EMR over a
range of wavelength or, equivalently, photon energy,
covering a range of at least 1016- from short wavelength,
high photon energy gamma rays to long wavelength low
energy radio photons. Out of all this vast range of
wavelengths, our eyes are sensitive to a tiny slice of
wavelengths- roughly from 4500 to 6500 Å. The range of
wavelengths our eyes are sensitive to is called the visible
wavelength range. We will define a wavelength region
reaching somewhat shorter (to about 3200 Å) to somewhat
longer (about 10,000 Å) than the visible as the optical part of
the spectrum. (Note: Physicists measure optical
wavelengths in nanometers (nm). Astronomers tend to use
_Angstroms. 1 Å = 10-10 m = 0.1 nm. Thus, a physicist would
say the optical region extends from 320 to 1000 nm.)
All EMR comes in discrete lumps called photons. A photon
has a definite energy and frequency or wavelength. The
relation between photon energy (Eph) and photon frequency
 is given by:
Eph = h
164PH507
Multiwavelength
71
or, since c = 
E ph 
hc

where h is Planck’s constant and  is the wavelength, and c
is the speed of light. The energy of visible photons is
around a few eV (electron volts). (An electron volt is a nonmetric unit of energy that is a good size for measuring
energies associated with changes of electron levels in atoms,
and also for measuring energy of visible light photons. 1 eV
= 1.602 x 10-19 Joules.)
In purely astronomical terms, the optical portion of the
spectrum is important because most stars and galaxies emit
a significant fraction of their energy in this part of the
spectrum. (This is not true for objects significantly colder
than stars - e.g. planets, interstellar dust and molecular
clouds, which emit in the infrared or at longer wavelengths
- or significantly hotter- e.g. ionised gas clouds, neutron
stars, which emit in the ultraviolet and x-ray regions of the
spectrum. Another reason the optical region is important is
that many molecules and atoms have electronic transitions
in the optical wavelength region.
Blackbody Radiation
Where then does a thermal continuous spectrum come from?
Such a continuous spectrum comes from a blackbody whose
spectrum depends only upon the absolute temperature. A
blackbody is so named because it absorbs all electromagnetic
energy incident upon it - it is completely black. To be in perfect
thermal equilibrium, however, such a body must radiate energy
at exactly the same rate that it absorbs energy; otherwise, the
164PH507
Multiwavelength
72
body will heat up or cool down (its temperature will change).
Ideally, a blackbody is a perfectly insulated enclosure within
which radiation has come into thermal equilibrium with the
walls of the enclosure. Practically, blackbody radiation may be
sampled by observing the enclosure through a tiny
pinhole in one of the walls. The gases in the interior of a star are
opaque
(highly
absorbent)
to all radiation (otherwise, we would see the stellar interior at
some wavelength!); hence, the radiation there is blackbody in
character. We sample this radiation as it slowly leaks from the
surface of the star - to a rough approximation, the continuum
radiation from some stars is blackbody in nature.
We will define the regions of the Electromagnetic Spectrum to
have wavelengthds as follows:
 Gamma-rays: < 0.1Å, highest frequency, shortest
wavelength, highest energy.

X-Rays: 0.1Å -- 100Å

Ultraviolet light: 100Å -- 3000Å

Visible light: 3000Å -- 10000Å = 1µm (micrometer or
micron)

Infrared Light: 1µm -- 1mm

Radio waves: >1mm, lowest frequency, longest
wavelength, lowest energy.
Planck’s Radiation Law
After Maxwell's theory of electromagnetism appeared in 1864,
many attempts were made to understand blackbody radiation
theoretically. None succeeded until, in 1900, Max K. E. L. Planck
(1858-1947) postulated that electromagnetic energy can propagate only in discrete quanta, or photons, each of energy E = hv.
164PH507
Multiwavelength
73
He then derived the spectral intensity relationship, or Planck
blackbody radiation law:



2h 3  1
I( )d   2  h 

 c   kT



 e  1 


where I(v)dv is the intensity (J/m2 . s . sr) of radiation from a
blackbody at temperature T in the frequency range between v
and v + dv, h is Planck's constant, c is the speed of light, and k
is Boltzmann's constant. Note the exponential in the denominator.
Because the frequency v and wavelength of electromagnetic
radiation are related by v = c, we may also express Planck's
formula in terms of the intensity emitted per unit wavelength
interval:
This is illustrated for several values of T:
164PH507
Multiwavelength
74
Note
that
both
I() and I(v)
increase as the
blackbody temperature increases - the blackbody becomes
brighter. This effect is easily interpreted when we note that
I(v)∆v is directly proportional to the number of photons emitted
per second near the energy hv. The Planck function is special
enough so that its given its own symbol, B() or B(v), for
intensity.
Wien’s Law
A blackbody emits at a peak intensity that shifts to shorter
wavelengths as its temperature increases.
164PH507
Multiwavelength
75
 Wilhelm Wien (1864-1928) expressed the wavelength
at which the maximum intensity of blackbody
radiation is emitted - the peak (that wavelength for
which dI()/d = 0) of the Planck curve (found from
taking the first derivative of Planck's law) - by Wien's
displacement law:
max = 2.898 x 10-3 / T
where max is in metres when T is in Kelvin. Note that
because maxT = constant, increasing one proportionally
decreases the other.
164PH507
Multiwavelength
76
For example, the continuum spectrum from our Sun is
approximately blackbody, peaking at max ≈ 500 nm.
Therefore, the surface temperature is near 5800 K.
The Law of Stefan and Boltzmann
The area under the Planck curve (integrating the Planck
function) represents the total energy flux, F (W/m2),
emitted by a blackbody when we sum over all wavelengths
and solid angles:
164PH507
Multiwavelength
77
where = 5.669 x 10-8 W/m2 . K4. The strong temperature
dependence of this formula was first deduced from
thermodynamics in 1879 by Josef Stefan (1835-1893) and
was derived from statistical mechanics in 1884 by
Boltzmann. Therefore we call the expression the StefanBoltzmann law. The brightness of a blackbody increases as
the fourth power of its temperature. If we approximate a
star by a blackbody, the total energy output per unit time of
the star (its power or luminosity in watts) is just L =
4R2T4 since the surface area of a sphere of radius R is
4R2
To summarise: A blackbody radiator has a number of
special characteristics. One, a blackbody emits some energy
at all wavelengths. Two, a hotter blackbody emits more
energy per unit area and time at all wavelengths than does a
cooler one. Three, a hotter blackbody emits a greater
proportion of its radiation at shorter wavelengths than does a
cooler one. Four, the amount of radiation emitted per
second by a unit surface area of a blackbody depends on the
fourth power of its temperature.
Stellar Material
Our Sun is the only star for which I( has been accurately
observed. Indeed, Ibol is related to the solar constant: the total
164PH507
Multiwavelength
78
solar radiative flux received at the Earth’s orbit outside our
atmosphere (1370 W/m2). The solar luminosity L (3.86 x 1026
W) is calculated from the solar constant in the following manner.
Using the inverse-square law, we find the radiative flux at the
Sun’s surface R. Then Lis just  times this flux. The solar
energy distribution curve may be approximated by a Planck
blackbody curve at the effective temperature Teff, defined as the
temperature of a blackbody that would emit the same total
energy as an emitting body, such as the Sun or a star. Then the
Stefan-Boltzmann law implies
L = 4π R2 T4eff
J s-1
where is the Stefan-Boltzmann constant.
Stellar Atmospheres
The spectral energy distribution of starlight is determined in a
star’s atmosphere, the region from which radiation can freely
escape. To understand stellar spectra, we first discuss a model
stellar atmosphere and investigate the characteristics that
determine the spectral features.
Physical Characteristics
The stellar photosphere, a thin, gaseous layer, shields the stellar
interior from view. The photosphere is thin relative to the stellar
radius, and so we regard it as a uniform shell of gas. The physical
properties of this shell may be approximately specified by the
average values of its pressure P, temperature T, and chemical
composition µ (chemical abundances).
164PH507
Multiwavelength
79
We also assume that the gas obeys the perfect-gas law:
P =nkT
where k is Boltzmann’s constant. This relationship is also known
as Boyle’s Law.
An important result that follows from it is that the kinetic energy
of a particle, or assemblage of particles, is given by the
relationship;
KE 
3
kT
2
Thus temperature is just a measure of the kinetic energy of a gas,
or an assemblage of particles. This equation applies equally well
to a star as a whole, as to a single particle, and later we will look
at the comparison between a star’s kinetic and gravitational
(potential) energies.
The kinetic energy is also a measure of the velocity that atoms or
molecules are moving about at - the hotter they are, the faster
they move. Thus, for a cloud of gas surrounding a hot star of
temperature T = 15,000 K, which consists of hydrogen atoms
(mass = 1.67 10-27 kg);
3
1
kT  KE  mv2
2
2
v
3kT
 19 km s 1  50,000 mph
m
The particle number density is related to both the mass density
(kg/m3) and the composition (or mean molecular weight) µ by
the following definition of µ:
164PH507
Multiwavelength
80
1


mH n

where mH = 1.67 x 10-27 kg is the mass of a hydrogen atom. For
a star of pure atomic hydrogen, µ = 1. If the hydrogen is
completely ionised, µ = 1/2 because electrons and protons
(hydrogen nuclei) are equal in number and electrons are far less
massive than protons. In general, stellar interior gases are
ionised and

1
3
1
2X  Y  Z
4
2
where X is the mass fraction of hydrogen, Y is that of helium,
and Z is that of all heavier elements. The mass fraction is the
percentage by mass of one species relative to the total. Thus, for a
pure hydrogen star (X=1.0, Y = 0.0, Z = 0.0),  ~ 0.5, and for a
white dwarf star (X = 0.0, Y = 1.0, Z = 0.0)  ~ 1.33.
Temperatures
The continuous spectrum, or continuum, from a star may be
approximated by the Planck blackbody spectral-energy
distribution. For a given star, the continuum defines a colour
temperature by fitting the appropriate Planck curve. We can
also define the temperature from Wien’s displacement law:
maxT = 2.898 x 10-3 m . K which states that the peak intensity of
the Planck curve occurs at a wavelength max that varies
inversely with the Planck temperature T. The value of max
then defines a temperature. Also note here that the hotter a star
is, the greater will be its luminous flux (in W/m2), in accordance
with the Stefan-Boltzmann law: F = T4 where = 5.67 x 10-8
W/m2 . K4. Then the relation
164PH507
Multiwavelength
81
L = 4πR2T4eff
defines the effective temperature of the photosphere.
A word of caution: the effective temperature of a star is usually
not identical to its excitation (Boltzmann eqn) or ionisation
temperature (Saha eqn) because spectral-line formation
redistributes radiation from the continuum. This effect is called
line blanketing and becomes important when the numbers and
strengths of spectral lines are large.
When spectral features are not numerous, we can detect the
continuum between them and obtain a reasonably accurate value
for the star’s effective surface temperature. The line blanketing
alters the atmosphere’s blackbody character.
Spectrophotometry
164PH507
Multiwavelength
82
The goal of the observational astronomer to to make
measurements of the EMR from celestial objects with as
much detail, or the finest resolution, possible. There are of
course different types of detail that we want to observe.
These include angular detail, wavelength detail, and time
detail. The perfect astronomical observing system would
tell us the amount of radiation, as a function of wavelength,
from the entire sky in arbitrarily small angular slices. Such a
system does not exist!
We are always limited in angular and wavelength coverage,
and limited in resolution in angle and wavelength. If we
want good information about the wavelength distribution
of EMR from an object (spectroscopy or spectrophotometry)
we have to give up angular detail. If we want good angular
resolution over a wide area of sky (imaging) we usually
have to give up wavelength resolution or coverage.
The ideal goal of spectrophotometry is to obtain the
spectral energy distribution (SED) of celestial objects, or
how the energy from the object is distributed in
wavelength. We want to measure the amount of energy
received by an observer outside the Earth's atmosphere, per
second, per unit area, per unit wavelength or frequency
interval. Units of spectral flux (in cgs) look like:
f  = ergs s-1 cm-2 Å -1
if we measure per unit wavelength interval, or
f = ergs s-1 cm-2 Hz -1
(pronounced f nu if we measure per unit frequency interval.
164PH507
Multiwavelength
83
Classifying Stellar Spectra
Observations
A single stellar spectrum is produced when starlight is focused
by a telescope onto a spectrometer or spectrograph, where it is
dispersed (spread out) in wavelength and recorded
photographically or electronically. If the star is bright, we may
obtain a high-dispersion spectrum, that is, a few mÅ per
millimetre on the spectrogram, because there is enough radiation
to be spread broadly and thinly. At high dispersion, a wealth of
detail appears in the spectrum, but the method is slow (only one
stellar spectrum at a time) and limited to fairly bright stars.
Dispersion is the key to unlocking the information in starlight.
The Spectral-Line Sequence
At first glance, the spectra of different stars seem to bear no
relationship to one another. In 1863, however, Angelo Secchi
found that he could crudely order the spectra and define
different spectral types. Alternative ordering schemes appeared
in the ensuing years, but the system developed at the Harvard
Observatory by Annie J. Cannon and her colleagues was
internationally adopted in 1910. This sequence, the Harvard
spectral classification system, is still used today. (About 400,000
stars were classified by Cannon and published in various
volumes of the Henry Draper Catalogue, 1910-1924, and its
Extension, 1949.
At first, the Harvard scheme was based upon the strengths of the
hydrogen Balmer absorption lines in stellar spectra, and the
spectral ordering was alphabetical (A through to P). Some letters
were eventually dropped, and the ordering was rearranged to
correspond to a sequence of decreasing temperatures (see the
effects of the Boltzmann and Saha equations): OBAFGKMRNS.
164PH507
Multiwavelength
84
Stars nearer the beginning of the spectral sequence (closer to O)
are sometimes called early-type stars, and those closer to the M
end are referred to as late-type. Each spectral type is divided
into ten parts from 0 (early) to 9 (late); for example, . . . F8 F9 G0
G1 G2 . . . G9 K0 . . . . In this scheme, our Sun is spectral type
G2. In 1922, the International Astronomical Union (IAU)
adopted the Harvard system (with some modifications) as the
international standard.
Many mnemonics have been devised to help students retain the
spectral sequence. A variation of the traditional one is “Oh, Be a
Fine Girl, Kiss Me Right Now, Smack.”
The next Figure shows exemplary stellar spectra arranged in
order; note how the conspicuous spectral features strengthen and
diminish in a characteristic way through the spectral types.
164PH507
Multiwavelength
85
Comparison of spectra observed for seven different stars having a
range of surface temperatures. The hottest stars, at the top, show
lines of helium and multiply-ionised heavy elements. In the
coolest stars, at the bottom, helium lines are not seen, but lines of
neutral atoms and molecules are plentiful. At intermediate
temperatures, hydrogen lines are strongest. The actual
compositions of all seven stars are about the same.
164PH507
Multiwavelength
86
The Temperature Sequence
The spectral sequence is a temperature sequence, but we must
carefully qualify this statement. There are many different kinds
of temperatures and many ways to deter-mine them.
Theoretically, the temperature should correlate with spectral
type and so with the star’s colour. From the spectra of
intermediate-type stars (A to K), we find that the (continuum)
164PH507
Multiwavelength
87
colour temperature does so, but difficulties occur at both ends of
the sequence. For O and B stars, the continuum peaks in the far
ultraviolet, where it is undetectable by ground-based
observations.
Through satellite observations in the far
ultraviolet, we are beginning to understand the ultraviolet
spectra of O and B stars. For the cool M stars, not only does the
Planck curve peak in the infrared, but numerous molecular
bands also blanket the spectra of these low-temperature stars.
164PH507
Multiwavelength
88
When the strengths of various spectral features are plotted
against excitation-ionisation (or Boltzmann-Saha) temperature;
the spectral sequence does correlate with this temperature as
seen below;
164PH507
Multiwavelength
89
In practice, we measure a star’s colour index, CI = B - V, to
determine the effective stellar temperature.
If the stellar
continuum is Planckian and contains no spectral lines, this
procedure clearly gives a unique temperature, but observational
uncertainties and physical effects do lead to problems: (a) for the
very hot O and B stars, CI varies slowly with Teff and small
uncertainties in its value lead to very large uncertainties in T; (b)
for the very cool M stars, CI is large and positive, but these faint
stars have not been adequately observed and so CI is not well
determined for them; (c) any instrumental deficiencies,
calibration errors, or unknown blanketing in the B or V bands
affect the value of CI - and thus the deduced T. Hence, it is best
to define the CI versus T relation observationally.
SPECTROSCOPY
• Last year discussed stellar spectra and classification on an
empirical basis:
Spectral sequence
O B A F G K M
Temperature
~40,000 K
---->
2500 K
Classification based on relative line strengths of He, H,
Ca, metal, molecular lines.
• We will now look a little deeper at stellar spectra and
what they tell us about stellar atmospheres.
Radiative Transfer Equation
• Imagine a beam of radiation of intensity I passing
through a layer of gas:
164PH507
Multiwavelength
Power passing into volume
90
Area
dA
E = I d dA d

Power passing out
of volume
E  + dE 
where I = intensity into
solid angle element d 
path length ds
NB in all these equations subscripts  can be replaced by 
In the volume of gas there is:
ABSORPTION - Power is reduced by amount
dE = -   E ds = -   I d dA d ds
where 
is the ABSORPTION COEFFICIENT or
OPACITY
= the cross-section for absorption of radiation of
wavelength  (frequency ) per unit mass of gas. Units of
are m2 kg-1
The quantity  is the fraction of power in a beam of radiation of wav
 absorbed by unit depth of gas. It has units of m-1. (NB in many texts
simply given the symbol  in the equations given here beware!)
EMISSION - Power is increased by amount
dE = j  d dA d ds
(1)
where j = EMISSION COEFFICIENT = amount of energy
emitted per second per unit mass per unit wavelength
into unit solid angle.
Units of j (j) are W kg-1 µm-1 sr-1 (W kg-1 Hz-1 sr-1) or m s-3 sr-1
(NB power production per unit volume per unit wavelength into unit so
164PH507
Multiwavelength
91
angle is  =j More confusion is possible here, since is also the
symbol used for total power output of a gas, units are W kg-1, - Bewar
So total change in power is
dE = dI d dA d = -   I d dA d ds + j  d dA d ds
which reduces to
dI = -   I ds + j  ds
dI
ds
= - I + j 
(2)
(3)
This is a form of the radiative transfer equation in the
plane parallel case.
Optical depth
• Take a volume of gas which only absorbs radiation (j =
0) at  :
dI = -   I ds
For a depth of gas s, the fractional change in intensity is
given by
I (s)

dI
 I
I (0)

s

=

0 -  ds

ln (
Integrating ==>
I (s)

I (0)

s
) = -
  ds
0

s
-
==>
I (s) = I (0) e

 ds
0

We define Optical depth 
s

   ds



(4)
164PH507
Multiwavelength
92
-

So
(5)
I (s) = I (0) e


• Intensity is reduced to 1/e (=1/2.718 = 0.37 ) of its
original value if optical depth = 1.
• Optical depth is not a physical depth. A large optical
depth can occur in a short physical distance if the
absorption coefficient  is large, or a large physical
distance if  is small.
Full Radiative Transfer Equation again
dI
= - I + j 
ds
divide by  
dI

 ds
= -I +


j



dI

d
= -I + S


As ds --> 0,  is constant over ds. 
This is the RADIATIVE TRANSFER EQUATION in the
plane parallel case.
Define:

S =
where

j


or
j =  S




and S is the SOURCE FUNCTION.
Radiative transfer in a blackbody
• Remember definition of a blackbody as a perfect absorber
and emitter of radiation. Matter and radiation are in
THERMODYNAMIC EQUILIBRIUM, ie gross properties
do not change with time. Therefore a beam of radiation
in a blackbody is constant:
(6)
164PH507
Multiwavelength
dI
ds
93
= 0 = - I + j 
from definition of source function, j =  S
==> 0 =  (I - S),
i.e. I = S.
but for a blackbody I = B
the PLANCK FUNCTION
2
B =

2hc

3
1
hc/kT
(e
- 1)
B =

2h
2
c
1
hkT
(e
- 1)
Summary: in complete thermodynamic equilibrium the source
function equals the Planck function,
i.e.
j =  B
• In studies of stellar atmospheres we make the assumption
of LOCAL THERMODYNAMIC EQUILIBRIUM (LTE),
i.e. thermodynamic equilibrium for each particular layer
of a star.
• Note that if incoming radiation at a particular
wavelength (e.g. in a spectral line) enters a blackbody gas
it is absorbed, but emission is distributed over all
wavelengths according to the Planck function. All
information about the original energy distribution of the
radiation is lost. This is what happens in interior layers of
a star where the density is high and photons of any
wavelength are absorbed in a very short distance. Such a
gas is said to be optically thick (see below).
Emission and Absorption lines
•  the absorption coefficient describes the efficiency of
absorption of material in the volume of gas. In a low
density gas, photons can generally pass through without
interaction with atoms unless they have an energy
corresponding to a particular transition (electron energy
level transition, or vibrational/rotational state transition
in
molecules).
At
this
particular
(7)
(K
164PH507
Multiwavelength
94
energy/frequency/wavelength the absorption coefficient
 is large.
• Let's imagine the volume of gas shown earlier with both
absorption and emission:
I
I  (0)
path length s
dI

d
= S - I



Multiply both sides by e and re-arrange
dI

==>
d



e + I e = S e



d
==>
d


(I e ) = S e 



integrate over whole volume, i.e. from 0 to s, or 0 to 


I e

==>


=
0

S e


0
assuming S = constant along path
I e - I(0) = S e - S
==>
==>
I
I(0) e- +
radiation left
over from light
entering box.
=
S (1 - e- )
(8)
light from radiation
emitted in the
box.
 >> 1: OPTICALLY THICK CASE
If  >> 1, then e- --> 0, and eqn (8) becomes
• Case 1
I =
164PH507
Multiwavelength
95
S
In LTE
S  = B ,
the Planck function.
So for an optically thick gas, the emergent spectrum is
the Planck function, independent of composition or
input intensity distribution. True for stellar photosphere
(the visible "surface" of a star).
• Case 2
(9)
 << 1 OPTICALLY THIN CASE
If  << 1, then e- ≈ 1 - 
(first two terms of Taylor series expansion)
eqn (8) becomes I = I(0) (1 - ) + S (1 - 1 + )
==>I = I(0) +  ( S - I(0) )
(10)
• If I(0) = 0 : no radiation entering the box (from
direction of interest):
From eqn (8)
I =  S
(= 
B in LTE)
Since  = ∫  , then
I =   s S
If  is large (true at wavelength of spectral lines) then I is large,
we see EMISSION LINES. This happens for example in gaseous
nebulae or the solar corona when the sun is eclipsed.
• If I(0) ≠ 0 , let's examine eqn (8)
I = I(0) +  ( S - I(0) )
If S > I(0) then right hand term is +ve
when  is large (ie  is large) we see higher intensity
than I(0)
==> EMISSION LINES ON BACKGROUND INTENSITY.
If S < I(0) then right hand term is -ve
when  is large (ie  is large) we see lower intensity
than I(0)
==>ABSORPTION LINES ON BACKGROUND INTENSITY.
For stars we see absorption lines. This means I(0) > S,
164PH507
Multiwavelength
96
i.e. (intensity from deeper layers) > (source function for the top layers
Assuming LTE (S = B) the source function increases as
temperature increases:I(0) = B(Tdeep layer) > S = B(Touter layer)
Therefore temperature must be increasing as we go into
the star for absorption lines to be observed.
• To summarise
- We see CONTINUUM RADIATION for an optically
thick gas
(= PLANCK FUNCTION assuming LTE).
- We see EMISSION LINES for an optically thin gas.
- We see ABSORPTION LINES + CONTINUUM for an
optically thick gas
overlaid by optically thin gas with temperature decreasing outwards.
- We see EMISSION LINES + CONTINUUM for an
optically thick gas overlaid by an optically thin gas with
temperature increasing outwards.
Atomic Spectra - Absorption & Emission line series and
continua
• Bohr theory (last year's physics unit) adequately describes
electron energy levels in Hydrogen. Quantum mechanics
is required for more massive atoms to describe the
dynamics of electrons. However, we are interested here
only in the energy levels of electron states rather than a
detailed model or description of atomic structure. We can
therefore use ENERGY LEVEL DIAGRAMS without
164PH507
Multiwavelength
97
worrying too much about the theory behind them.
• There are 3 basic photon absorption mechanisms related
to electrons. Using Hydrogen as the example, the electron
energy levels, n, are described by
E(n) = - 2 2 me e4 Z2 / n2 h2
from Bohr Theory
Bound-Bound Transitions
• BOUND - BOUND transitions give rise to spectral lines.
• ABSORPTION LINE if a photon is absorbed, causing
increase in energy of electron.
Energy of absorbed
photon
h = E(nu) - E(nl)
(1)
where E(nu) and E(nl) are energies of upper and lower
energy levels respectively.
This is RADIATIVE
EXCITATION.
• Note energy can also be absorbed from collision of a free
particle (COLLISIONAL EXCITATION) - no absorption
line is seen in this case.
• Atom remains in excited state until
SPONTANEOUS EMISSION (photon is emitted typically
after ~10-8 s)
or INDUCED EMISSION (Photon emitted at same energy
and coherently
with incoming photon - as in lasers).
Both produce EMISSION LINES.
• Narrow lines are seen since () is high, otherwise gas
transitions can only occur if is transparent and  ()
photon
has
energy is low.
(frequency/wavelength)
• Energy
level
diagram
corresponding to difference shows electron energy
in energy levels. If this is level
changes
for
the
case
then
the absorption of a photon.
absorption coefficient  Lowest energy level set to
PH507
Astrophysics
Dr. S.F. Green
zero energy. 1eV = 1.6 x
10-19 J.
98
n=•
n=4
n=3
13.6 eV
12.73 eV
12.07 eV
n=2
10.19 eV
n=1
Lyman
Series
Balmer Paschen
Series Series
0 eV
• Series of lines seen
-LYMAN SERIES transitions to/from n=1 lines seen in
ultraviolet
-BALMER SERIES ""
n=2
""
visual
-PASCHEN SERIES""
n=3
""
infrared ...
• Energy of absorbed photon
is
h = ( - E(nl)) + 1/2 me
v2
Bound-free transitions
• If photon has energy
1/2 m ev 2
13.6 eV
greater than that required n=•
12.73 eV
to move an electron in an n=4
12.07 eV
atom from its current n=3
energy level to level n=∞, n=2
10.19 eV
the
electron
will
be
released, ionizing the atom.
• Ionization potential for n=1
0 eV
Hydrogen is
13.6 e
• Since one of the states (free electron) can have any energy,
the transition can have any energy and the photon any
frequency (above a certain value determined by  and
E(nl)).
Thus BOUND-FREE transitions give an ABSORPTION
CONTINUUM.
• RE-COMBINATION is a FREE-BOUND transition and
PH507
Astrophysics
Dr. S.F. Green
99
results in an EMISSION CONTINUUM.
• The spectrum produced by
absorption from a single continuum • 
energy level will therefore
appear as a series of lines


of
increasing
energy
(increasing
frequency, decreasing wavelength) up to a limit defined by
-E(nl), with an absorption continuum shortward of this
limit.
• For nl=1 the Lyman series (Lyman-, Lyman- etc) is
observed together with the Lyman continuum shortward
of =91.2 nm. (Since interstellar space is populated by
very low density and low temperature hydrogen (ie with
n=1) photons with <91.2nm are easily absorbed so it is
opaque in the near-UV).
For nl=2 the Balmer series (H, H etc) is observed
together with the Balmer continuum shortward of
=364.7 nm.
Free-free transitions
• Absorption of a photon by a free electron in the vicinity of
an ion.
Electron changes from free energy state with velocity v1
to one with velocity v2
i.e. h = 1/2 me v22 - 1/2 me v12
Determination of 
• The actual spectrum of a star depends on the physical
conditions (notably temperature) and composition of the
stellar atmosphere. The intensity is produced at a
physical level in the star where  ~ 2/3. In order to
determine the total spectrum, the value of  needs to be
determined at all wavelengths. The overall  is the sum
PH507
Astrophysics
Professor Glenn White
100
of the contributions from each atomic/molecular species
in the atmosphere. Each component of  depends on the
number of atoms/molecules with a given energy state
capable of absorbing radiation at that frequency and the
absorption efficiency. We will deal with the energy state
populations first:
Boltzmann's equation (Excitation equilibrium)
• Boltzmann's
equation
describes
the
population
distribution of energy states for a particular atom in a gas.
The ratio of number of atoms per m3 in energy state B to
energy state A:
NB
NA
gB (EA - EB)/kT
e
gA
=
(50)
where gA and gB are STATISTICAL WEIGHTS (number
of different quantum states of the same energy), k =
Boltzmann const and T = temperature of gas.
NB EB > EA so exponential power is -ve.
• The probability of finding an atom in an excited state
decreases exponentially with the energy of the excited
state, but increases with increasing temperature.
Saha Equation (Ionization Equilibrium)
• The Boltzmann eqn does not describe all the possible
atomic states. Excitation may cause electrons to be lost
completely. There are therefore a number of different
ionization states for a given atom, each of which has one
or more energy states.
• The ratio of the number of atoms of ionization state i+1 to
those of ionization state i (i=I is neutral, i=II is singly
ionized, etc) is given by
3/2
Ni+1
Ni
=
Ui+1 2
Ui Ne
2 me k T
2
h
-i /kT
e
where Ne is the electron density (number of electrons per
PH507
Astrophysics
Professor Glenn White
101
m3),
i is the ionization potential of the ith ionization state,
Ui+1 and Ui are PARTITION FUNCTIONS obtained from
•
weights:
Ui = gi1 +
-Ein /kT
e
in
g
n=2
the statistical
• The higher the Ionization potential, i, the lower the
fraction of atoms in the upper ionization state,
The higher the Temperature, , the higher the fraction of
atoms in the upper ionization state, (Collisional excitation
is more likely to ionize atom),
The higher the electron density, the lower the fraction of
atoms in the upper ionization state (due to recombination).
• The Boltzmann and Saha Equations give the fraction of
atoms in a given ionization state and energy level
allowing (when combined with absorption/emission
probabilities)  and hence the line strengths to be related
to abundances.
Example - Abundances in the Sun
• In line forming regions in the Sun:T ~ 6000 K, Ne ~ 7x1019 m3.
Gas
I
II UII/UIUIII/UII g1
Hydrogen 13.6 eV
2
2
Calcium
6.1 eV 11.9 eV ~2
~0.5
1
g2
2
6
From Saha Equation for Hydrogen, the ratio of ionized to un-ionized H
NII/NI ≈ 6x10-5
i.e. most of Hydrogen is un-ionized.
From Boltzmann equation, ratio of number of atoms with electrons in l
n=2 to those in level n=1 (E1-E2 = -10.19 eV) is
N2/N1 ≈ 3x10-9
i.e almost all H atoms in ground state.
The H Balmer lines which originate from level n=2 are
PH507
Astrophysics
Professor Glenn White
102
strong only because the H abundance is so high.
From Saha Equation for Calcium,NII/NI ≈ 600 and NIII/NII ≈ 2x10
i.e. most of Calcium is in singly ionized state.
From Boltzmann equation, ratio of number of atoms with electrons in
energy states which contribute to the H and K lines to those in the
ground state (E1-E2 = -3.15, -3.13 eV) is (NB/NA)II ≈ 10-2
i.e most
Ca atoms in ground state.
The H and K lines of Calcium are therefore strong
because most Ca atoms in the Sun are in an energy state
capable of producing the lines.
• For stars cooler than the Sun more H is in the ground
state so Balmer lines will be weaker, for stars hotter than
the Sun more H is in n=2 state so Balmer lines will be
stronger. (T ~ 85000 K needed for N2/N1 =1). But at this
temperature NII/NI = 105 so little remains un-ionized.
• Balmer line strength depends on excitation (function of T)
and ionization (function of T and Ne). Balance of effects
occurs at T ~ 10,000 K so Balmer lines are strongest in A0
stars.
• A similar effect occurs for other species but at different
temperatures.
Transition probabilities
• Once we know the population of all energy states for a
given gaseous species we need to know the transition
probabilities for each energy state change before the
absorption coefficient can be determined.
• The transition probabilities must be calculated from
atomic theory or determined by experiment - much time
has been invested in this major problem in astrophysics.
• The EINSTEIN TRANSITION PROBABILITY (inverse of
lifetime):
PH507
Astrophysics
Professor Glenn White
103
for spontaneous emission, A21  2
for stimulated emission
B21  -1
for absorption
A12  -1
Total 
• We can now calculate  for a given gaseous species. For
Hydrogen (removing spectral line opacities for clarity):
Lyman continuum
absorption
falls off with decreasing 

due to  -1 dependence
Log 
T~25000K (B star)
Balmer
continuum
absorption
Paschen
continuum
absorption
T~5000K (G star)
(nm)
• Similar diagrams exist for other species. The total  will
be the sum for all species in the star.
• The region of a star for which optical depth ~2/3
determines where observed radiation originates. So if 
is large, then = 2/3 at a high level in the atmosphere
and if  is low, = 2/3 deep in the atmosphere.
Solar photospheric opacity
• The solar atmosphere is dominated by hydrogen. The
visible surface, the photosphere, has a temperature ~5800
K. However, as can be seen from the diagram above, 
for hydrogen at low temperatures is low in the visible
region (~400-700nm). This is because the continuum
absorption in the visible is due to Paschen absorption
PH507
Astrophysics
Professor Glenn White
104
(electrons originating in level n=3) and most hydrogen is
in ground state or n=2 level. We would therefore expect
the continuum to come from much deeper in the sun
where temperatures are higher. So what causes the high
solar photospheric opacity?
500.
•The solar opacity comes from the
H- ion. The ionization potential for
H- -->
Log 
T~25000K (B star)
is 0.75 eV (=1650nm).
H - bound-free H - free-free
From Boltzmann eqn
N3/N1 =
T~5000K (G star)
But from Saha eqn
(nm)
N(H)/N(
Therefore N(H-)/N3 ≈
i.e. number of H- ions is greater than number of H atoms
in level n=3, so absorption of photons to dissociate H- to
H dominates the continuum absorption in the optical.
Limb darkening
• The Sun is less bright near the limb than at the centre of
the disk.
 The continuum spectrum of the entire solar disk defines a
Stefan-Boltzmann effective temperature of 5800 K for the
photosphere, but how does the temperature vary in the
photosphere?
A clue is evident in a white-light
photograph of the Sun.
PH507
Astrophysics
Professor Glenn White
105

 We see that the brightness of the solar disk decreases
from the centre to the limb - this effect is termed limb
darkening.
Limb darkening arises because we see deeper, hotter gas
layers when we look directly at the centre of the disk and
PH507
Astrophysics
Professor Glenn White
106
higher, cooler layers when we look near the limb.
Assume that we can see only a fixed distance d through the
solar atmosphere. The limb appears darkened as the
temperature decreases from the lower to the upper
photosphere because, according to the Stefan-Boltzmann
PH507
Astrophysics
Professor Glenn White
107
law (Section 8-6), a cool gas radiates less energy per unit
area than does a hot gas. The top of the photosphere, or
bottom of the chromosphere, is defined as height = 0 km.
Outward through the photosphere, the temperature drops
rapidly then again starts to rise at about 500 km into the
chromosphere, reaching very high temperatures in the
corona.
Formation of solar absorption lines. Photons with energies well
away from any atomic transition can escape from relatively deep
in the photosphere, but those with energies close to a transition
are more likely to be reabsorbed before escaping, so the ones we see
on Earth tend to come from higher, cooler levels in the solar
atmosphere. The inset shows a close-up tracing of two of the
thousands of solar absorption lines, those produced by calcium at
about 395 nm.
PH507
Astrophysics
Professor Glenn White
108
At this point, you may have discerned an apparent paradox:
how can the solar limb appear darkened when the
temperature rises rapidly through the chromosphere?
Answering this question requires an understanding of the
concepts of opacity and optical depth. Simply put, the
chromosphere is almost optically transparent relative to the
photosphere. Hence, the Sun appears to end sharply at its
photospheric surface - within the outer 300 km of its 700,000
km radius.
Our line of sight penetrates the solar atmosphere only to the
depth from which radiation can escape unhindered
(where the optical depth is small). Interior to this point,
solar radiation is constantly absorbed and re-emitted (and
so scattered) by atoms and ions.
PH507
Astrophysics
Professor Glenn White
109
Y
Length of each solid bar is
approximately the same,
i.e. depth for which =2/3
R
y
Observer
X
Rx
Since R y > R x, radiation from the edge
of the disk, Y, originates from a higher
(cooler) region than at the centre of the
disk, X.
Assuming LTE, the continuum radiation
is described by the Planck function since
Y is at lower temperature, radiation is of
lower intensity
Spectral line formation
• Lines form higher in atmosphere than continuum. For
optical lines this corresponds to lower temperature than
continuum and therefore lower intensity (absorption
lines) (see p21 where S < I).
small
~2/3 low in
atmosphere
6500
T (K)
high
~2/3 high in
atmosphere
4500
0
200
400 km
Height above photosphere
F

PH507
Astrophysics
Professor Glenn White
110
Spectral line strength
Spectral lines are never perfectly monochromatic.
Quantum mechanical considerations govern minimum
line width, and many other processes cause line
broadening .
• For abundance calculations we want to know the total
line strength. Total line strength is characterised by
EQUIVALENT WIDTH.
F

Normal line
F

Fully saturated line

Shaded areas are equal
 = equivalent width


Stellar composition
• Derived from spectral line strengths in stellar
atmospheres.
In the solar neighbourhood, the
composition of stellar atmospheres is:
Element
H
He C,N,O,Ne,Na,Mg,Al,Si,Ca,Fe, others
% mass
70
28
~2.
Spectral line structure
• NATURAL WIDTH:
Due to uncertainty principle,
E=h/t, applied to lifetime of excited state.
For
"normal" lines the atom is excited (by a photon or
collision) to an excited atate which has a short lifetime t
~ 10-8 s. The upper energy level therefore has uncertai
energy E and the resultant spectral line (absorption or
emission) has an uncertain energy (wavelength). Line has
a Lorentz profile,  ~ 10-5 nm for visible light.
• COLLISIONAL/PRESSURE BROADENING:
Outer energy levels of atoms affected by presence of
neighbouring charged particles (ions and electrons).
random effects lead to line broadening since the energy of
PH507
Astrophysics
Professor Glenn White
111
upper energy level changes relative to the unexcited state
energy level.
This is the basis of the Luminosity
classification for A,B stars. Gaussian profile.  ~ 0.02 - 2
nm.
• DOPPLER BROADENING:
Due to motions in gas producing the line. Doppler shift
occurs for each each photon emitted (or absorbed) since
the gas producing the line is moving relative to the
observer (or gas producing the photon).
Thermal Doppler broadening due to motions of
individual atoms in the gas. ~0.01 - 0.02 nm for Balmer
lines in the Sun. Gaussian profile.
Bulk motions of gas in convection cells. Gaussian profile.
• ROTATION:
If there is no limb darkening, then lines have
hemispherical profile due to combination of radiation
from surface elements with different radial velocities.
Effect depends on rotation rate, size of star and angle of
polar tilt.
In general, V sin i is derived from the profile.
_
V -1
(km s )
200
Receding
+V
A
Approaching
-V
B
C
F

A
B
o
C
100

0
O B A F G K
• ATMOSPHERIC OUTFLOW:
Many different types.
Star with expanding gas shell (result of outburst) gives PCYGNI PROFILE.
Continuum (+ absorption lines) from star, emission or
PH507
Astrophysics
Professor Glenn White
112
absorption lines from shell:
F
Expanding
gas
shell
B

D
D
C
Star
D
A

o
Observer
B
C
A
C
B
Radiation from star, A, passes through cooler cloud
giving absorption line due to shell material which is blue
shifted relative to star. Elsewhere, emission lines are
seen.
Be STARS: Very rapid rotators with material lost from
the equator:
Radiation from star, A, passes through cooler cloud
giving absorption line.
Overall line structure is
hemispherical rotation line (B,D). Emission lines seen due
to shell material (C,E).
C
F
Rotating
gas
shell
E
Star
B
A
o
Observer
C
B
A
D
E
D

PH507
Astrophysics
Professor Glenn White
113
Forbidden lines
• Only certain transitions are generally seen for two
reasons:
1) Outer energy levels are far from the nucleus so in dense
gases, levels are distorted or destroyed by interactions.
2) Selection rules for change of quantum numbers restrict
possible transitions.
• In fact forbidden transitions are not actually forbidden.
However, the probability of a forbidden transition is very
low, so an allowed transition will generally occur. The
lifetimes in an excited state for which there are no allowed
downward transitions are ~10-3 - 109 seconds (ie very
low transition probability).
These are called
METASTABLE STATES.
• De-excitation from a metastable state can be by:
1) Collisional excitation, or absorption of another photon
to higher energy state allowing another downward
transition to the equilibrium state,
2) FORBIDDEN TRANSITION producing a FORBIDDEN
LINE. Usually denoted with [], e.g. [OII 731.99].
• Forbidden lines are usually much fainter than those from
allowed transitions due to low probability.
• In interstellar nebulae excited by UV from nearby hot
stars, some elements' excited states have no allowed
downward transitions to the ground state. In the absence
of frequent collisions (due to low density) or high photon
flux, a forbidden transition is the only way to the ground
state.
• These lines were not understood for a long while. A new
element Nebulium was invented to account for them.
PH507
Astrophysics
Professor Glenn White
114
For instance, the UBV system has about 100 standard stars measured to about ±
0.01 magnitude. Then if we can calibrate the flux of just one of these stars, we
have calibrated the system. The calibration is usually given for zero magnitude
at each filter; all fluxes are then derived from this base level. The star usually
chosen as the calibration star is Vega.
Colour index in the BV system. Blackbody curves for 20,000 K and 3000 K, along with
their intensities at B and V wavelengths. Note that B - V is negative for the hotter star,
positive for the cooler one.
Lectures 13-14 : Nearby objects
PH507
Astrophysics
Professor Glenn White
115
PH507
Astrophysics
Professor Glenn White
116
PH507
Astrophysics
Professor Glenn White
117
PH507
Astrophysics
Professor Glenn White
118
PH507
Astrophysics
Professor Glenn White
119
STARS
The Hertzsprung-Russell Diagram
In 1911, Ejnar Hertzsprung plotted the first such two-dimensional diagram (absolute
magnitude versus spectral type) for observed stars, followed (independently) in 1913 by
Henry Norris Russell; today, this plot is called a Hertzsprung-Russell (H-R) diagram.
We infer the properties of stars from their light - in bulk for fluxes and spread out for
spectra. This chapter deals with the wealth of information that can be discerned by
studying stellar spectra. First we consider stellar atmospheres, for here the stellar
spectra originate. Then we tell the story of spectral observations - how they have been
made, correlated, and interpreted. Finally, we present that famous and crucial synthesis
- the Hertzsprung-Russell diagram - and some of its implications. This discussion will
lead to an understanding of the stars themselves.
PH507
Astrophysics
Professor Glenn White
120
Most stars have properties within the shaded region known as the main sequence. The points plotted here
are for stars lying within about 5 pc of the Sun. The diagonal lines correspond to constant stellar radius,
so that stellar size can be represented on the same diagram as luminosity and temperature. (Recall that
stands for the Sun.)
PH507
Astrophysics
Professor Glenn White
121
An H-R diagram for the 100 brightest stars in the sky. Such a plot is biased in favour of the most
luminous stars--which appear toward the upper left--because we can see them more easily than we can the
faintest stars.
PH507
Astrophysics
Professor Glenn White
122
PH507
Astrophysics
Professor Glenn White
123
Stellar (Main Sequence) Properties With Mass
Mass
Temp Radius Luminosity
40 MSun 35,000 K 18 RSun 320,000 LSun
tMS
habitable zone
106 yrs
350-600 AU
17
21,000
8
13,000
107
7
13,500
4
630
8x107
2
8,100
2
20
2x109
1
5,800
1
1
1010
1-2
0.2
2,600
0.32
0.0079
5x1011
0.1-0.2
As will soon become clear to you, this simple diagram represents one of the great
observational syntheses in astrophysics. Note that any two of luminosity, magnitude,
temperature, and radius could be used, but visual, magnitude, and temperature are
universally obtained quantities for stars.
Stellar luminosity classes in the H-R diagram. Note that a star's location could be specified by its spectral
type and luminosity class instead of by its temperature and luminosity.
PH507
Astrophysics
Professor Glenn White
124
PH507
Astrophysics
Professor Glenn White
125
Magnitude versus Spectral Type
The first H-R diagrams considered stars in the solar neighbourhood and plotted absolute
visual magnitude, M, versus spectral type, Sp, which is equivalent to luminosity
versus spectral type or luminosity versus temperature. The Figure below shows this
type of plot for the stars with well-determined distances within about 5 pc of the Sun.
Note (a) the well-defined main sequence (class V) with ever-increasing numbers of stars
toward later spectral types and an absence of spectral classes earlier than A1 (Sirius), (b)
the absence of giants and supergiants (class III and I), and (c) the few white dwarfs at
the lower left.
In contrast, the H-R diagram for the brightest stars includes a significant number of
giants and supergiants as well as several early-type main-sequence stars. Here we have
made a selection that emphasises very luminous stars at distances far from the Sun.
Note that the H-R diagram of the nearest stars is most representative of those throughout
the Galaxy: the most common stars are low-luminosity spectral type M.
We can also trace the birth of young stars on this plot:
PH507
Astrophysics
Professor Glenn White
126
Magnitude versus Colour
Because stellar colours and spectral types are roughly correlated, we may construct a
plot of absolute magnitude versus colour - called a colour-magnitude diagram. The
relative ease and convenience with which colour indices (such as B - V) may be
determined for vast numbers of stars dictates the popularity of colour-magnitude plots.
The resulting diagrams are very similar to the magnitude-spectral type H-R diagrams
considered above. Let’s see what information we can glean from them.
The Mass-Luminosity Relationship
Just as the determination of the period and size of the Earth’s orbit (by Kepler’s third
law) leads to the Sun’s mass, so also have we deduced binary stellar masses. Because it
is necessary to know the distance to the binary system in order to establish these masses,
we need only observe the radiant flux of each star to find its luminosity.
When the observed masses and luminosities for stars in binary systems are plotted, we
obtain the correlation called the mass-luminosity relationship.
PH507
Astrophysics
Professor Glenn White
127
In 1924, Arthur S. Eddington calculated that the mass and luminosity of normal stars
like the Sun are related by

L  M 


L  M  

His first crude theoretical models indicated that 
≈ 3. On a log-log plot, this gives
a straight line with a slope of . Main sequence stars do seem to conform to this
relationship, although the exponent varies from 
≈ 3 for luminous and massive
stars through 
-type stars to 
2 for dim red stars of low mass.
From a sample of 126 well-studied binary systems, we find that the break in slope
below this value is 2.26; above it, 3.99.
3
L
~M
The value of the exponent varies for different kinds of stars, and in normally between ~
2.7 and 4. The value of 3 is appropriate to stars more massive than the sun.
PH507
Astrophysics
Professor Glenn White
128
The more massive stars burn their fuel very rapidly, leading to short lifetimes:
M*/Msun
60
30
10
3
1.5
1
0.1
time (years)
3 million
11 million
32 million
370 million
3 billion
10 billion
1000's billions
Spectral type
O3
O7
B4
A5
F5
G2 (Sun)
M7
For L Mn ,
value of exponent n
3.9
3.0
2.7
Mass range M
M < 7 M
7 M < M < 25 M
25 M < M
Today, astrophysical theories of stellar structure explain these results in terms of the
different internal structures of stars of different mass and the opacities of stellar
atmospheres at different temperatures. Note that the M-L law does not apply to highly
evolved stars, such as red giants (with extended atmospheres) and white dwarfs (with
degenerate matter. While most stellar masses lie in the narrow range from 0.085M
to 100M , stellar luminosities cover the vast span 10-4 ≤ L/L ≤ 106!
A useful relationship to give a rule of thumb estimate of a stars surface temperature is;
0.5
 M 
T  5870  
M* 
PH507
Astrophysics
Professor Glenn White
129
PH507
Astrophysics
Professor Glenn White
130
PH507
Astrophysics
Professor Glenn White
131
PH507
Astrophysics
Professor Glenn White
132
PH507
Astrophysics
Temperature inside the Sun
Professor Glenn White
133
PH507
Astrophysics
Professor Glenn White
134
PH507
Astrophysics
Professor Glenn White
135
Photospheric Temperatures

The continuum spectrum of the entire solar disk defines a Stefan-Boltzmann
effective temperature of 5800 K for the photosphere, but how does the
temperature vary in the photosphere? A clue is evident in a white-light
photograph of the Sun.

We see that the brightness of the solar disk decreases from the centre to the
limb - this effect is termed limb darkening.
Limb darkening arises because we see deeper, hotter gas layers when we look
directly at the centre of the disk and higher, cooler layers when we look near
the limb.
PH507
Astrophysics
Professor Glenn White
136
Assume that we can see only a fixed distance d through the solar atmosphere. The limb
appears darkened as the temperature decreases from the lower to the upper photosphere
because, according to the Stefan-Boltzmann law, a cool gas radiates less energy per unit
area than does a hot gas. The top of the photosphere, or bottom of the chromosphere, is
defined as height = 0 km. Outward through the photosphere, the temperature drops
rapidly then again starts to rise at about 500 km into the chromosphere, reaching very
high temperatures in the corona.
Formation of solar absorption lines. Photons with energies well away from any atomic transition can
escape from relatively deep in the photosphere, but those with energies close to a transition are more likely
to be reabsorbed before escaping, so the ones we see on Earth tend to come from higher, cooler levels in the
solar atmosphere. The inset shows a close-up tracing of two of the thousands of solar absorption lines,
those produced by calcium at about 395 nm.
PH507
Astrophysics
Professor Glenn White
137
At this point, you may have discerned an apparent paradox: how can the solar limb
appear darkened when the temperature rises rapidly through the chromosphere?
Answering this question requires an understanding of the concepts of opacity and
optical depth. Simply put, the chromosphere is almost optically transparent relative to
the photosphere. Hence, the Sun appears to end sharply at its photospheric surface within the outer 300 km of its 700,000 km radius.
Our line of sight penetrates the solar atmosphere only to the depth from which radiation
can escape unhindered (where the optical depth is small). Interior to this point, solar
radiation is constantly absorbed and re-emitted (and so scattered) by atoms and ions.
PH507
Astrophysics
Professor Glenn White
138
The Chromosphere

The solar chromosphere extends about 10,000 km above the photosphere, and its
gas density is far less than that of the photosphere. This thin layer has a reddish hue
- as a result of the Balmer (H) emission of hydrogen - visible during a total solar
eclipse.

At the limb of the Sun, tenuous jets of glowing gas 500 to 1500 km across extend to
a distance of 10,000 km upward from the chromosphere.
PH507
Astrophysics
Professor Glenn White
139
In the spicules, which are best observed in H, gas is rising at about 20 to 25 km/s.
Although spicules occupy less than 1% of the Sun’s surface area and have lifetimes of
15 min or less, they probably play a significant role in the mass balance of the
chromosphere, corona, and solar wind, and occur in regions of enhanced magnetic fields
PH507
Astrophysics
Professor Glenn White
140
Solar spicules, short-lived narrow jets of gas that typically last mere minutes, can be seen sprouting up
from the solar chromosphere in this H alpha image of the Sun. The spicules are the thin, dark, spikelike
regions. They appear dark against the face of the Sun because they are cooler than the solar photosphere
The Corona

At solar eclipses, the corona appears as a pearly white halo extending far from the
Sun’s limb. A brighter inner halo hugs the solar limb, and coronal streamers extend
far into space.
PH507
Astrophysics
Professor Glenn White
141
The change of gas temperature in the lower solar atmosphere is dramatic. The minimum temperature
marks the outer edge of the chromosphere. Beyond that, the temperature rises sharply in the transition
zone, finally levelling off at over 1,000,000 K in the corona.
Why the Sun's atmosphere is shockingly hot
Nobody would have guessed it: the atmosphere of the Sun is much, much
hotter than its surface. By more than one million degrees Centigrade in fact. Since 1939,
when scientists first determined the temperature of the solar atmosphere - known as the
corona - they were unable to come up with a convincing theory of why it greatly
exceeded the "mere" 6000° of the visible surface.
Nearly 60 years later, SOHO solved the mystery. Once again the MDI acronym (short
for Michelson Doppler Imager) is the code needed to decipher the secrets of our nearest
star. With MDI, scientists gathered data showing that huge numbers of small, closely
intertwined magnetic loops continuously emerge from the Sun's visible surface, clash
with one another and dissolve within 40 hours.
PH507
Astrophysics
Professor Glenn White
142
The loops seem to form a tight pattern that scientists call a magnetic carpet. Their
interaction generates electrical and magnetic short-circuits and releases enough energy
to heat the corona to temperatures hundreds of times higher than those of the solar
surface.
The solar atmosphere is permeated with magnetic fields, generated by electrified gas, or
plasma, churning violently beneath the visible surface. Solar astronomers have long
observed loops of plasma, called coronal loops, which appear to trace out the corona's
complex magnetic-field structure, much as iron filings reveal the invisible magnetic
field surrounding a magnet. Coronal loops come in various sizes, but most are
enormous, capable of spanning several Earths.
Solar astronomers know the particles comprising plasma are electrically charged and
feel magnetic forces. Thus, scientists thought coronal loops were tubes of plasma
trapped by and enclosed in the arch-shaped magnetic fields of the corona. The coronal
loops have puzzling features, however. The strong pull of solar gravity led astronomers
to believe that the plasma should be dense at the bases of the loop and thin at the top,
just as the Earth's gravity pulls our atmosphere close to the surface, causing it to thin
with increasing altitude. In fact, coronal loops seem to be about the same density
throughout their height, even though some of them extend several hundred thousand
miles (over a million kilometers) above the solar surface.
PH507
Astrophysics
Professor Glenn White
143
Coronal loops come in a variety of shapes and sizes, but most are enormous, capable of spanning several
Earth's. Photo: NASA and the TRACE team
The Visible Corona
Coronal continuum radiation has a temperature of 1 to 2 x 106 K.
PH507
Astrophysics
Professor Glenn White
144
Photograph of the solar corona during the July, 1991 eclipse, at the peak of the sunspot cycle. At
these times, the corona is much less regular and much more extended than at sunspot
minimum. Astronomers believe that coronal heating is caused by surface activity on the Sun.
The changing shape and size of the corona are the direct result of variations in prominence and
flare activity over the course of the solar cycle.
PH507
Astrophysics
Professor Glenn White
145
PH507
Astrophysics
Professor Glenn White
146
Line Emission
Forbidden Lines
Superimposed on the visible coronal continuum are some emission lines that
were unidentified until about 1942, when W. Grotrian of Germany and B. Edlén
of Sweden interpreted them. (They were long called “coronium” lines, for they
did not fit any known atomic transition). The two strongest lines are the green
line of Fe XIV (530.3 nm) and the red line of Fe X (637.4 nm); both are forbidden
lines.
Two significant obstacles hindered the identification of the coronal emission
lines: (1) the responsible transitions are forbidden, and (2) the temperatures of
the corona are unexpectedly high. In quantum mechanics, certain energy levels
of an atom are metastable because downward transitions from such levels are
strongly prohibited. While an ordinary permitted transition takes place in about
10-8 s, these metastable levels may persist for seconds or even days before a
forbidden transition occurs. In most laboratory and astrophysical situations,
gas densities are so high that collisional de-excitation empties metastable levels
very rapidly - there is just not enough time for a forbidden transition to take
place. In the near vacuum of the corona, however, metastable levels populated
by either photospheric radiation or collisions can decay and forbidden emission
features are formed.
PH507
Astrophysics
Professor Glenn White
147
Solar Activity
The Solar Cycle
Sunspots




Sunspots are photospheric phenomena that appear darker than the
surrounding photo-sphere (at about 5800 K) because they are cooler
(sunspot continuum temperatures are about 3800 K, and sunspot excitation
temperatures are about 3900 K). The darkest, central part (with the
temperatures just mentioned) is termed the umbra; the umbra is usually
surrounded by the lighter penumbra with its radial filamentary structure.
The most important characteristic of a sunspot is its magnetic field. Typical
field strengths are near 0.1 T, but fields as strong as 0.4 T have been
measured. Related to the magnetic field is a horizontal flow of gas in the
sunspot penumbra: gas moves out along the lower filaments and inward
along the higher filaments (at speeds up to 6 km/s).
A given sunspot has an associated magnetic polarity. Lines of magnetic
force diverge from a north magnetic pole and converge at a south pole; you
are familiar with this characteristic of bar magnets and our Earth. Two
sunspots of complementary polarity are generally found together in a
bipolar spot group.
The magnetic field in a typical sunspot is about 1000 times greater than the
field in neighbouring, undisturbed photospheric regions (which is itself
several times stronger than the Earth's field).
PH507
Astrophysics
Professor Glenn White
148
PH507
Astrophysics
Professor Glenn White
149
(a) The looplike structure of this prominence clearly reveals the magnetic field lines connecting the two
members of a sunspot pair. (b) This image of a particularly large solar prominence was observed by
ultraviolet detectors aboard the Skylab space station in 1979
Sunspot Numbers
In 1610, shortly after viewing the sun with his new telescope, Galileo Galilei made the
first European observations of Sunspots. Daily observations were started at the Zurich
Observatory in 1749 and with the addition of other observatories continuous
observations were obtained starting in 1849. The sunspot number is calculated by first
counting the number of sunspot groups and then the number of individual sunspots. The
"sunspot number" is then given by the sum of the number of individual sunspots and ten
times the number of groups. Since most sunspot groups have, on average, about ten
spots, this formula for counting sunspots gives reliable numbers even when the
observing conditions are less than ideal and small spots are hard to see. Monthly
averages (updated monthly) of the sunspot numbers (25 kb GIF image), (37 kb
postscript file), (62 kb text file) show that the number of sunspots visible on the sun
waxes and wanes with an approximate 11-year cycle.
PH507
Astrophysics
Professor Glenn White
150
PH507
Astrophysics
Professor Glenn White
151
Positional Variation
Detailed observations of sunspots have been obtained by the Royal Greenwich
Observatory since 1874. These observations include information on the sizes and
positions of sunspots as well as their numbers. These data show that sunspots do not
appear at random over the surface of the sun but are concentrated in two latitude bands
on either side of the equator. A butterfly diagram (142 kb GIF image) (610 kb postscript
file) (updated monthly) showing the positions of the spots for each rotation of the sun
since May 1874 shows that these bands first form at mid-latitudes, widen, and then
move toward the equator as each cycle progresses. The cycles overlap at the time of
sunspot cycle minimum with old cycle spots near the equator and new cycle spots at
high latitudes. An alternate version of this diagram with different colors for even and
odd numbered cycles is available as a 610kb postscript file.
The distribution of sunspots in solar latitude varies in a characteristic way
during the 11-year sunspot-number cycle. Sunspots tend to reside at high
latitudes (±35˚) at the start of a cycle.
PH507
Astrophysics
Professor Glenn White
152

Most spots are near ±15 at maximum, and the few spots at the end of the
cycle cluster near ±8. Very few sunspots are ever found at latitudes greater
than ±40. The lifetime of a sunspot ranges from a few days (small spots) to
months (large spots). In fact, a sunspot dies at the same latitude where it
was born (a characteristic that permits us to determine the solar rotation).
What takes place is this: as the cycle progresses, new spots appear at ever
lower latitudes. The first high-latitude spots of a cycle appear even before
the last low-latitude spots of the previous cycle have vanished.

So the sunspots follow active latitude belts during the course of a cycle. Less
clear, but just as tantalising, large active regions and spot groups seem to fall
into preferred longitudes during a cycle - active longitude belts.
Concentrations of magnetic fields appear to persist below the photosphere,
so that new spot groups arise from about the same locations as previous
ones. The active longitude belts sometimes appear about 180 apart, though
not all the time; frequently they are not symmetrical across the equator.
PH507
Astrophysics
Professor Glenn White
153
They reveal a different persistent pattern in the magnetic field structure in
the convective zone.

Predictions of sunspot numbers are now routinely made – an excellent web
site is www.spaceweather.com
The Babcock Model
PH507
Astrophysics
Professor Glenn White
154
Solar Rotation

By observing sunspots with his telescope, Galileo determined that the Sun’s
surface rotates eastward (synodically) in about one month. Today, the same
method is used in the sunspot zone (other methods, such as Doppler shifts,
are necessary above latitude ±40), and we know that the Sun rotates
differentially. That is, the rotation period is shorter at the solar equator
(about 25d) than at higher latitudes (about 27d at 40 and 30d at 70).

To sum up, sunspots reveal on a small scale the complexity and variability
of solar magnetic phenomena. The parts of a sunspot are all transient
magnetic structures - basically a sheaf of magnetic flux tubes filling the
umbra and penumbra and fanning out above them.
PH507
Astrophysics
Professor Glenn White
155
The Sun: A Model Star

Our Sun is the nearest star. The fascinating properties and phenomena of
the solar surface layers are easily observed and have been studied intensely.
Unfortunately, models for understanding solar phenomena have not kept
pace with such detailed data. Because the Sun is a fairly typical star and
because it is the only star that spans a large angular diameter as seen from
the Earth, the discussion here serves as the physical basis to investigate the
other stars.
Sun
Mass (1024 kg)
1,989,100.
6
3
2
GM (x 10 km /s )
132,712.
Volume (1012 km3)
1,412,000.
Volumetric mean radius (km) 696,000.
Mean density (kg/m3)
1408.
Surface gravity (eq.) (m/s2) 274.0
Escape velocity (km/s)
617.7
Ellipticity
0.00005
Moment of inertia (I/MR2) 0.059
Visual magnitude V(1,0) -26.74
Absolute magnitude
Luminosity (1024 J/s)
Mass conversion rate (106 kg/s)
Mean energy production (10-3 J/kg)
Surface emission (106 J/m2s)
Spectral type
Model values at center of Sun:
Central pressure:
Central temperature:
Central density:
Earth
5.9736
0.3986
1.083
6371.
5515.
9.78
11.2
0.0034
0.3308
-3.86
+4.83
384.6
4300.
0.1937
63.29
G2 V
(Sun/Earth)
333,000.
333,000.
1,304,000.
109.2
0.255
28.0
55.2
0.015
0.178
-
2.477 x 1011 bar
1.571 x 107 K
1.622 x 105 kg/m3
The Structure of the Sun

1 AU from the Earth with a radius of 6.96 x 105 km (109Ro) and a mass of 1.99 x
1030 kg (333,000M), and luminosity, (rate of total radiative energy output) of 3.86
x 1026 W.

The average density of the Sun is only 1400 kg/m3 - consistent with a composition
of mostly gaseous hydrogen and helium.

From its angular size of about 0.5° and its distance of almost 150 million kilometres,
we determine that its diameter is 1,392,000 kilometres (109 Earth diameters and
almost 10 times the size of the largest planet, Jupiter).

All of the planets orbit the Sun because of its enormous gravity. It has about 333,000
times the Earth's mass and is over 1,000 times as massive as Jupiter.
PH507

Astrophysics
Professor Glenn White
156
The Sun is made of 94% Hydrogen, 6% Helium, - the other elements make up just
0.13% (the three most abundant ‘metals’ Oxygen, Carbon, and Nitrogen make up
0.11%).
PH507
Astrophysics
Professor Glenn White
157
This spectacular image of the Sun was made by capturing X rays emitted by our star's most active
regions. It was taken by a camera on a rocket lofted shortly before the total solar eclipse of July 1991.
(Note the shadow of the Moon approaching from the west, at top.) The brightest regions in all these
images have temperatures of about 3 million Kelvin.
Lectures Week 7: Star Formation
Lectures Week 8: Theory of exoplanets
PH507
Astrophysics
Professor Glenn White
158
Debris Disks
Debris disks are remnant accretion disks with little or no gas left (just dust &
rocks), outflow has stopped, the star is visible.
Theory: Gas disperses, “planetesimals” form (100 km diameter rocks), collide &
stick together due to gravity forming protoplanets (Wetherill & Inaba 2000).
Protoplanets interact with dust disks: tidal torques cause planets to migrate inward
toward their host stars. Estimated migration time ~ 2 x 105 yrs for Earth-size
planet at 5 AU (Hayashi et al. 1985).
Perturbations caused by gas giants may spawn smaller planets (Armitage 2000):
Start with a stable disk
around central star.
Jupiter-sized planet forms
& clears gap in gas disk.
Planet accretes along spiral Disk fragments into more
arms, arms become unstable. planetary mass objects.
Debris Disks – Outer Disk
AB Aurigae outer
debris disk nearly
face on – see
structure &
condensations
(possible protoplanet formation
sites? Very far
from star) .
(Grady et al. 1999)
Two obvious differences between the exoplanets and the giant planets in the Solar
System:
• Existence of planets at small orbital radii, where our previous theory suggested
formation was very difficult.
• Substantial eccentricity of many of the orbits. No clear answers to either of these
surprises, but lots of ideas...
PH507
Astrophysics
Professor Glenn White
159
Most conservative possibility:
• Planet formation in these extrasolar systems was via the core accretion model ie same as dominant theory for the Solar System
• Subsequent orbital evolution modified the planet orbits to make them closer to
the star and / or more eccentric
We will focus on this option. However, more radical options in which exoplanets form
from gravitational instability are also possible.
Orbital evolution – migration
Need a migration mechanism that can move giant planets from formation at ~5 AU to a
range of radii from 0.04 AU upwards.
Three theories have been proposed:
• Gas disc migration: planet forms within a protoplanetary disc and is swept
inwards with the gas as the disc evolves and material accretes onto the star. The
most popular theory, as by definition gas must have been present when gas giants
form.
• Planetesimal disc migration: as above, but planet interacts with a disc of rocks
rather than gas. Planet ejects the rocks, loses energy, and moves inwards.
• Planet scattering: several massive planets form – subsequent chaotic orbital
interactions lead to some (most) being ejected with the survivors moving inwards
as above.
Gas disc migration
Planet interacts with gas in the disc via gravitational force
Strong interactions at resonances, eg where disc = nplanet, with n an integer.
For example the 2:1 resonance, where n = 2, which lies at 2-2/3 rp = 0.63 rp
Resonances at r < rp: Disc gas has greater angular velocity than planet. Loses
angular momentum to planet -> moves inwards
Resonances at r > rp: Disc gas has smaller angular velocity than planet. Gains
angular momentum from planet -> moves outwards
PH507
Astrophysics
Professor Glenn White
160
Interaction tends to clear gas away from location of planet, Result:
planet orbits in a gap largely cleared of gas and dust
This process occurs for massive planets (~ Jupiter mass) only - Earth mass planets
remain embedded in the gas though gravitational torques can be very important source
of orbital evolution for them too.
How does this lead to migration?
PH507
Astrophysics
Professor Glenn White
161
Angular momentum transport in the gas (viscosity) tries to close the gap (recall,
diffusive evolution of an accretion disc).
Gravitational torques from planet try to open gap wider.
Gap edge set by a balance:
-> Internal viscous torque = planetary torque
Planet acts as a angular momentum ‘bridge’:
• Inside gap, outward angular momentum flux transported by viscosity within disc
• At gap edge, flux transferred to planet via gravitational torques, then outward
again to outer disc
• Outside gap, viscosity again operative
Typically, gap extends to around the 2:1 resonances interior and exterior to the planet’s
orbit.
As disc evolves, planet moves within gap like a fluid element in the disc - ie usually
inwards.
Inward migration time ~ tn = R2 / n ~ few x 105 yr from 5 AU.
Mechanism can bring planets in to the hot Jupiter regime.
Believe that this mechanism is quantitatively consistent with the distribution of
exoplanets at different orbital radii – though the error bars are still very large!
PH507
Astrophysics
Professor Glenn White
162
Points: data Curve: theoretical migration model
Eccentricity
Substantial eccentricities of many exoplanets orbits do not have
completely satisfactory explanation. Possibilities:
(1) Scattering among several massive planets
Assumption: planet formation often produces a multiple system
which is unstable over long timescales:
• Chaotic evolution of a, e (especially e)
• Orbit crossing
• Eventual close encounters -> ejections
• Eccentricity for survivors
Advantages:
• Given enough planets, close together, definitely works
• Can produce very eccentric planets cf e=0.92 example discovered
• Some (stable) multiple systems are already known
Disadvantages:
• Requires planets to form very close together.
For two planets, with mass ratios to star q1 = m1 / M*, q2,
approximate stability boundary for separations:
> 2.4 (q1 + q2)1/3
System is unstable on short timescale if: a2 < a1 (1 + )
For Jupiter mass planets, (1 + ) ~ 1.3
Is it plausible that unstable systems formed in a large fraction
of extrasolar planetary systems?
• Collisions may produce too many low e systems
(2) Disc interactions
Assumption: gravitational interaction with disc generates eccentricity
PH507
Astrophysics
Professor Glenn White
163
Advantages:
• Same mechanism as invoked for migration
• Works for just one planet
• Theoretically, interaction is expected to increase eccentricity
if dominated by 3:1 resonance
Disadvantages:
• Gap is only expected to reach the 3:1 resonance for brown dwarf type
masses, not massive planets. Smaller gaps definitely tend to circularize the
orbit instead.
• Seems unlikely to give very large eccentricities
(3) Protoplanetary disc itself is eccentric
Assumption: why should discs have circular orbits anyway?
Eccentric disc -> eccentric planet?
Not yet explored in much depth. A possibility, though again seems unlikely to lead to
extreme eccentricities.
Scattering theory is currently most popular, possibly augmented
by interactions with other planets in resonant orbits.
PH507
Astrophysics
Professor Glenn White
164