Download Modeling and Inversion of Dispersion Curves of Surface Waves in

Document related concepts

Shear wave splitting wikipedia , lookup

Magnetotellurics wikipedia , lookup

Earthscope wikipedia , lookup

Seismic inversion wikipedia , lookup

Surface wave inversion wikipedia , lookup

Transcript
University of Nevada, Reno
Dissertation Title
Modeling and Inversion of Dispersion Curves of Surface
Waves in Shallow Site Investigations
A dissertation submitted on partial fulfillment of the
requirement for the degree of Doctor of Philosophy
in Geophysics
By
Donghong Pei
Dr John N. Louie, Dissertation Advisor
August, 2007
Copyright by Donghong Pei 2007
All Rights Reserved
Abstract
The shallow S-wave velocity structure is very important for the seismic
design of engineered structures and facilities, seismic hazard evaluation of a region,
comprehensive earthquake preparedness, development of the national seismic hazard
map, and seismic-resistant design of buildings. The use of surface waves for the
characterization of the shallow subsurface involves three steps: a) acquisition of highfrequency broadband seismic surface wave records generated either by active sources
or passive ambient noise (microtremors or microseisms), b) extraction of phase
dispersion curves from the recorded seismic signals, and c) derivation of S-wave
velocity profiles either using inversion algorithms or manually error and trial forward
modeling. The first two steps have been successfully achieved by several techniques.
However, the third step (inversion) needs more improvements. An accurate and
automatic inversion method is needed to generate shallow S-wave velocity profiles.
With the achievement of a fast forward modeling method, this study focuses
on the inversion of phase velocity dispersion curves of surface waves contained in
ambient seismic noise for a one dimensional, flat-layered S-wave velocity structure.
For the forward modeling, we present a new more efficient algorithm, called
the fast generalized R/T (reflection and transmission) coefficient method, to calculate
the phase velocity of surface waves for a layered earth model. The fast method is
based on but is more efficient than the traditional ones. The improvements by this
study include 1) computation of the generalized reflection and transmission
coefficients without calculation of the modified reflection and transmission
coefficients; 2) presenting an analytic solution for the inverse of the 4X4 layer matrix
E. Compared with traditional R/T methods, the fast generalized R/T coefficient
method, when applied on Rayleigh waves, significantly improves the speed of
computation, cutting the computational time at least by half while keeping the
stability of the traditional R/T method.
On inversion study, the dissertation explored a linear inversion technique, a
non-linear inversion method, and a joint method on the dispersion data of surface
waves. Chapter 3 explores the Occam’s linear inversion technique with a higher-order
Tikhonov regulization. The blind tests on a suite of nine synthetic models and two
field data sets show that the final model is heavily influenced by a) the initial model
(in terms of the number of layers and the initial S-wave velocity of each layer); b) the
minimum and the maximum depth of profiles; c) the number of dispersion picks; d)
the frequency density of dispersion picks; and e) other noise.
To minimize this initial-model-dependence of the Occam’s inversion, the nonlinear simulated annealing (SA) inversion technique is proposed in Chapter 4.
Following previous developments I modified the SA inversion yielding onedimensional shallow S-wave velocity profiles from high frequency fundamentalmode Rayleigh dispersion curves and validated the inversion with blind tests. Unlike
previous applications of SA, this study draws random numbers from a standard
Gaussian distribution. The numbers simultaneously perturb both S-wave velocities
and layer thickness of models. The annealing temperature is gradually decreased
following a polynomial-time cooling schedule. Phase velocities are calculated using
ii
the reflectivity-transmission method. The reliability of the model resulting from our
implementation is evaluated by statistically calculating the expected values of model
parameters and their covariance matrices. Blind tests on the same data sets as these in
Chapter 3 show that the SA implementation works well for S-wave velocity inversion
of dispersion curves from high-frequency fundamental-mode Rayleigh waves. Blind
estimates of layer S-wave velocities fall within one standard deviation of the
velocities of the original synthetic models in 78% of cases. A hybrid method is also
explored in Chapter 4. The hybrid idea is that the models obtained by the SA can
used as input to the Occam’s inversion. Tests show that the hybrid method does not
always provide better results.
Dispersion curves of fundamental mode Rayleigh waves alone do not contain
sufficient information to uniquely determine a model. The velocity-depth trade-off
gives rise to model non-uniqueness. A joint SA inversion method is proposed in
Chapter 5 using the fundamental-mode Love wave dispersion curves to constrain the
Rayleigh wave inversion by the SA optimization. The SA technique described in
Chapter 4 is applied on the dispersion data of both fundamental-mode Love and
Rayleigh waves with equal weighting factor. Three synthetic tests show that Love
wave constraints result in significant improvement of inverted model in terms of
resolution of low velocity zones and high velocity contrasts.
iii
TABLE OF CONTENTS
Abstract ………………………..…..………………...…………………….………… i
Table of Contents ………………………………………………….………………... iv
List of Figures ……………………………………..……….……………….…...…. vii
List of Tables ……………………………………………………….…….………... xii
Acknowledgments ….…..…………………………....…………….………… ...… xiii
Chapter 1 Introduction…………………………………..…………...………..…. 1
1.1 Motivation and research objectives ..………..………………………..…..1
1.2 Ambient seismic noise …………………….…………….………..……... 4
1.3 Surface wave properties …………....………………………..…….…….. 9
1.4 Seismic acquisition techniques used in shallow site investigations ….…. 13
Chapter 2 Forward modeling of surface-wave dispersion ………..…….…....… 27
2.1 Motion-stress vector ..…………………..……………….….….....….…. 29
2.2 Reflection and transmission coefficients ..…………...….....……..….…. 34
2.3 Plane waves in a layered model ..………..……….……….………….…. 35
2.4 Phase velocity of Love waves …………..……………….…………...…. 39
2.5 Phase velocity of Rayleigh waves …………..………….……..…..….…. 42
2.6 Improvements on calculation of phase velocity of Rayleigh waves …..…. 46
2.7 Numerical examples on dispersion calculation of Rayleigh waves …...…. 48
2.8 Improvements on calculation of phase velocity of Love waves .….….…. 51
iv
2.9 Numerical examples on Love waves .……..……….……….…….…..…. 52
2.10 Group velocity calculation of surface waves ….………………....….…. 53
2.11 Published calculation codes .………………….……………….....….…. 54
2.12 RTgen .……………………………………..…………………....…..…. 55
Chapter 3 Linearized inversion of surface-wave dispersion ..……….…..…..… 61
3.1 Linear model estimation …………….…………..……………...…..…… 62
3.2 Solving a linear system ….………………………………………….…… 64
3.3 Regularization ….……………………………………….…….…….…… 66
3.4 Singular value decomposition (SVD) ……………………..…….….…… 68
3.5 Proposed linear inversion algorithm ………………………….…….…… 69
3.6 Model appraisal method ….…………………………………..……..…… 73
3.7 Test data sets and numerical tests ….………………………..…..….…… 75
3.8 Initial model dependence ….…………………………………..…….…… 81
3.9 The effect of minimum and maximum depth ………………..…..….…… 82
3.10 The effect of number of dispersion picks …….……………..….….…… 83
3.11 The effect of frequency density of dispersion picks ………..….….…… 84
3.12 The effect of the weighting matrix ……………………………..….…… 85
Chapter 4 Non-linear inversion of surface-wave dispersion based on simulated
annealing optimization
…..…………..……………….…….….…… 99
4.1 Global searching optimization …………………………..…….........….. 100
v
4.2 Simulated annealing optimization method …………………..……..….. 103
4.3 Model appraisal …….………………………………………….….….... 108
4.4 Inversion results ………………..…………….………….…..….….…... 109
4.5 Comparison with linearized inversion results ………….…..……..…..... 113
4.6 Difference from previous implementation ……………….…..…….…... 114
4.7 A hybrid inversion approach: simulated annealing followed by the
linearized inversion ……………………………………….……….….... 116
Chapter 5 A joint SA inversion using both Rayleigh and Love surface-wave
dispersions ………….……….…………………………...…………. 128
5.1 Equalized cost function …..……………….………………….………… 130
5.2 Synthetic tests .……………….…………………………..……….…..... 132
5.3 Inversion results .……………….………………………………….…..... 132
Chapter 6 Summary and suggestions …….……………………………………. 145
6.1 Summary …..………………………………….………………...……… 145
6.2 Suggestions …….……………….……………………………..….…..... 149
References Cited ………………………………………………………...…….… 152
Appendix A: Matrices for Rayleigh Waves ……….………………….….…...… 163
Appendix B: Matrices for Love waves …………….………………….………… 165
vi
LIST OF FIGURES
Figure 1.1 Three steps involved in utilizing dispersion curves of surface waves for
imaging geologic profiles ……………….………….….…….……..…. 21
Figure 1.2 The acceleration power spectrum of microtremors recorded at 75
permanent seismic observatories throughout the world …………..….. 22
Figure 1.3 Body wave motion ……………………………………….….….…..…. 23
Figure 1.4 Particle motion and amplitude of Rayleigh waves ………….……...…. 23
Figure 1.5 Surface wave dispersion ………………………..……...…..…….……. 24
Figure 1.6 Phase velocities vs. frequencies ………………………..….….……..... 24
Figure 1.7. Modes of surface waves ……..…………………………....…..……..... 25
Figure 1.8 Dispersion curves of higher-mode surface waves ...………….…..…..... 25
Figure 1.9 A typical ReMi field configuration .....…………………….….…..…… 26
Figure 1.10 A typical ReMi analysis …..…………………………....…....…..….... 26
Figure 2.1 Illustration of coefficients of reflection and transmission due to SH
incident down (a) and up (b) to an interface ….…..……...…...…….… 56
Figure 2.2 Illustration of coefficients of reflection and transmission due to SV
incident down (a) and up (b) to an interface and P incident down (c) and
up (d) to an interface …..………………………………………....….. 56
Figure 2.3 Configuration and coordinate system of a multiple-layered half-space
…………………………………………………………..…………...... 57
vii
Figure 2.4 Phase velocity dispersion curves of the fundamental-mode Rayleigh waves
for large scale models 1, 2, and 3 (a) and small scale models 4, 5, 6 (b)
…………………………………………………….…………………… 58
Figure 2.5 The normalized errors between phase velocities calculated by RTgen and
CPS of large scale models 1, 2, and 3 (a) and small scale models 4, 5, 6
(b) ………………..…..………………………………………..….….. 58
Figure 2.6 Phase velocity dispersion curves of Rayleigh waves for the Gutenberg
model ………………………………………………...…...………….. 59
Figure 2.7 Phase velocity dispersion curves of Rayleigh waves for model 4 ….….. 59
Figure 2.8 Computational time against number of layers in models ………………. 60
Figure 2.9 Phase velocity dispersion curves of Love waves for the Gutenberg model.
……..……………………………………………………………...……60
Figure 2.10 Phase velocity dispersion curves of Love waves for model 4 ………... 60
Figure 3.1 The inverse problem viewed as a combination of an estimation problem
plus appraisal problem .……………...…..…..……………………… 86
Figure 3.2 Flow chart showing the Occam’s inversion procedure …....……..…… 87
Figure 3.3 A typical synthetic seismic record with strong Rayleigh waves ..…….. 88
Figure 3.4 Slowness-frequency spectrum (p-f) image with ReMi dispersion picks of a
typical synthetic seismic record …………………...………………..... 88
Figure 3.5 Linearized inverted S-wave velocities against the original synthetic models
for nine synthetic data sets. …………….………….….…….……..…. 89
viii
Figure 3.6 The depth-averaged velocities in m/s against the known values for
Occam’s inverted models.………………………………….……..…… 92
Figure 3.7 Dispersion picks on the slowness-frequency spectrum (p-f) images of
Newhall (left) and Coyote Creek (right) data
…………….…..…..…. 93
Figure 3.8 Linearized inverted profiles of S-wave velocity against the OYO
suspension S-wave logs of Newhall and CCOC data ……………..…. 94
Figure 3.9 Inverted S-wave velocity against the OYO S-wave log of Newhall data,
showing the effect of the number of layers ………..….…….….……. 95
Figure 3.10 Inverted S-wave velocity against the OYO S-wave log of Newhall data,
showing the effect of the layer S-wave velocity ……..….….….…..... 96
Figure 3.11 Inverted S-wave velocity against the OYO S-wave log of Newhall data,
showing the effect of the maximum depth ……………....….….…..... 96
Figure 3.12 Inverted S-wave velocity against the OYO S-wave log of Newhall data,
showing the effect of the number of the picks ………………....…..... 97
Figure 3.13 Inverted S-wave velocity against the OYO S-wave log of Newhall data,
showing the effect of the frequency density of the picks ….…...….… 97
Figure 3.14 Effects of the weighting matrix on the inverted models ....……..….... 98
Figure 4.1 Multimodality of the surface wave dispersion curve inversion problem
……..……..……………………………………………………...…… 119
Figure 4.2 A cartoon showing an annealing process …………...…………....….. 120
Figure 4.3 A flowchart showing the annealing process on inversion of dispersion
curve of surface waves ……………………………….…………...... 120
ix
Figure 4.4 A cartoon showing the role of the conditional acceptance ……...…… 121
Figure 4.5 Inverted profiles with standard deviation of S-wave velocity against the
original synthetic models …………...…..……………………….….. 122
Figure 4.6 The depth-averaged velocities in m/s against the known values for SA
inverted models …………………………………………………….. 123
Figure 4.7 Inverted profiles with standard deviation of layer thickness against the
original synthetic models ……..…………………………...……….. 124
Figure 4.8 Comparison of the OYO suspension S-wave velocity logs and the inverted
models for the Newhall (left) and the Coyote Creek data (right) .…. 125
Figure 4.9 Calculated dispersion curves (lines) of fundamental-mode Rayleigh waves
plotted atop the ReMi dispersion picks (circles) for the Newhall (left) and
the Coyote Creek data (right) ……………………..……….…...……125
Figure 4.10 The flow chart of the hybrid inversion algorithm …………...……... 126
Figure 4.11 Final inverted models for Newhall data using the simulated annealing
method (left) and the hybrid inversion method (right) ……...……… 127
Figure 4.12 Final inverted models for CCOC data using the simulated annealing
method (left) and the hybrid inversion method (right) ...……..…….. 127
Figure 5.1 Two different models with same number of layers (left) and corresponding
dispersion curves (right) …………………………………...……….. 136
Figure 5.2 Two different models with different number of layers (left) and
corresponding dispersion curves (right) …………………………..... 136
x
Figure 5.3 A theoretical distribution of the value of the cost function for the joint
inversion
………………………………...…..………...…..…….… 137
Figure 5.4 SA inversion results of the data N102 …………………………....….. 138
Figure 5.5 Joint inversion results of the data N102 ………………….………...... 138
Figure 5.6 The distribution of the value of cost functions for joint inversion on N102
………………………………………………………..……….……… 139
Figure 5.7 The depth-averaged velocities in m/s against the known values for both SA
and joint inverted models ………..…..………………..….….….….. 140
Figure 5.8 SA inversion results of the data N103 ……………………………….. 141
Figure 5.9 Joint inversion results of the data N103 ……………….....………….. 141
Figure 5.10 The distribution of the value of cost functions of the joint inversion on
N103 ………………………………………………...……...………. 142
Figure 5.11 SA inversion results of the data N104 ……………………….....……143
Figure 5.12 Joint inversion results of the data N104 ……………………..……... 143
Figure 5.13 The distribution of the value of cost functions of joint inversion on N104
………………………...……………...…..…………………...……… 144
xi
LIST OF TABLES
Table 1.1 Summary of characteristics of seismic ambient noise ..….………......… 8
Table 1.2. Seismic acquisition techniques used in shallow site investigations …… 14
Table 2.1 Methods of forward modeling of dispersion curves …………….……… 27
Table 2.2 Definition of elastic constants ………………………………....……….. 31
Table 2.3 Relationship between elastic constants ………………………..……….. 31
Table 2.4 Harmonic wave parameters ……………………………….……….…… 81
Table 2.5 Gutenberg’s layered model of continental structure ..….………....….… 49
Table 2.6 Test models at crustal scale ……………………………………….….… 49
Table 2.7 Test models at local site scale …………………………………..….…… 50
Table 3.1 Linearized inversion methods of surface waves used by major research
groups ………………………………………………………..………….. 70
Table 3.2 Sources of uncertainty in surface wave dispersion measurements …….. 73
Table 3.3. Depth-averaged velocities in m/s for Occam’s inverted models and
percentage difference from known profiles in parentheses ………..…… 79
Table 4.1 SA-inverted depth-averaged velocities in m/s and percentage difference
from known profiles in parentheses ..….…………………………....… 111
Table 4.2 Implementation difference of SA from previous study ………….….… 115
Table 5.1 Joint-inverted depth-averaged velocities in m/s and percentage difference
from known profiles in parentheses…………….……………………… 133
xii
ACKNOWLEDGEMENTS
I thank my advisory committee Drs. John Louie, John Anderson, James Brune,
Satish Pullammanappallil, and Ilya Zaliapin for their thorough reviews that
significantly improved this manuscript. I thank my academic advisor Dr. John Louie
for his financial support at the beginning, critical academic encouragements for
challenges, and positive writing guidance for papers and this dissertation. I am
grateful for the valuable learning experiences I gained during the numerous lab
exercises and fieldworks from him. Without his help, I can’t image how I survive at
UNR. The courses taught by John Anderson and James Brune went a long way in
furthering my understanding of geophysical inversion and earthquake seismology.
This dissertation is financially supported by Optim Inc. I want to give my
thanks to Satish Pullammanappallil and Bill Honjas of Optim Inc. The simulated
annealing methods developed in this dissertation are a direct influence of Dr.
Pullammanappallil’s contribution. He provided his own code and value advice to
keep me on the right track. Most importantly, he continuously funds this study.
Without funding, I would not have been able to pursue the topic that interested me
most.
I spent four years in the Nevada Seismological Lab. I thank Drs. Ken Smith,
David Von Seggern, Glenn Biasi, Rasool Anooshehpoor, and Gary Oppliger for their
knowledge of seismology from which I benefit a lot. During my staying, I was
fortunate to come in contact with students who displayed enthusiasm for research and
xiii
unselfish sharing of scientific ideas. They are Aasha Pancha, James Scott, and
Michelle Heimgartner. I am grateful for the friendship and love shared by all of my
friends at UNR and the department. They help me smoothly settle down and make my
staying in Reno much more enjoyable.
Most of all, I thank my wife, Xin Yu, for her love, encouragement, sacrifice,
and moral support throughout this study; my parents, Pei Yunji and Tao Wanzhi, for
their never-failed-support throughout my education; and my parents-in-law, Fulai Yu
and Jinwen Li, for their caring for my baby Steven Pei during the study.
xiv
Chapter 1 Introduction
1.1 Motivation and research objectives
The factors influencing seismic ground motion were divided into source, path,
and site effects, a distinction that has proven useful for understanding and predicting
seismic shaking (e.g. Aki, 1993). The properties of the geological materials beneath a
site (site condition) have a major impact on the ground motion by modifying the
amplitude, phase, duration, and shape of seismic waves. Historical earthquakes have
taught us that damage is often significantly greater on unconsolidated soil than on rock
sites when the surface structure is more than a few kilometers from the earthquake
source (e.g., the Mw 6.9 1989 Loma Prieta earthquake in California, Borcherdt and
Glassmoyer, 1994). Thus, the characterization of the medium underlying a site is one of
the most important tasks in seismic hazard evaluation of a region, comprehensive
earthquake preparedness, development of the national seismic hazard map, and seismicresistant design of buildings (Field et al., 1992).
The use of surface waves (ground roll) for the characterization of the shallow
subsurface has become of growing interest to geotechnical engineers and geophysicists.
In a vertically heterogeneous medium, the phase velocity of surface waves is a function
of frequency (called dispersion curves). The curve is a function of shear wave (S-wave)
velocity, layer thickness, density, and compressional wave (P-wave) velocity of each
geological layer, listed in a decreasing order of priority according to Xia et al. (1999). If
the dispersion curves are measured experimentally, it is in principle possible to obtain
the mechanical parameters of the medium from the dispersion curves.
1
In fact, the dispersion curve has been employed for imaging geological profiles
in a variety of applications for several reasons. First, it is a robust property that can be
quite easily observed without contamination by other wavefields. Second, various
forward modeling techniques exist to generate the dispersion curves of surface waves
rapidly and accurately for a layered geological structure. Finally, compared to the
inversion of waveforms, the complexity of the inversion of dispersion curves is greatly
reduced.
Three steps are involved in utilizing dispersion curves of surface waves for
imaging geological profiles for seismic hazard assessment (Fig. 1.1):
1) acquire high-frequency (>=1 Hz) broadband ground roll,
2) create efficient and accurate algorithms organized in a basic data processing
sequence designed to extract surface wave dispersion curves from the ground roll, and
3) develop stable and efficient inversion algorithms to obtain shear wave
velocity profiles.
The application of dispersion curves for geotechnical site characterization was
originally proposed during the 1950s (e.g., SPAC method of Aki, 1957). The new
improvements do not appear until the 1980s when the SASW technique (e.g., Nazarian
and Stokoe, 1985) was proposed. The main reason for the slow progress is the lengthy
procedure of data acquisition on site. Since then, FK (Horike, 1985), MSM (Okada,
2003), MASW (Park et al., 1999), DASW (Phillips et al., 2004), and wavefield
transformation (Forbriger, 2003a, 2003b) were developed for surface waves acquisition
in shallow site investigations. A significant simplification of the field acquisition of the
2
surface waves did not appear until Louie published his paper on the ReMi technique in
2001.
Current research is still focused on the first two steps, acquisition of broadband
ground roll and extraction of dispersion curves. They are important to successfully
estimate the geological material properties. However, the third step, inversion, is
essential for obtaining proper geotechnical profiles.
This dissertation focuses on the third step, the inversion of the dispersion curves
of surface waves, with the aim of finding the best procedure to get a more accurate and
reliable estimate of the geological material properties. The inversion actually is
comprised of two sub-steps:
3a) estimate a model employing the theory of surface wave propagation and
mathematical optimization;
3b) appraise the model for its accuracy, either deterministically or statistically.
The dissertation uses surface waves contained in ambient seismic noise, which is
an assemblage of body and surface waves (Toksoz and Lacoss, 1968). The extraction of
dispersion curves of surface waves has been achieved by several techniques. The
refraction microtremor (ReMi) technique (Louie, 2001), licensed as SeisOpt ReMi (©,
Optim Inc.) software, is being used widely for commercial and research purposes (Scott
et al., 2004, 2006; Stephenson et al., 2005; Thelen et al., 2006) to produce reliable
dispersion curves. Thus, the ReMi technique is adopted here to generate dispersion
picks for all test data.
3
1.2 Ambient seismic noise
Ambient seismic noise is defined as the constant vibrations of the Earth’s
surface at seismic frequencies, even without earthquakes (Okada, 2003). They are also
called microtremors or microseisms. The ambient noise is ubiquitous and its amplitude
is generally very small, far below human sensing. With some extreme exceptions, the
displacement amplitudes are on the order of 10-4 to 10-2 mm (Okada, 2003). But they
vary greatly between different sites and different frequencies.
Studying Earth noise has become a part of the science at least since Brune and
Oliver (1959) published curves of high and low seismic background displacement based
on a world-wide survey of station noise. Later development is largely due to the efforts
of Japanese seismologists (Aki, 1957; Horike, 1985; Okada, 2003). Figure 1.2 plots
typical microtremor levels for over 75 permanent seismic observatories from the global
seismic networks (Peterson, 1993). Globally, the microtremor level is high
(microtremor peak) at periods at about 5 to 8 seconds and low at 20 to 200 second
periods. These frequency ranges have very little for engineering seismology. Another
relatively high level appears at periods at about 0.15 to 0.5 seconds with large variation
between stations. Most dispersion acquisition technique (for example ReMi technique)
is sensitive to the noise signals between 0.15 to 0.5 seconds. Thus, the seismic peak
within this range is useful for seismic hazard assessment of sites. The large variation
within the range between sites contains the site-dependent information.
Although the noise is studied for its own intrinsic interest, seismologists have
generally considered it as pure random signal because it hampers observations of small
and/or distant earthquakes at least until emerges of noise cross-correlation technique
4
(Campillo and Paul, 2002). Recent developments in seismology and earthquake
engineering have demonstrated experimentally and theoretically that an estimate of the
Green's function for wave propagation between two seismic stations can be obtained
from the time-derivative of the long-time average cross correlation of ambient noise
between these two stations (Campillo and Paul, 2002; Sabra et al., 2005). Shapiro et al.
(2005) showed that the dispersion characteristics of the estimated Green's functions
provide information about the wave propagation between the stations, hence, about
seismic velocities in the crust and uppermost mantle. Thus, Earth structure can be
gained from analysis of seismic noise.
However, our knowledge of ambient seismic noise is still very incomplete.
Understanding the physical nature and composition of the ambient seismic noise
wavefield, especially in urban areas, requires answering two sets of questions that are
not independent of each other:
1) What is the origin of the ambient vibrations (where and what are the
sources)?
2) What is the nature of the corresponding waves, i.e., body or surface waves?
The second set of questions also includes 2a) what is the ratio of body and
surface waves in the seismic noise wavefield? 2b) within surface waves, what is the
ratio of Rayleigh and Love waves? and 2c) again within surface waves, what is the ratio
of fundamental and higher modes?
While there is a relative consensus on the first question (p.3, Okada, 2003), only
a few and partial answers were proposed for the second set of questions, for which a lot
of experimental and theoretical work still lies ahead.
5
As known and taught for a long time in Japan, sources of ambient vibrations are
usually separated in two main categories, natural and human (Shearer, 1999, P.215).
The ratio of these two sources varies in different frequency bands (particularly within
urban areas).
At low frequencies (f < fn = 1 Hz), the origin is essentially natural, with a
particular emphasis on ocean waves, which emit their maximal energy around 0.2 Hz
(Tanimoto, 2005, 2007). This energy corresponds to the peak at period of 5 to 8 seconds
in Fig. 1.2. They are generally called microseisms by seismologists. Higher frequencies
(around 0.5 Hz) are emitted along coastal areas due to the non-linear interaction
between sea waves and the coast line (Tanimoto, 2007). Some lower frequency waves (f
< 0.1 Hz) are reported (Kobayashi and Nishida, 1998) and often referred to as the
“hum”. The “hum” is associated with atmospheric movements or excitation by oceans
(Rhie and Romanowicz, 2004).
Energetic low frequency sources are often distant (being located at the closest
oceans). The most energy is carried from the source to sites by surface waves guided in
the Earth's crust (Lay and Wallace, 1995). However, locally, these waves may (and
actually often do) interact with the local geological structure (especially deep basins)
(Yamanaka et al., 1993, 1994). Their long wavelength induces a significant penetration
depth, so that the resulting local wavefield contains local geology signature. Subsurface
inhomogeneities, excited by the long period crustal surface waves, may act as
diffraction points and generate local surface waves, and even possibly body waves.
Thus, it is possible to extract the local geologic information by studying microseisms.
Extracting information from microseisms is easier on islands (such as Japan) than in the
6
heart of continental areas because the energy at frequencies between 0.1 and 1.0 Hz
decreases with increasing distance from oceans (SESAME, 2004).
At high frequencies (f > fn = 1 Hz), the origin is predominantly related to human
activities (traffic, machinery) (Shearer, 1999, P.215) and may also be associated with
wind and water flows (Okada, 2003 p.3). These waves are generally called
microtremors by engineers. Their sources are mostly located at the surface of the earth
(except some sources like metros), and often exhibit a strong day/night and
week/weekend variability (Okada, 2003, P.14).
High frequency waves generally have much closer sources, which most of the
time are located very close to the surface. While the wavefield in the immediate vicinity
(less than a few hundred meters) includes both body and surface waves, at longer
distances, surface waves become predominant (Lay and Wallace, 1995).
The 1 Hz limit for fn is only indicative, and may vary from one city to another
(SESAME, 2004). Some specific civil engineering works (highways, dams) involving
large engines and/or trucks may also generate low frequency energy. Locally, this limit
may be found by analyzing the variations of seismic noise amplitude between day and
night, and between work and rest days as well. I do not distinguish between
microseisms and microtremors here. The terms are interchangeable in this dissertation.
Besides this qualitative information, only little information is available on the
quantitative proportions between body and surface waves, and the different kinds of
surface waves that may exist (Rayleigh/Love, fundamental/higher). The few available
results, reviewed in Bonnefoy-Claudet et al. (2006), report that low frequency
microseisms predominantly consist of fundamental mode Rayleigh waves, while there
7
is no real consensus for higher frequencies (> 1 Hz). Different approaches were
followed to reach these results, including analysis of seismic noise amplitude at depth
and array analysis to measure the phase velocity (SESAME, 2004).
The very few investigations on the relative proportion of Rayleigh and Love
waves all agree on more or less comparable amplitudes, with a slight trend toward a
slightly higher energy carried by Love waves (around 60% - 40%) (SESAME, 2004). In
addition, there are a few reports about the presence of higher surface wave modes from
several very different sites (some very shallow, other much thicker, some other with
low velocity zone at depth).
The following table simplifies the above discussion.
Table 1.1 Summary of characteristics of seismic ambient noise
Natural
Human
Name
Microseism
Microtremor
Frequency
0.1 - fnh (0.5 Hz to fnh (0.5 Hz to 1 Hz) - 10 Hz
1 Hz)
Origin
Ocean
Traffic / Industry / Human activity
Incident wavefield Surface waves
Surface + body
Amplitude
Related to oceanic Day / Night, Week / week-end
variability
storms
Rayleigh / Love
Incident wavefield Comparable amplitude – slight
issue
predominantly
indication that Love waves carry a little
Rayleigh
more energy
Fundamental /
Mainly
Possibility of higher modes at high
Higher mode issue Fundamental
frequencies (at least for 2-layer case)
Further Comments Local wavefield
Some monochromatic waves related to
may be different
machines and engines. The proximity of
from incident
sources, as well as the short wavelength,
wavefield
probably limits the quantitative
importance of waves generated by
diffraction at depth
In summary, ambient seismic noise is ubiquitous and its amplitude is small. The
low frequency ambient noise is essentially nature while the high frequencies are related
8
to human activities. The acceleration power spectral of ambient noise shows several
peaks. The peak at periods of about 0.15 – 0.5 seconds is useful for seismic hazard
assessment of sites by providing a potential seismic source for shallow S-wave velocity
investigations. Due to the short wavelength contained in the ambient noise, effective
investigation depth is limited (for example less than 100 m by ReMi technique).
The review on the origin of the ambient seismic noise shows that the seismic
noise wavefield is complex. When extracting dispersion curves of microseisms, one has
therefore to consider the possible contributions to the microseisms from both surface
and body waves, including higher modes of surface waves.
1.3 Surface wave properties
Seismic waves can be categorized by whether they travel through a medium
(body waves) or along the medium’s surface (surface waves). Body waves propagate by
a series of compressions and dilatations of the material or by shearing the material back
and forth. The first type of body wave is variously known as a dilatational, longitudinal,
irrotational, compressional, or P-wave, the latter name being due to the fact that this
type is usually the first (primary) event on an earthquake seismogram. The P-wave
forces particles of the medium to move back and forth parallel to the direction of
propagation (Fig. 1.3). The second type is referred to as the shear, transverse, rotational,
or S-wave (because it is usually the second event observed on an earthquake
seismogram). Under S-waves, the medium is displaced transversely to the direction of
propagation (Fig. 1.3). Moreover, because the rotation varies from point to point at any
given instant, the medium is subjected to varying shearing stresses as the wave moves
9
along. S-wave particle motion is often divided into two components: the motion within
a vertical plane through the propagation vector (SV-wave), and the horizontal motion in
the direction perpendicular the plane (SH-wave)
In an infinite homogeneous isotropic medium, only body waves exist (Aki and
Richards, 2002). However, when the medium does not extend to infinity in all
directions, surface waves (known in seismic exploration as ground roll) can be
generated. The primary type of surface wave is the Rayleigh wave. This wave travels
along the surface of the earth and involves an interference of the P-wave and SV-wave.
The particle motion is confined to the vertical plane that includes the direction of
propagation of the wave. The motion is counter clockwise (retrograde) at the surface,
changing to purely vertical motion at a depth of about one fifth of a wavelength, and
becoming clockwise (prograde) at greater depths (Fig. 1.4). The amplitude of the
Rayleigh wave motion decreases exponentially with depth. Because of the existence of
vertical medium-velocity gradients in the real world, the velocity of the Rayleigh wave
varies with wavelength (called dispersion curves); longer period waves travel faster
because they sense the faster material at greater depth. The second type of surface wave
is the Love wave. Love waves are formed through the constructive interference of high
order SH multiples. The particle motion is horizontal and in the direction of SH waves.
The amplitude of this wave motion decreases exponentially with depth. They exhibit
dispersion as well.
Geometrical spreading for surface waves is proportional to r-0.5, in contrast to the
body wave where the geometrical spreading is proportional to r-1, where r is distance
10
from the source (Anderson, 1991). Rayleigh waves often are dominant events in seismic
records.
The amplitude of surface waves decreases exponentially with depth (Fig. 1.4).
Most of the energy propagates in a shallow zone, roughly equal to one wavelength.
Consequently, the wave propagation is influenced by the properties of this limited, nearsurface portion of the geological or geotechnical profile.
The propagation of surface waves in a vertically heterogeneous medium shows a
dispersive behavior. Dispersion means that different frequencies have different phase
velocities. In a homogeneous medium, the different wavelengths (Rayleigh wave only)
“sample” different depths of the subsoil. Since the material is homogeneous, all the
wavelengths have the same velocity (Fig. 1.5 left). In other words, Rayleigh waves are
non-dispersive and Love waves do not exist in a homogeneous medium. If the medium
is not vertically homogeneous, for instance it is layered, with layers having different
mechanical properties, the different wavelengths “sample” different depths to which
different mechanical properties are associated. Each wavelength propagates at a phase
velocity depending on the mechanical properties of the layers involved in the
propagation (Fig. 1.5 right). So the surface wave does not have a single velocity, but a
phase velocity that is a function of frequency.
This relation between frequency and phase velocity is called a dispersion curve
and depends on the geology underneath. At high frequency, the phase velocity is close
to the S-wave velocity through the uppermost layer. At low frequency, the effect of
deeper layers become important, and the phase velocity tends asymptotically to the Swave velocity of the deepest material, as if it extends infinitely in depth (the half space).
11
The shape of dispersion curves is related to geologic profiles. Longer
wavelengths penetrate deeper than shorter wavelengths for a given mode and are more
sensitive to the elastic properties of the deeper layers. Thus, for a profile where S-wave
velocity increases with depth, a normal dispersion curve (phase velocities decrease with
frequency) will be observed (Fig. 1.6). For a profile where S-wave velocity decreases
with depth, a reverse dispersion curve (phase velocities increase with frequency) will be
observed. In an irregular S-wave velocity profile, the phase velocities show a complex
relation with frequency (Fig. 1.6).
In reality, the S-wave velocities increase with depth in most geological
structures. Most of the observed dispersion curves are normal. A complex shape of
dispersion curves is also observed in our surface wave surveys in Las Vegas. This is due
to the regional distributed cliché layer in Las Vegas basin. The reverse dispersion
curves are observed from our synthetic models. The observation of a reverse dispersion
curve in the surveys might be caused by the limited frequency bandwidth of the
surveys. The observed reverse curves actually is one part of the complex dispersion
curves.
Like a vibrating string, the surface wave propagation in vertically heterogeneous
media is actually a multi-modal phenomenon. For a given geology, at each frequency
different wavelengths can exist (Fig. 1.7). Hence different phase velocities are possible
at each frequency, each corresponding to a mode of propagation. The different modes
can exist simultaneously (Aki and Richards, 2002) (Fig. 1.8).
The different modes, except the first one, exist only above their cut-off
frequency, which is for each mode the lowest limit frequency at which the mode can
12
exist. With a finite number of layers, in a finite frequency range, the number of modes
is limited. At very low frequency, below the cut-off frequency of the first higher mode,
only the fundamental mode exists.
Modes are not just theory or mathematically possible solutions; they are often
observed in experimental data, also in the frequency ranges of interest for engineering
purposes. The energy associated to the different modes depends on many factors, the
geology at first, but also the depth and the kind of source.
The first mode is sometimes dominant over a wide frequency range, but in many
common situations higher modes play important roles and are dominant in energy. So
they cannot be neglected. The different modes have different phase velocities.
Therefore, they are separated at distance from the source. at short distances modes
superimpose on one another, and mode identification can be impossible.
At the engineering scale, the modal superposition is important. The effective
Rayleigh phase velocity deriving from the modal superposition is only an apparent
velocity that depends on the observation layout, source orientation, and position.
1.4 Seismic acquisition techniques used in shallow site investigations
Many seismic methods have been used by seismologists to determine the
velocity structure of the Earth at different scales (Lay and Wallace, 1995). They include
the reflection seismic method used by exploration geophysicists, and the use of bodywave arrival times, surface-wave dispersion, and free-oscillation periods of the Earth.
Those methods are now being successfully adopted in the determination of shallow S-
13
wave structure to help in the specification of design ground motions for engineering
purposes (Horike, 1985, 1988; Nazarian and Stokoe, 1985; Stephenson et al., 2005).
According to Boore (2006), the seismic methods used in shallow site
investigations are categorized according to invasiveness (Table 1.2). The noninvasive
methods are further organized according to number of stations used. The multiplestation group is subdivided into those methods that use active sources, those that use
passive sources, and those that combine active and passive sources.
Table 1.2. Seismic acquisition techniques used in shallow site investigations
Invasive
Noninvasive
Surface
Receiver in borehole
HVSR
source
(Boore &
Receiver in cone
Thompson,
penetrometer
SASW
2007)
Active sources
DASW
Suspension P-S
logger (Nighbor and
MASW
Multiple
Imai, 1994)
F-K
stations
Downhole
Passive sources
SPAC
source
Crosshole
ReMi
(ASTM, 2003)
Combined active and
MASW
passive sources
HVSR: Horizontal/vertical spectral ratio method ( on single station
microtremor data) (Bonnefoy-Claudet et al., 2006)
SASW: Spectral analysis of surface wave (Nazarian and Stokoe, 1984)
DASW: Distance analysis of surface wave (Phillips et al., 2004)
MASW: Multichannel analysis of surface wave (Park et al., 1999)
Acronyms
F-K: Frequency-wavenumber method of processing seismic array data
(Horike, 1985)
SPAC: Spatial autocorrelation method of processing seismic array data
(Okada, 2003)
ReMi: Refraction microtremor method (Louie, 2003)
1.4.1 Invasive methods
These methods require data from seismometers placed beneath the Earth’s
surface. They can be divided into two groups: those using surface sources and those
using down-hole sources.
14
Surface-source methods (Boore & Thompson, 2007) employ the source at the
surface and a sensor either clamped to the edges of a cased borehole at a series of
depths, or mounted near the tip of a special tool (a seismic cone penetrometer) that is
pushed into the ground (seismic cone penetration testing, or SCPT). The surface sources
are activated and the seismic waves are recorded in the sensors. Usually a threecomponent seismometer is used as the sensor, and two types of sources are commonly
used. Either a plank struck with a sledge hammer on the ends or an air-activated slide
hammer (in either case the device is held to the ground by the weight of a truck’s tires)
is used to generate S-wave energy. For P-wave energy, usually a metal plate is struck
with a sledge hammer (e.g. Liu et al., 1988). The first arrivals on the resulting record
section are picked, and then a velocity model is found from these arrivals. In some
cases the velocities are determined from a line fit through adjacent arrivals, thus
providing velocities over various intervals of depths. A model such as this can be used
in correlation with shear-wave velocities and geologic units (e.g., Holzer et al., 2005).
In the past, downhole-source methods usually involved crosshole studies, where
a source in one hole emitted waves that traveled more-or-less horizontally to receivers
in an adjacent hole(s). The crosshole method has several limitations (ASTM, 2003): 1)
it is very expensive in that it requires multiple holes whose spatial orientation needs to
be known precisely; 2) the velocities are measured in the horizontal direction and may
not be appropriate for waves traveling essentially vertically, as are those of most
concern in earthquake engineering; 3) the velocity model may not extend without gaps
from the surface to depth. On the other hand, the method is useful for detecting local
15
variations in soil properties, which might be important for liquefaction potential or for
foundation design.
For most purposes related to earthquake engineering, the crosshole method has
largely been replaced by a method developed by the OYO Corporation (Nighbor and
Imai, 1994). The method is known by several names, the most common being variants
of “Suspension P-S Velocity Logging Method”. Information on this widely-used
method can be found at http://www.geovision.com/PDF/M_PS_Logging.PDF. The
method makes use of a probe lowered into a hole, on which a source near the bottom of
the probe emits acoustic waves that are coupled into P- and S-waves at the edges of the
borehole. These waves travel in the surrounding material and are reconverted into
acoustic waves that are then recorded on two receivers mounted 1 m apart. The wave
velocities are given by the difference in travel times at the two receivers. The method
works best in uncased boreholes and can be used in relatively deep holes. It provides
much finer resolution than the surface-source downhole-receiver methods discussed
earlier. Possible drawbacks are that the method sometimes does not yield accurate
velocities near the surface, and does not formally produce a model extending to the
surface. In addition, it is not possible to interpolate across any zones where data are not
obtained. This is in contrast to the surface-source downhole-receiver method where a
single well-recorded travel time below a depth interval with poor data still provides an
average velocity across the skipped interval.
1.4.2 Noninvasive methods
A major disadvantage of the invasive methods is the need for a borehole and the
cost of drilling. For this reason, many noninvasive methods have been devised for
16
obtaining a subsurface velocity structure. As shown in Table 1.2, these methods are
conveniently divided into those that use active sources, those that use passive sources,
and those that use both. Most of the methods attempt to measure fundamental mode
dispersion curves of Rayleigh waves (Boore, 2006). The velocity models are obtained
by inverting these dispersion curves, using either iterative forward modeling or various
inversion algorithms.
SASW is the popular noninvasive method in earthquake engineering community
(e.g., Nazarian and Stokoe, 1984; Brown et al., 2002). This method uses the phase
difference between two receivers, calculated by cross power spectra of the recorded
signals, and a variety of sources, ranging in size from small hammers for high
frequencies to large vehicles (such as those used in petroleum exploration that emit
vibrations at different frequencies, or a large tractor rocking back and forth) for longer
periods. Given spatial spacing of two receivers, the phase difference gives phase
velocity of Rayleigh waves. DASW (Phillips et al., 2004) is proposed to complement
SASW and to evaluate horizontal homogeneity of a medium by examining the phase of
surface waves with respect to horizontal distance.
The field configuration of MASW is the same as that used in conventional
common midpoint (CMP) body-wave reflection surveys. The generated seismic signals
by various sources are simultaneously recorded by a large number of channels (e.g.,
Park et al., 1999). After a wavefield transformation, 1D Fourier transformation on time
followed by an integral transformation (equation (4) of Park et al., 1998), the recorded
wavefields of a single shot gather give rise to images of dispersion curves. Recent
17
developments of MASW is to use the ambient noise (Park et al., 2004) and both active
and passive sources (Park et al., 2005).
A limitation to the active source methods in general is the difficulty of
generating low frequency waves. The amount of active-source energy to push down the
low frequency end of a dispersion curve often increases by several orders of magnitude,
rendering efforts with an active source impractical and uneconomical (Park et al.,
2004). This limits the depths for which velocity models can be obtained. Passive
sources include microtremors produced by a range of natural phenomena (e.g., ocean
surf and wind) and artificial sources (e.g., traffic, machinery). The frequencies can be
quite low (Earth noise at periods near 8 sec required the development of both long- and
short-period sensors in the first global scale seismographic network) (Peterson, 1993).
Measurements of microtremors are usually made on arrays of instruments placed in
two-dimensional configurations, although one method uses linear arrays (the ReMi
method of Louie, 2001). Extraction of the phase velocities can be done using beamforming or frequency-wavenumber (f-k) methods (e.g., Horike, 1985; Liu et al., 2000),
or by using the SPAC method first proposed by Aki (1957) and now experiencing a
resurgence of interest (e.g., Okada, 2003; Asten, 2005a, 2005b). One limitation in
practice is that 2D instrument arrays are usually not dense enough to resolve nearsurface velocities, and yet these velocities can have an important effect on site
amplifications.
Single-station methods for determining shear-wave velocities have been used
over the years (e.g., Bard, 1998; Scherbaum et al., 2003). There is an excellent project
called Site EffectS assessment using AMbient Excitations (SESAME at http://sesame18
fp5.obs.ujf-grenoble.fr/index.htm). The web site provides excellent reports and
publications for the implementation of the H/V spectral ratio technique on ambient
vibrations (measurements, processing, and interpretation). The methods make use of the
frequency-dependence of Rayleigh-wave ellipticity, which in turn depends on the
subsurface velocities (e.g., Boore and Toksšz, 1969). Contamination by higher modes
can complicate the determination of the velocity structure from the observed ellipticity
(e.g., Arai and Tokimatsu, 2004, 2005). Most methods based on the inversion of
apparent velocities vs. frequency make the assumption that the velocities correspond to
fundamental-mode surface waves. This assumption is not always true, particularly at
longer periods for which the offset between the source and the receivers may not be
sufficient for the body and surface waves to be differentiated in time and in amplitude.
This is one reason that some studies use a combination of active and passive sources,
combining the dispersion curves for the two observation methods (e.g. Yoon and Rix,
2005).
1.4.3 ReMi methods
The Refraction Microtremor (ReMi) method (Louie 2001) transforms the timedomain velocity results of microtremor recordings on a linear array into the frequency
domain by a two-dimensional slant transformation (p-τ) followed by a one-dimensional
Fourier transformation on τ. The method allows for separation of Rayleigh waves from
body waves and other coherent noise, and for easy recognition of dispersive Rayleigh
waves.
ReMi data acquisition consists of setting up a linear array of geophones and
recording ambient seismic noise with no need for a specially cased borehole or any
19
sources (Fig. 1.9). After transformations, a Rayleigh wave dispersion curve is derived
and displayed in a p-f image where dispersion curve picks can be made. These picks are
used to model the subsurface geology and seismic velocities. The effective depth of
investigation is related to the length of the geophone array. Examples of the p-f image,
the dispersion curve fitting, and the shear-wave velocity model are shown in Fig. 1.10.
ReMi surveys provide an effective and efficient means to acquire general, onedimensional, information about large volumes of the subsurface with one setup (Louie,
2001). This method measures ambient seismic noise. It can be conducted in seismically
noisy areas such as construction zones and urban environments. The ReMi method,
licensed as SeisOpt® ReMi™ (©, Optim Inc.) software, is being used widely for
commercial and research purposes (Scott et al., 2004, 2006; Stephenson et al., 2005; Liu
et al., 2005; Thelen et al., 2006) to produce reliable dispersion curves. This dissertation
uses the SeisOpt ReMi software to generate dispersion curve picks, and models, for all
test data.
20
0.0
Trace Sequence
0.0
20.0
Rayleigh waves
Synthetic example
Time, sec
1. Acquisition
4.0
0.0
0.0
5.0
Frequency (Hz)
10.0
15.0
20.0
25.0
2. Extraction
Slowness (s/m)
0.002
0.004
Dispersion
picks
(squares) on p-f image
0.006
0.008
0.01
2.5
ReMi Spectral Ratio
0.0
3. Inversion
0
700
10
600
500
Vs (m/s)
Depth (m)
20
30
40
300
200
50
60
400
100
400
600
Vs (m/s)
800
1D S-wave velocity
profile with uncertainty
0
0
0.2
0.4
Period (s)
0.6
0.8
Observed and calculated
dispersion curves
Figure 1.1 Three steps involved in utilizing dispersion curves of surface waves for
imaging geologic profiles.
21
Figure 1.2 The acceleration power spectrum of microtremors recorded at 75
permanent seismic observatories throughout the world (Peterson, 1993).
22
Figure 1.3 Body wave motion. From http://www.eas.purdue.edu/~braile/edumod/slinky/slinky4.doc
Figure 1.4 Particle motion and amplitude of Rayleigh waves. The motion in a homogenous, isotropic half space is retrograde at the surface, passing through purely verticle
at about lamda/5 then becoming prograde at depth (Cuellar, 1997).
23
Figure 1.5. Surface wave
dispersion. In a homogeneous
half space (left) all the wave
lengths sample the same
material and the phase velocity is constant. When the
properties changes with depth
(right) the phase velocity
depends on the wavelength,
forming a dispersion curve.
z (m)
Normal
c (m/s)
β (m/s)
f (Hz)
z (m)
Inverse
c (m/s)
β (m/s)
f (Hz)
Irregular
c (m/s)
z (m)
β (m/s)
f (Hz)
Figure 1.6. Phase velocities vs frequencies. A normal dispersion curve results from a
profile where S-wave velocity increases with depth. For a profile where S-wave velocity decreases with depth, a reverse dispersion curve will be observed over some range
of frequency. For an irregular S-wave velocity profile, phase velocities show a complex
relation with frequencies.
24
0
5
10
Depth (m)
15
20
25
30
35
40
45
50
Fundamental-mode 1st-higher-mode 2nd-higher-mode
Figure 1.7. Modes of surface waves. For
the same frequency, higher modes
penetrate deeper.
phase velocity
3rd-higher
4th mode
2nd-higher
3rd mode
1st-higher
2nd mode
Fundamental-mode
1st mode
f C1
f C2
f C3
frequency
Figure 1.8. Dispersion curves of higher-mode
surface waves. For the same frequency, higher
modes exist only above their cut-off frequency
and propogate faster than the fundamental mode.
25
noise
a) linear array recording
ambient seismic noise
b) field deployment
Figure 1.9. A typical ReMi field configuration.
a) p-f image with dispersion picks
c) 1D S-wave velocity profile
0
500
1000 1500 2000 2500 3000 3500 4000 4500 5000
0
-10
Vs100' = 1299 ft/s
-20
-30
-50
b) dispersion curve fitting
Rayleigh Wave Phase Velocity,ft/s
-60
Dispersion Curve Showing Picks and Fit
3000
Calculated Dispersion
2500
Depth, ft
-40
-70
Picked Dispersion
2000
-80
1500
1000
-90
500
-100
0
0
0.05
0.1
0.15
0.2
0.25
0.3
Period, s
Shear-Wave Velocity, ft/s
Figure 1.10. A typical ReMi analysis.
26
Chapter 2 Forward modeling of surface-wave dispersion
Many methods have been proposed to calculate dispersion curves of surface
waves. They can be categorized into propagator matrix methods and numerical methods
(Table 2.1).
The matrix methods start from Thomson and Haskell (Thomson 1950; Haskell
1953) who used matrices to solve the eigenvalue problem of the system of differential
equations. The matrix methods construct a dispersion equation (or secular equation),
which is an implicit function of frequency, phase velocity (wave-number), thicknesses,
elastic parameters, and damping of the layers. The dispersion curves are the roots
(eigenvalues) of the dispersion equation for possible modes of propagation at any
particular frequency. Therefore, the solutions are analytic.
Table 2.1 Methods of forward modeling of dispersion curves
Propagator matrix
Transfer matrix method
(Thomson, 1950; Haskell, 1953)
Stiffness matrix method
(Kausel and Roesset, 1981)
Reflection-transmission coefficient
(Kennett, 1983; Luco and Aspel,1983)
Numerical methods
Finite element method
(Lysmer and Drake, 1972)
Finite difference method
(Boore, 1972)
Numerical integral
(Takeuchi and Saito, 1972)
There are basically three matrix methods:
1). Transfer matrix -Most commonly used method, especially in earthquake and
exploration seismology, after Thomson (1950) and Haskell (1953);
2). Stiffness matrix - Complementary method, often favoured by engineers, after
Kausel and Roesset (1981); and
27
3). Reflection-transmission (R/T) coefficient matrix (called R/T method
thereafter) for the entire stack of layers by Kennett (1983) and Luco and Aspel (1983).
The above three categories are collectively known as propagator matrix methods
because these methods allow the propagation of the stress-motion or stress-displacement
field through the layered stack from a known value at a reference depth. They are
analytically exact and all equivalents (Buchen and Ben-Hador, 1996).
The implicit dispersion equation could be solved by numerical methods (Table
2.1). The methods include finite element method, finite difference method, and direct
numerical integration. The fundamental difference is how they approximate the
dispersion equation.
The R/T method is well studied and provides the best numerical technique for
computing the surface waves dispersion curves (Zeng and Anderson, 1995). The method
is stable for high frequencies (Chen 1993; Hisada, 1994, 1995). Phase velocities over 100
Hz for a layered crustal model are calculated (Chen, 1993). Thus, it is suitable for ReMi
phase velocity picks because surface wave phase velocity picks are generally made at
frequencies as low as 2 Hz and as high as 100 Hz (Louie, 2001). However, like other
methods, the R/T method is time-consuming. For example, calculation of phase velocity
dispersion curves for fundamental-mode Rayleigh waves for a twenty-four-layer model
takes about 7 seconds on a 1.33 GHz CPU. This has a negative impact on a non-linear
inversion algorithm which usually requires thousands of forward modeling of dispersion
curves.
Based on the R/T method, this study achieved a new more efficient algorithm,
called the fast generalized R/T coefficient method or the fast R/T method, to calculate the
28
phase velocity of surface waves for a layered earth model. The fast method is based on
but is more efficient than the method of Chen (1993) and Hisada (1994, 1995). Except for
a few modifications, most of the mathematic equations and notations are from Chen
(1993). Specifically this chapter focuses on:
1). A dispersion curve of surface waves is an implicit non-linear function of Swave velocity, thickness, density, and P-wave velocity of each layer, listed in a
decreasing order of priority (Xia et al., 1999). Solving for dispersion curves is an
eigenvalue problem of the system of differential equations. But what is the system of
differential equations?
2). Traditional R/T method is considered one of stable and efficient methods.
How does the traditional R/T method solve the system of differential equations for the
dispersion curves?
3). The fast R/T method is faster than the traditional R/T method while
maintaining the stability. How?
4). The efficiency and stability of the fast R/T method is demonstrated by tests of
six cases at both large and small scales.
2.1 Motion-stress vector
Earth material must behave elastically in order to transmit seismic waves. The
behavior of the material is described by density ρ and elastic constants including shear
modulus (µ), Young’s modulus (E), Bulk Modulus (K), and Poisson’s ratio (σ). Those
constants, along with the two Lame parameters (λ and shear modulus µ) completely
describe the linear stress-strain relation within an isotropic solid. Their definitions and
29
relations are tabulated in Table 2.2 and 2.3. There are numerous, excellent papers on the
elastic behavior and derivation of wave equations. The following summaries are from
Lay and Wallace (1995), Shearer (1999), and Aki and Richards (2002).
The theory of elasticity provides mathematical relationships between stresses and
strains in the medium (for details see Shearer, 1999, Chapter 2) and thus governs the
equation of motion in the medium:
ρ
∂ 2ui
= τ ji , j + fi
∂t 2
(2.1)
where ui the displacement in ith component, ρ the density, τ the stress, f the body force.
Equation (2.1) is known as the equation of motion in the medium. For details, please refer
Shearer (1999, equation (3.6) ) and Aki and Richards (2002, equation (2.13) ).
Table 2.2 Definition of elastic constants
Name
Symbol
Young’s modulus
E
Shear modulus
µ
Poisson’s ratio
σ
Bulk Modulus
K
Definition
longitudinal stress
longitudinal strain
shear stress
shear strain
longitudinal stress
transversal strain
idrostatic stress
volumetric strain
Notes
Free transversal deformation
Free transversal deformation
Starting with the simplest problems, we consider Cartesian coordinates and a
surface wave u propagating in the horizontal direction of increasing x with angular
frequency ω and wavenumber k:
u( x, y, z , t ) = Z( z )ei ( kx −ω t )
(2.2)
30
where z is depth, Z(z) is amplitude exponential decay term for surface waves. Other
harmonic wave parameters are listed on Table 2.4.
Table 2.3 Relationship between elastic constants
µ or σ
λ or µ
λ
λ
µ
µ
K
E
σ
(3λ + 2µ )
3
µ (3λ + 2µ )
E=
λ+µ
K=
σ=
λ=
2 µσ
1 − 2σ
µ
K=
2µ (1 + σ )
3(1 − 2σ )
E = 2µ (1 + σ )
E
σ
σ
λ
2(λ + µ )
K or µ
2
λ=K− µ
3
E or σ
µE
λ=
(1 + σ )(1 − 2σ )
E
µ=
2(1 + σ )
E
K=
3(1 − 2σ )
µ
K
9K µ
3K + µ
3K − 2 µ
σ=
2(3K + µ )
E=
Table 2.4 Harmonic wave parameters
Frequency
f
Hz
Period
T
s
Velocity
c
m/s
Angular frequency
ω
radian/s
Wavelength
λ
m
Wavenumber
k
radian/m
1 ω
c
=
=
T 2π Λ
1 2π Λ
T= =
=
f
c
ω
f =
c= fλ =
λ
=
ω
T k
2π
ω = 2π f =
= ck
T
f
2π
= cT =
c
k
ω 2π 2π f
k= =
=
c
c
λ
λ=
31
Let us now consider surface waves propagating in the x-direction in a vertically
heterogeneous, isotropic, elastic medium occupying a half-space z>0 in which elastic
moduli λ(z), µ(z) and density ρ(z) are arbitrary function of z.
Love waves are SH waves only. Their displacements in three directions (u, v, w)
are in the form of:
u = 0

i ( kx −ω t )
 v = l1 (k , z , ω )e
w = 0

(2.3a)
The stress components associated with the above displacement are:
 τ xx = τ yy = τ zz = τ zx = 0

dl1 i ( kx −ω t )

e
τ yz = µ
dz

τ xy = ik µ l1ei ( kx −ωt )
(2.3b)
where l1 is amplitude exponential decay term in both equations.
Equation of motion (equation (2.1) ) must satisfy the following four boundary
conditions:
BC1 (radiation condition): The displacement in infinite depth is zero u |z →∞ = 0 ;
BC2 (displacement continuity condition): Displacement must be continuous
across any layer boundary ui |z = d = ui +1 |z = d ;
BC3 (traction continuity condition): Traction must be continuous across any
layer boundary τ i |z = d = τ i +1 |z = d ;
BC4 (zero traction at the free surface): Traction must be zero at the free surface
τ |z =0 = 0 .
32
After applying BC1-BC4, equation (2.3) could be rewritten as a set of first-order
ordinary differential equations,
 dl1 ( k , z , ω ) l2 ( k , z , ω )
 dz = µ

 dl2 ( k , z , ω ) = (k 2 µ ( z ) − ω 2 ρ ( z ))l ( k , z , ω )
1

dz
(2.4)
Or in a matrix form

d  l1 ( k , z ,ω )  
0
=
 ( )   2
dz  l2 k , z ,ω   k µ ( z )-ω 2 ρ ( z )
1 

µ (z) 

0 
 l1 
 
 l2 
(2.5)
Equation (2.5) is called the motion-stress vector for Love (SH) waves.
For Rayleigh waves that are resulted from interference between P and SV-waves,
the displacements in three directions (u, v, w) are in the form of:
 u = r1 ( k , z , ω )ei ( kx −ω t )

v = 0
 w = ir ( k , z , ω )ei ( kx −ω t )
2

(2.6a )
The stress components associated with the above displacement are:
 τ yz = τ xy = 0

dr2
i ( kx −ω t )
τ xx = i λ dz + k (λ + 2µ )r1  e

dr
i ( kx −ω t )
τ yy = i λ dz2 + k λ r1  e

dr1
i ( kx −ω t )
= r3ei ( kx −ω t )
 τ zx = µ ( dz − kr2 )e
τ = i (λ + 2µ ) dr2 + k λ r  ei ( kx −ω t ) = ir ei ( kx −ω t )
1
4
dz
 zz

(2.6b)
where r1 and r2 are amplitude exponential decay terms in both equations.
After applying BC1-BC4, the motion-stress vector for Rayleigh waves (r1, r2, r3,
r4)T are obtained as
33
1
k
0
 r1  
µ (z)

 
d  r2   λ ( −z )k+λ2(µz )( z )
0
0
=
dz  r3 
k 2ξ ( z )-ω 2 ρ ( z )
0
0
  

r
 4 
−ω 2 ρ ( z )
0
-k
4 µ ( z )[λ ( z ) + µ ( z )]
where ξ ( z ) =
λ ( z ) + 2µ ( z )


1

λ ( z )+ 2 µ ( z )

kλ ( z )
λ ( z )+2 µ ( z ) 

0

0
 r1 
 
 r2 
 r3 
 
 r4 
(2.7)
Equation (2.7) is a linear differential eigenvalue problem with displacement
eigenfunction r1 and r2 and stress eigenfunction r3 and r4. For a given frequency, ω, nontrivial solutions exist only for special values of the wavenumber, k. These possible values
k1(ω), k2(ω), … kn(ω) are called eigenvalues. The corresponding functions (r1, r2, r3, r4)
are the eigenfunctions.
A general form of the motion-stress vector for both SH and P-SV waves is
df ( z )
= G ( z )f ( z )
dz
(2.8)
2.2 Reflection and transmission coefficients
Waves are scattered between two solid half-space. For SH waves, four possible
scatters occur (Fig. 2.1). The scatter matrix for a SH wave would be
R
 du
 Td
Tu 

Rud 
where R represents reflection coefficient and T represents transmission coefficient. Subindex ‘d’ means down-going waves; ‘u’ up-going waves. For example, Rdu
is the
reflection coefficient of incident down-going SH-wave to reflected up-going SH-wave at
interfaces. Td is the transmission coefficient of incident down-going SH-wave to
34
transmitted down-going SH-wave at interfaces. Other terms have similar physical
meaning.
For P-SV waves, sixteen possible scatters occur (Fig. 2.2). The scatter matrix
would be
 Rdpp
Tu   Rdps
= T

R ud   dpp
 Tdps
 R du
T
 d
Rdsp
Rdss
Tupp
Tups
Tdsp
Tdss
Rupp
Rups


Rusp 

Russ

Tusp
Tuss
Sub-index ‘d’ means down-going waves; ‘u’ up-going waves; ‘p’ P-waves; and ‘s’ SVwaves. Rdps is the reflection coefficient of incident down-going P-wave to reflected SVwave at interfaces. Tdps is the transmission coefficient of incident down-going P-wave to
transmitted down-going SV-wave at interfaces. Other terms have similar physical
meaning.
2.3 Plane waves in a layered model
Let us consider a plane surface wave in a horizontally layered, vertically
heterogeneous, isotropic, elastic medium occupying a half-space z > 0 in which elastic
moduli
µ j λj ρ j
are dependent on depth and are constant within layers (Fig. 2.3). From
equation (2.5) and (2.7), the differential equations for the motion-stress vector are

d  l1j   0
=
  
dz  l2j   k 2 µ j -ω 2 ρ j
1 

 l1j 
 j 
0   l2 
µj
(2.9)
for SH waves and
35
 r1 j   0
 j
j
d  r2   −j k λ j
2
λ
µ
+
=
dz  r3j   k 2ξ j -ω 2 ρ j
 j 

 r4   0
k
1
µj
0
0
0
0
−ω 2 ρ j
-k
 j 
  r1 
1
 rj
λ j +2µ j   2 
j
k λ j   r3 
j
j


λ +2µ
 r j 
4 

0

0
(2.10)
for P-SV waves.
0
1
j
N
N +1
j −1
j
where z < z < z , j = 1, 2,3,", N , N + 1 and 0 = z < z < " < z < " < z < z = ∞
In matrix format, equation (2.9) and (2.10) could be summarized as
df j ( z )
= G j ( z )f j ( z )
dz
(2.11)
j
where f ( z ) is the motion-stress vector for the jth layer and has dimension of 4x1 for P-
SV waves or 2x1 for SH waves. Accordingly, the G matrix (most right matrix in the
right-hand-side of equation (2.9) and (2.10) ) has dimension of 4x4 and 2x2 for P-SV and
SH waves, respectively.
Inside each layer, the analytic solution of the differential equation system
(equation (2.11) ) has the following format (Aki and Richards, 2002):
f j ( z) = A j B j D j
(2.12)
j
j
j
for j = 1, 2,3, ", N , N + 1 . where A and B are know matrices given below. But D are
unknown vectors to be determined. For SH waves we have

A j B j D j = 
1
j
 −µ ν
j
1 

µ jν j 
 e−ν j z

 0
 S j↓
j  j 
eν z   S ↑ 
0
(2.13)
j = 1, 2,3," , N . where ν = ± k 2 − ω
2
2
β
For P-SV waves we have (Aki and Richards, 2002, p.276 equation (7.55) )
36
 α jk
1  α jγ j
j
j
j
ABD =  j j j
ω  −2α µ kγ
 −α j χ j µ j
β jν j
β jν j
β jk
−α j γ
−β jχ jµ j
2α j µ j kγ
−2 β j µ j kν
 −γ j z
e
 0
−β jk
β jχ jµ j 
 0
−2 β j µ j kν j  
 0
α jk
j
j
j
−α j χ j µ j
0
0
j
e−ν z
0
0
j
eγ z
0
0
  P j ↓
  j 
0
  S j ↓ 
0   P ↑
 j 
j   S ↑
eν z 
0
(2.14)
j = 1, 2,3," , N . where γ = ± k 2 − ω
2
2
α
j
BC1-BC4 is to determine the unknown matrix D for each layer. The continuity
condition implies
f j ( z j ) = f j +1 ( z j +1 )
(2.15a)
for j = 1, 2,3," , N .
The radiation condition requires
f N ( z) → 0
(2.15b)
as z → ∞
j
Careful observation reveals that matrix A contains some layer-specified
j
j
constants, like α , β , which are constant within a layer and varies for different layers.
These layer-specified constants could be extremely large to cause over-flow errors during
j
matrix multiplications. The matrix B contains some depth-growth terms, like e
−γ z
, eγ z ,
which are extremely large or small for a deep layer. Multiplication on the matrix causes
the instability of the analytical calculation (Chen, 1993). Excluding of these layerspecified constants and depth-growth terms will significantly increase the stability of the
algorithm. So we can rewrite the equations (2.13) and (2.14) as (Luco and Apsel, 1983;
Chen, 1993; Hisada, 1994)
37
 1

 − µ jν

j
1 

µ jν j 
 e−ν j ( z − z j −1 )

0

  e−ν j z j−1 S j ↓  j j  Csdj  j j j
  ν j z j j  =E Λ  j  =E Λ C
j
j
S ↑ 
e−ν ( z − z )   e
 Csu 
0
(2.16)
j = 1, 2,3, " , N for SH waves .
 k
 γj
 −2 µ j kγ j
 −χ jµ j

νj
k
k
−γ
−χ jµ j
2 µ j kγ
−2 µ j kν
j
 −γ
e

−k
χ jµ j  

−2 µ j kν j  

j
νj
j
j
−χ jµ j
( z−z
j −1
)
0
0
j
j −1
e−ν ( z − z )
0
0
0
j
j
e− γ ( z − z )
0
0
0
0


0

0

j
j

ν
z
z
(
)
−
−
e

0
 j

j j −1
 α e− γ z P j ↓ 
ω


 j
j j −1
β

e−ν z S j ↓ 

 ω


j
j j
α

γ z Pj↑ 

 ω e



 β j ν jz j j
e
S ↑ 

 ω

=E j Λ
 j 
 C pd 


 j 
j  Csd 


C j 
 pu 
 j 
C 
 su 
=E j Λ j C j
(2.17)
j = 1, 2,3, " , N for P-SV waves.
where E is layer matrix, Λ is phase delay matrix, and C is amplitude vector matrix.
j
It is noted that exponential terms in the matrix B in equations (2.13) and (2.14)
γ
has been modified by multiplying the layer-specified constants e
j
z j −1
, eν
j
z j −1
, e −γ
j
zj
, e −ν
j
zj
.
j
Accordingly, the unknown matrix D in equations (2.13) and (2.14) has absorbed these
factors. Absorbing of these constants greatly increase the stability of the algorithm. It
j
should be pointed out that the elements of the matrix D in equations (2.13) and (2.14)
j
represent amplitudes of downgoing and upgoing waves. The matrix C in equations
j
(2.16) and (2.17) is the matrix D that has absorbed the layer-specified constants.
j
Therefore, the elements of the matrix C have lost their original physical meanings. They
are just other layer-specified constants.
38
2.4 Phase velocity of Love waves
Love waves contain only SH waves. The analytic solution to equation (2.9) can be
rewritten as
 l1j   E11j
 j  = Ej
 l2   21
( )

E12j  Λ dj
j 
E22
 0
0
Λuj
 Cdj 
 j
 Cu 
(2.18a )
j = 1, 2,3," , N
 l1N +1   E11N +1Λ dN +1CdN +1 
 N +1  =  N +1 N +1 N +1 
 l2   E21 Λ d Cd 
(2.18b)
j = N + 1.
The equation (2.18b) is obtained by applying the radiation condition that only allows the
decaying solution to exist in the (N+1)th layer (the bottom half-space) so that
N +1
Cu
= 0.
For an arbitrary jth interface, the modified reflection and transmission
coefficients for SH waves are denoted as Rduj , Tdj . They are defined as
Cdj +1 = Tdj Cdj + Rudj Cuj +1
 j
j
j
j
j +1
 Cu = Rdu Cd + Tu Cu
(2.19a)
j = 1, 2,3,", N − 1
CdN +1 = TdN CdN
 N
N
N
 Cu = Rdu Cd
(2.19b)
j=N
Continuity condition requires that
 l1j   l1j +1 
 j  =  j +1 
 l2   l2 
⇒
(
E11j
E12j
E21j
E22j
)(
Λ dj ( z j ) 0
0
1
)
 Cdj 
 j =
 Cu 
(
E11j +1
E12j +1
j +1
E21
E22j +1
)(
1
0
0 Λ uj +1 ( z j )
)
 Cdj+1 
 j+1 
 Cu 
(2.20)
After rearranging equation (2.20) and comparing with equation (2.19), we find
39
(
Tdj
Rudj
Rduj
j
=
Tu
)=(
E11j +1
− E12j
j +1
E21
− E22j
)(
−1
E11j
− E12j +1
E21j
− E22j +1
)(
Λ dj ( z j )
0
0
Λ uj +1 ( z j )
(
)

1
0
2 µ jν j
µ j +1ν j +1 − µ jν j  Λ dj ( z j )

µ jν j + µ j +1ν j +1  µ jν j − µ j +1ν j +1
0
Λ uj +1 ( z j )
2 µ j +1ν j +1 

)
(2.21a )
j = 1, 2,3,", N − 1
 N
 Td 
 RN 
 du 
=
(
N +1
E11
N
− E12
N +1
E21
N
− E22
)
−1
 N N N 
 E11 Λ d ( z ) 
 E N ΛN ( z N )
 21 d

(2.21b)
j=N
ˆj ˆj
The generalized reflection and transmission coefficients ( Rdu , Td ) for SH waves
are defined as following:
Cdj +1 = Tˆdj Cdj
 j
j
j
 Cu = Rˆ du Cd
(2.22a)
j = 1, 2,3,", N − 1
CdN +1 = TˆdN CdN
 N
 Rˆdu = 0
(2.22b)
j=N
Cd1 = Rˆud0 Cu1
(2.22c)
j = 0 for the free surface.
It is noted that TdN = TˆdN
and
RduN = Rˆ duN
(2.23) .
At the free surface the traction-free condition yields
E1 Λ1 (0) µ 1ν 1 −ν 1 ( z1 − z0 ) −ν 1z1
1
1
0 = T 1 (0) = E21
Cd1 + E22
Λ1u (0)Cu1 ⇒ Rˆud0 = 22 u1
= 1 1e
=e
µν
− E21
(2.24)
40
Substituting equation (2.22b) in equation (2.22a), we can obtain the following
recursive formula for computing other generalized R/T coefficients
Tˆdj = (1 − Rudj Rˆ duj +1 ) −1Tdj
 j
j
j
j +1 j
 Rˆ du = Rud + Tu Rˆ du Tˆd
(2.25)
j = 1, 2,3,", N − 1
Inside the top layer the definition of the generalized reflection and transmission
coefficients gives us (equations (2.22))
Cd1 = Rˆud0 Cu1
 1
1
1
Cu = Rˆ du Cd
(2.26)
1
Combining the above two equations leads to (1 − Rˆud0 Rˆ du
)Cd1 = 0
(2.27)
The existence of the non-trivial solution leads to the following implicit dispersion
1
equation for SH waves 1 − Rˆud0 Rˆdu
=0
(2.28)
The phase velocities of SH waves are the non-trivial solution of the implicit
equation (2.28). For a given frequency, only a finite number of roots exist, corresponding
to the phase velocities from fundamental to possible higher modes.
1
According to equation (2.27), Cd (vn ) can take any non-zero value, where vn is
the phase velocity of the nth higher mode. We take unity. Then we have
 Cd1 (vn ) = 1
 1
1
1
Cu (vn ) = Rˆ du Cd (vn )
(2.29)
Applying equation (2.26) into (2.22), we find
j +1
j
j −1
1
 Cd (vn ) = Tˆd (vn )Tˆd (vn )"Tˆd (vn )
 j +1
j +1 j +1
Cu (vn ) = Rˆ du Cd
(2.30a)
41
j = 1, 2,3,", N − 1
CdN +1 (vn ) = TˆdN (vn )TˆdN −1 (vn )"Tˆd1 (vn )
(2.30b)
j=N
Finally, substituting these solved
(l
we obtain the non-trivial solutions
j
1
l2j
(C
j
d
Cuj
) for each mode into equation (2.18),
) which are eigenfunctions of displacement and
traction of SH waves. These eigenfunctions are normalized by the corresponding
maximum value.
2.5 Phase velocity of Rayleigh waves
Rayleigh waves are formed by interference between P and SV waves. The
analytic solution to equation (2.10) can be rewritten as
0  Cdj 
 D j ( z )   E11j E12j   Λ dj ( z )
j
j
j
=

 j =E Λ C


Sj z 
j
j
j
Λ u ( z )   Cu 
 ( )   E21 E22   0
(2.31)
See appendix A for each term.
For an arbitrary jth interface, the modified reflection and transmission
coefficients for Rayleigh waves are denoted as (R duj , R udj , Tdj , Tuj ) and defined by the
following equations:
Cdj +1 = Tdj Cdj + R udj Cuj +1
 j
j
j
j
j +1
 Cu = R du Cd + Tu Cu
(2.32a )
for j = 1, 2,3,", N − 1 and
CdN +1 = TdN CdN
 N
N
N
 Cu = R du Cd
(2.32b)
42
for j = N
 j
 Rdpp
where R duj = 
j
 Rdps
j 
R
dsp 

j 
R 
dss 
 j
 Rupp
, R udj = 
j
 Rups
j 
Rusp


j 
Russ

 j
T
, Tdj =  dpp
j
 Tdps
j 
T
dsp 

j 
T 
dss 
 j
T
, and Tuj =  upp
j
 Tups
j 
Tusp


j 
Tuss

. Sub-index ‘d’
j
means down-going waves; ‘u’ up-going waves; ‘p’ P-waves; and ‘s’ SV-waves. Rdps
is
the reflection coefficient of incident down-going P-wave to reflected SV-wave at
interface j. Tdpsj is the transmission coefficient of incident down-going P-wave to
transmitted down-going SV-wave at interface j. Other terms have similar physical
meaning.
Like SH waves, the modified reflection and transmission coefficients ( R duj , Tdj )
for P-SV waves can be found by applying the continuity condition at an arbitrary jth
interface. Then we have
 D j ( z )   D j +1( z ) 
 j  =  j +1 
 S (z)   S ( z) 
j
j
j +1
j +1 
  C j +1 
 j j   C j 
⇒  E11j E12j   Λd ( z ) 0   dj  =  E11j+1 E12j+1   I j+01 j   dj +1 
 E21 E22   0 I   Cu   E21 E22   0 Λu ( z )   Cu 
After rearranging, we obtain the explicit expressions of the modified R/T matrices as
follows:
 Tdj
 j
 R du
R udj   E11j +1 −E12j 
=

Tuj   E21j +1 −E22j 
−1
 E11j
 j
 E21
−E12j +1   Λ dj ( z j )
0 

j +1  
j +1
−E22   0
Λu ( z j )
(2.33a )
for j = 1, 2,3,", N − 1
 TdN
 N
 R du
N
  E11N +1
R ud
 =  N +1
TuN   E21
−E12N 
N 
−E22

−1
 E11j Λ dj ( z j ) 
 j j j 
 E21Λ d ( z ) 
(2.33b)
for j = N
43
Note that the layer matrix E is composed of elements that are determined by the elastic
parameters of both jth and (j+1)th layers.
For an arbitrary jth interface, the generalized reflection and transmission
ˆ j ,T
ˆ j ) and defined by the following
coefficients for Rayleigh waves are denoted as (R
du
d
equations:
ˆ jC j
Cdj +1 = T
d
d
 j
j
j
ˆ
 Cu = R du Cd
(2.34a)
for j = 1, 2,3,", N − 1
ˆ N CN
 CdN +1 = T
d
d
 N
ˆ
R du = 0
(2.34b)
for j = N
Comparing equations (2.32) and (2.34) we find
ˆ N = TN
T
d
d
and
ˆ N = RN
R
du
du
(2.35) .
Substituting equation (2.34) in equation (2.33), we obtain the recursive formula
for computing other generalized R/T coefficients as
ˆ j = (I − R j R
ˆ j +1 ) −1 T j
T
d
ud
du
d
 j
1
+
j
j
j
ˆ = R +T R
ˆ T
ˆj
 R
du
ud
u
du
d
(2.36)
for j = 1, 2,3,", N − 1
ˆN
Starting from the last interface where R du = 0 , we can use equation (2.36) to find the
ˆj ˆj
generalized reflection and transmission coefficients ( R du , Td ) for Rayleigh waves for all
interfaces above.
44
The Rayleigh dispersion curves can be determined by imposing the traction-free
condition at the free surface (z=0). From equation (2.31) we calculate the traction at the
free surface as
ˆ 1 )C1
S1 ( 0 ) = (E121 + E122 Λ u0 (0)R
du
d
(2.37)
Equation ( 2.3 7) has non-trivial solutions only for some particular phase velocities that
satisfy the following secular equation:
ˆ1 )=0
det(E121 + E122 Λ u0 (0)R
du
(2.38)
Equation ( 2.3 8) is called the secular function for Rayleigh waves. Therefore, the roots of
this equation are the phase velocities for modes that potentially exist.
C1d (vn ) has infinite solutions to satisfy equation (2.37). Let’s
ˆ 0 (v ) R
ˆ 1 (v )
G (vn ) = I − R
ud
n
du
n
(2.39)
We take one of them as our starting point. Thus we find that
−G12
 1
C pd (vn ) =
G112 + G122


G11
C1 (v ) =
pd
n
2

G11 + G122

(2.40)
and
ˆ 1 C1 (v )
C1u (vn ) = R
du d
n
(2.41a)
Correspondingly, we have
ˆ j (v )T
ˆ j −1 (v )" T
ˆ 1 (v )
 Cdj +1 (vn ) = T
d
n
d
n
d
n
 j +1
j +1 j +1
ˆ
Cu (vn ) = R du Cd
(2.41b)
j = 1, 2,3,", N − 1
45
ˆ N (v )T
ˆ N −1 (v )" T
ˆ 1 (v )
CdN +1 (vn ) = T
d
n
d
n
d
n
(2.41c)
j=N
(C
Finally, substituting these solved
yields the non-trivial solutions
(r
j
1
r2j
j
d
Cuj
r3j
) for each mode into equation (2.31)
r4j
)
, which are eigenfunctions of
displacement and traction of P-SV waves.
2.6 Improvements on calculation of phase velocity of Rayleigh waves
One of the most CPU time-consuming parts in the R/T method is to compute the
inverse of the 4x4 layer matrix E in equation (2.33) to obtain the modified R/T
coefficients for P-SV waves. Hisada (1995) presents an analytical solution for the inverse
matrix. However, this analytical solution requires recursive substitution. The layer matrix
E in the equation (2.33) is composed of elements that is determined by the elastic
parameters of both jth and (j+1)th layers. This property prohibits the existence of a
simple analytic solution for the inverse matrix E-1.
Careful observation from equation (2.38) reveals that only the generalized R/T
coefficients are needed to calculate the secular function of Rayleigh waves. In fact, the
generalized R/T coefficients could be directly calculated without knowing the modified
R/T coefficients. Therefore, the modified R/T coefficients are not necessary for the
calculation of the secular function of Rayleigh waves (equation (2.38) ).
The continuity condition at any arbitrary interface j states that
0  Cdj +1 
 E11j E12j   Λ dj ( z j ) 0  Cdj   E11j +1 E12j +1  1
=
 j

  

j +1
j 
j 
1   Cuj   E21j +1 E22j +1  0 Λ u ( z )  Cuj +1 
E21 E22   0
(2.42)
46
The definition of the generalized R/T coefficients implies that
ˆ j Cj
 Cuj = R
du d
 j +1
j
j
ˆ
Cd = Td Cd
 j +1 ˆ j +1 ˆ j j
Cu = R du Td Cd
(2.43)
Directly substituting equation (2.43) into equation (2.42) yields
−1
ˆj

 I   E11j E12j  E11j +1 E12j +1  
T
d
=


ˆ j   j



j
j +1
j +1
ˆ j +1T
ˆ j
 R du   E21 E22  E21 E22   Λ uj +1 ( z j )R
du
d 
(2.44)
ˆN
Starting from the last interface where R du = 0 , equation (2.44) yields the
ˆj ˆj
generalized reflection and transmission coefficients ( R du , Td ) of Rayleigh waves for all
interfaces above.
Thus, we can directly calculate the generalized R/T coefficients without knowing
the modified R/T coefficients. The next step is to derive the inverse matrix of 4X4 E
matrix in equation (2.44). Unlike these of the layer matrix E in equation (2.33), the
elements of the layer matrix E in equation (2.44) are determined by the elastic parameters
of the jth layers only. This characteristic allows many terms to be crossed out during the
derivation of the inverse matrix E-1 and results in a simple analytic solution for the
inverse matrix E-1 (Appendix A). For equation (2.33) it is impossible to derive the similar
analytic form of E-1 as the elements of matrix E are related to the elastic parameters of
both jth and (j+1)th layers. The simplicity of our solution significantly reduces the
computational time. The following test section shows that these improvements cut the
computational time for dispersion curves of Rayleigh waves at least by half.
47
2.7 Numerical examples on dispersion calculation of Rayleigh waves
Three cases are designed for the crustal scale. Model 1 is Gutenberg’s classic
crust and upper mantle model for a continent (Table 2.5), which resembles the velocity
structure of the Earth to the depth of 1000 km. The model consists of a stack of 24
homogeneous and isotropic layers and has been used for many geophysical studies and
provides an excellent reference. Model 2 is an artificial, inverted profile, which has a low
velocity zone for the fourth layer (Table 2.6). Model 3 is another artificial four-layer
profile, which has a high velocity zone for the second layer (Table 2.6). Three other cases
are designed for the local site scale. Model 4 is a regular stack of 4 homogeneous and
isotropic layers (Table 2.7). Model 5 has a low velocity zone for the second layer and
model 6 has a high velocity layer for the second layer (Table 2.7).
In traditional forward calculation (Chen, 1993; Hisada, 1994, 1995), the modified
R/T coefficients (equation (2.33) ) are calculated followed by the generalized R/T
coefficients (equation (2.36) ). The inverse matrix E-1 is computed following Hisada’s
procedure (1995). The bisection root-searching method is employed to find the phase
velocities from the secular equation (2.38). I code the above calculation in a program
called RTmod, emphasizing the fact that the generalized R/T coefficients are based on the
modified R/T coefficients. In the fast R/T method, the generalized R/T coefficients are
computed directly from equation (2.44) without calculations of the modified R/T
coefficients. Keeping other parts identical, I code the fast R/T method in a program called
RTgen. I perform stability and efficiency tests for RTgen at both crustal and local site
scales.
48
Table 2.5 Gutenberg’s layered model of continental structure
Depth to bottom (km)
19
38
50
60
70
80
90
100
125
150
175
200
225
250
300
350
400
450
500
600
700
800
900
1000
Density (g/cm3)
2.74
3.00
3.32
3.34
3.35
3.36
3.37
3.38
3.39
3.41
3.43
3.46
3.48
3.50
3.53
3.58
3.62
3.69
3.82
4.01
4.21
4.40
4.56
4.63
Vp (km/s)
6.14
6.58
8.20
8.17
8.14
8.10
8.07
8.02
7.93
7.85
7.89
7.98
8.10
8.21
8.38
8.62
8.87
9.15
9.45
9.88
10.3
10.71
11.10
11.35
Vs (km/s)
3.55
3.80
4.65
4.62
4.57
4.51
4.46
4.41
4.37
4.35
4.36
4.38
4.42
4.46
4.54
4.68
4.85
5.04
5.21
5.45
5.76
6.03
6.23
6.32
Table 2.6 Test models at crustal scale
Depth to bottom (km)
M2*
M3
18
20
24
25
30
40
50
∞
∞
Density (g/cm3)
M2
M3
2.80
2.8
2.90
3.4
3.50
3.2
3.40
3.4
3.30
Vp (km/s)
Vs (km/s)
M2
M3
M2
M3
6.00
5.6
3.50
2.5
6.30
6.3
3.65
3.2
6.70
6.1
3.90
2.9
6.00
6.3
3.70
3.2
8.20
4.70
*M2 means model 2; the same for M3
49
Table 2.7 Test models at local site scale
Depth to bottom
(m)
M4* M5
M6
12
11
11
24
23
23
36
35
35
∞
130 130
∞
∞
Density (g/cm3)
M4
2.0
2.0
2.0
2.0
M5
2.0
2.0
2.0
2.0
2.0
M6
2.0
2.0
2.0
2.0
2.0
Vp (m/s)
Vs (m/s)
M4
M5
M6
M4 M5
M6
311.8 1558.8 866.0 180 900 500
519.6 866.0 1558.8 300 500 900
866.0 1212.4 1212.4 500 700 700
1212.4 1732.0 1732.0 700 1000 1000
2551.6 2551.6
1300 1300
*M4 means model 4; the same for M5 and M6
The stability tests of the fast generalized R/T coefficients method are done by
comparing the calculated phase velocities for a given model, by RTgen and by CPS.
Herrmann and Ammon (2002) developed a FORTRAN code to calculate the phase
velocity dispersion curves of surface waves, which is a part of a package called Computer
Programs in Seismology (CPS) (http://mnw.eas.slu.edu/People/RBHerrmann/). CPS has
been widely used to compute the dispersion curves of Rayleigh waves (e.g., Stephenson
et al., 2005). I use CPS to calculate dispersion curves of Rayleigh waves for six test
models and compare with those calculated by our code RTgen. Figures 2.4 shows the
fundamental-mode dispersion curves of Rayleigh waves calculated by RTgen plotted atop
those by CPS. The dispersion curves cover a broad frequency range from 0.01 Hz to 100
Hz. Figure 2.5 shows the normalized errors between phase velocities calculated by
RTgen and CPS. Both calculations yield the normalized errors of 0.001% - 0.01% for all
tested models, indicating RTgen is correct and stable. Higher-mode (up to mode 14)
dispersion curves of Rayleigh waves for all six test models by both CPS and RTgen are
50
almost the same up to 5% accuracy. Figures 2.6 and 2.7 show the results of model 1 and
model 4.
An efficiency test of RTgen is done through comparison of computational time
taken by RTgen and RTmod for models with differing numbers of layers. We coded both
RTmod and RTgen in an identical way except in how to calculate the generalized R/T
coefficients. The coefficients are calculated from the modified R/T coefficients (equation
(2.36) ) in RTmod, and directly computed from equation (2.44) in RTgen.
Both codes ran on Linux Pentium machines, Sun workstations, personal Windows
PCs, and OS X Macintosh machines. Figure 2.8 shows the computational times in
seconds for 20 runs on a 1.33 GHz PowerPC G4 Mac notebook. The 24-layer-model used
for efficiency test is Gutenberg’s Earth model. We delete the lower 2 layers of
Gutenberg’s Earth model to make the 22-layer-model; 4 to make the 20-layer-model; and
so on. The figure clearly shows that RTgen saves 55% computational time for the 4layer-model; 57% for the 14-layer-model; 60% for the 24-layer-model. Tests on other
computer platforms also show at least 50% saving on computational time.
2.8 Improvements on calculation of phase velocity of Love waves
The same idea can be applied to Love waves, which consist of only SH waves. In
another words, we directly calculate the generalized R/T coefficients of Love waves
without knowing the modified R/T coefficients.
For Love waves, the continuity condition at any arbitrary interface j states that
(see Appendix B for each term)
51
 E11j
 j
 E21
E12j   Λ dj ( z j ) 0  Cdj   E11j +1

  = 
E22j   0
1  Cuj   E21j +1
0  Cdj +1 
E12j +1  1


j +1
j 
E22j +1  0 Λ u ( z )  Cuj +1 
(2.45)
The definition of the generalized R/T coefficients implies that
 Cuj = Rˆ duj Cdj
 j +1
j
j
Cd = Tˆd Cd
 j +1 ˆ j +1 ˆ j j
Cu = Rdu Td Cd
(2.46)
where Rˆduj and Tˆdj are the generalized reflection and transmission coefficients of incident
down-going SH-wave to reflected SH-wave and transmitted SH-waves at interface j,
respectively. Directly substituting equation (2.46) into equation (2.45) yields
 I   E11j
 ˆj  = j
 Rdu   E21
−1
E12j   E11j +1
 
E22j   E21j +1

Tˆdj
E12j +1  



E22j +1   Λ uj +1 ( z j ) Rˆ duj +1Tˆdj 
(2.47)
Starting from the last interface where Rˆ duN = 0 , equation (2.47) yields the generalized
reflection and transmission coefficients ( Rˆ duj , Tˆdj ) of Love waves for all interfaces above.
Appendix B gives explicit solutions for these coefficients.
2.9 Numerical examples on Love waves
I use CPS to calculate higher-mode dispersion curves of Love waves for six test
models and compare with those calculated by the code RTgen. Figures 2.9 and 2.10 show
that both calculations yield identical results to 1%-5% for models 1 and 4, indicating the
new version is correct and stable. However, efficiency tests show only a 1-5% speed-up.
This is not a surprise due to the fact that inversion of the 2X2 E matrix (equation (2.47) )
is not major CPU time consuming part on SH-wave cases. Tests on other models lead to
the same conclusion.
52
2.10 Group velocity calculation of surface waves
The calculation of group velocity of surface waves is adopted from Aki and
Richards (2002, chapter 7). For a linear elastic body, the Lagrangian density is the kinetic
energy minus the elastic strain energy.
L=
1
1
ρ ui ui − ( λ (ekk ) 2 + µ eij eij )
2
2
(2.48)
In the case of plane Love wave, where
ekk = 0 and
eij eij = 2((
1 ∂v 2 1 ∂v 2
) +(
) ) and v = l1 ( k , z ,ω ) ei ( kx −ω t )
2 ∂x
2 ∂z
.
Then we have
L=
2∫
∞
0
∂l
1 2 2 1
ρ l1 ω − µ (k 2l12 + ( 1 ) 2 ) and
∂z
2
2
L =
∂l
1 2 2 1
ρ l1 ω − µ (k 2l12 + ( 1 )2 )
∂z
4
4
∞
∞
1
1
1 ∞ ∂l
L dz = ω 2 ∫ ρ l12 dz − k 2 ∫ µ l12 dz − ∫ µ ( 1 ) 2 dz = ω 2 I1 − k 2 I 2 − I 3
0
0
2
2
2 0
∂z
(2.49)
(2.50)
where < > means an averaging process. The energy integrals are
I1 =
1 ∞ 2
1 ∞ 2
1 ∞ ∂l1 2
l
dz
,
I
l
dz
,
I
ρ
=
µ
=
µ ( ) dz
1
2
1
3
2 ∫0
2 ∫0
2 ∫0
∂z
(2.51)
After a complicated derivation, the group velocity in terms of integrals is given
as
U=
∆ω kI 2
I
=
= 2
∆k ω I1 cI1
(2.52)
The partial derivative of phase velocity with respect to model S-wave velocity
(fixed frequency and density) and with respect to model density (fixed frequency and
model s-velocity) are given as
53
[
∂c
]ω , ρ =
∂β
dl1 2
) )
c(k 2 dI 2 + dI 3 )
dz
=
∞
β k 2 I2
µ k 2 ∫ µ l12 dz
ρ c β (k 2 µ l12 + µ (
(2.53)
0
[
cl12ω 2
c
c dI 2 2
∂c
]ω , β = −
= − 2 l12ω 2 = − 2
ω
∞
2k I 2
2k I 2 µ
∂ρ
2k 2 µ l 2 dz
∫
0
(2.54)
1
For a Rayleigh wave, the energy integrals are defined as
1 ∞
1 ∞
2
2
ρ
+
=
(
)
((λ + 2 µ ) r12 + µ r22 )dz
r
r
dz
I
1
2
2
∫
∫
0
0
2
2
∞
dr
dr
dr
dr
1 ∞
I 3 = ∫ (λ r1 2 − µ r2 1 ) dz
I 4 = ∫ (λ + 2µ )( 2 ) 2 + µ ( 1 ) 2 )dz
0
0
2
dz
dz
dz
dz
The group velocity in terms of integrals is given as
I1 =
U=
∆ω
=
∆k
I2 +
I3
2k
cI1
(2.55)
(2.56)
The partial derivative of phase velocity with respect to model S-wave velocity (fixed
frequency, density, and model p-wave velocity) and with respect to model density (fixed
frequency, model s-wave velocity, and model p-wave velocity) are given as
[
∂c
1 
1
2k 2 4k 1 
]ω , ρ ,α = 2
(kr2 ) 2 + I 42 −
I −
I

∂β
µ
µ 3 λ 3 
4k UI1 
[
∂c
1
ω2
α 2 k 1 1 1 2kα 2 1
]ω , β ,α = 2 [ I1 −
I − I −
I ]
ρ 1 ρ 4
λ 3
∂ρ
4k UI1 ρ
(2.57)
(2.58)
2.11 Published calculation codes
While many codes to calculate dispersion curves have been developed as part of
research, they are generally not published. Based on the transfer matrix method,
Herrmann and Ammon (2002) developed a FORTRAN code to calculate the phase
54
velocity dispersion curves of surface waves. The code is part of package called Computer
Programs in Seismology (CPS). The CPS also provides modal summation synthetic
seismograms. Yuehua Zeng implemented R/T methods using FORTRAN for calculating
the phase velocity dispersion curves of surface waves. Saito’s code (1988), original in
FORTRAN and later adopted in C by Yuehua Zeng, is based on numerical integration.
Matlab code for isotropic dispersion is listed in Chapman (2003).
2.12 RTgen
Our own code, called RTgen, is based on the R/T method. It is written in C and
has been commercialized through SeisOpt® ReMi™ from Optim Inc. For both Rayleigh
and Love waves, RTgen is able to calculate
1). Phase and group velocities from fundamental mode up to any higher mode
that potentially exists;
2). Partial derivative of phase velocity with respect to model S-wave velocity
(fixed frequency, density, and/or model p-wave velocity) and with respect to model
density (fixed frequency, model s-wave velocity, and/or model p-wave velocity);
3). Coefficients of displacement and traction or normalized eigendisplacements
and eigentractions.
ReMi is using Saito’s code (1988) as its forward engine. The code uses a
numerical integration method for computing dispersion curves of surface waves. It is
adapted from Fortran to C by Yuehua Zeng of Nevada Seismological Laboratory in 1992.
Unlike RTgen, the adapted Saito’s code can only produce the phase velocity dispersion of
the fundamental-mode Rayleigh waves.
55
incident SH
transmitted SH
reflected SH
Interface
incident SH
transmitted SH
(a)
Interface
x
(b)
z
x
reflected SH
z
Figure 2.1. Illustration of coefficients of reflection and transmission due to SH incident
down (a) and up (b) to an interface.
incident SV
transmitted SV
reflected SV
transmitted P
reflected P
Interface
Interface
x
transmitted P
reflected P
incident SV
transmitted SV
(a)
(b)
z
incident P
reflected SV
z
transmitted SV
reflected SV
reflected P
Interface
transmitted P
Interface
x
transmitted P
transmitted SV
(c)
z
x
x
reflected P
incident P
(d)
reflected SV
z
Figure 2.2. Illustration of coefficients of reflection and transmission
due to SV incident down (a) and up (b) to an interface and P incident
down (c) and up (d) to an interface.
56
free surface
x
μ1, λ1, ρ1
μ2, λ2, ρ2
μj-1, λj-1, ρj-1
z (1)
:
≈:
μj, λj, ρj
z (2)
z (j-1)
z (j)
μj+1, λj+1, ρj+1
μN-1, λN-1, ρN-1
z (0)
:
≈:
μN, λN, ρN
z (j+1)
z (N-1)
z (N)
μN+1, λN+1, ρN+1
z
Figure 2.3. Configuration and coordinate system of a multiplelayered half-space.
57
4.2
1200
4
1000
model 1____
3.8
800
____model 2
3.4
Vs (m/s)
Vs (km/s)
3.6
3.2
3
____model 5
600
____model 6
400
____model 4
____model 3
2.8
200
2.6
2.4
0
20
40
60
Period (s)
80
0
100
0
0.2
(a)
0.4
0.6
Period (s)
0.8
1
(b)
Figure 2.4. Phase velocity dispersion curves of the fundamental-mode Rayleigh waves for large scale
models 1, 2, and 3 (a) and small scale models 4, 5, 6 (b). In (a) the crosses are phase velocities calculated by RTgen and the circles by CPS. In (b) the crosses are phase velocities calculated by RTgen and
the circles and diamonds by CPS. Note that models 5 and 6 are closely overlapped.
x 10
−5
12
0.8
10
0.6
8
0.4
6
0.2
4
(Vs1−Vs2)/Vs2
(Vs1−Vs2)/Vs2
1
0
−0.2
0
−2
−0.6
−4
−0.8
−6
0
20
40
60
Period (s)
(a)
80
100
−5
2
−0.4
−1
x 10
−8
0
0.2
0.4
0.6
Period (s)
0.8
(b)
Figure 2.5. The normalized errors between phase velocities calculated by RTgen and CPS of
large scale models 1, 2, and 3 (a) and small scale models 4, 5, 6 (b). Vs1 of Y-axis in both
graphs represents the phase velocities calculated by CPS and Vs2 by RTgen. The normalized
errors fall into 0.001% in (a) and 0.01% in (b).
58
1
6.5
700
6
600
5.5
5
Vs (m/s)
Vs (km/s)
500
4.5
300
4
200
3.5
3
400
0
20
40
60
Period (s)
80
100
Figure 2.6. Phase velocity dispersion curves
of Rayleigh waves for the Gutenberg model.
The crosses are phase velocities from fundamental up to the 14th higher mode calculated by RTgen. The circles are the corresponding mode phase velocities by CPS.
100
0
0.2
0.4
0.6
Period (s)
0.8
1
Figure 2.7. Phase velocity dispersion curves of
Rayleigh waves for model 4. The crosses are
phase velocities from fundamental up to the
6th higher mode calculated by RTgen. The
circles are the corresponding mode phase
velocities by CPS.
59
160
RTgen
RTmod
140
Figure 2.8. Computational time against number of
layers in models. The solid line represents computational time of 20 iterations of phase velocity calculation of fundamental-mode Rayleigh waves taken by
RTgen, and the dash line by RTmod. Clearly, the fast
generalized R/T method cuts the computational time
at least by half.
Computational Time (s)
120
100
80
60
40
20
5
10
15
Number of Layer
20
6.5
700
6
600
5.5
500
Vs (m/s)
Vs (km/s)
0
5
400
4.5
300
4
200
3.5
0
20
40
60
Period (s)
80
100
Figure 2.9. Phase velocity dispersion curves
of Love waves for the Gutenberg model.
The crosses are phase velocities from fundamental up to the 14th higher mode calculated by RTgen. The circles are the corresponding mode phase velocities by CPS.
100
0
0.2
0.4
0.6
Period (s)
0.8
Figure 2.10. Phase velocity dispersion
curves of Love waves for model 4. The
crosses are phase velocities from fundamental up to the 6th higher mode calculated by RTgen. The circles are the corresponding mode phase velocities by CPS.
60
1
Chapter 3 Linearized inversion of surface-wave dispersion
Solutions of geophysical inversion problems are not unique in the sense that there
are many models that explain the data equally well (Tarantola, 1987). There are a number
of reasons for this. Firstly, the theoretical error occurs due to use of an inexact theory in
the prediction of theoretical data. For example, the use of 1D model could cause
theoretical errors since in most cases the earth is truly 3D. Secondly, the observational
errors are ubiquitous. They could be measurement errors, instrumental errors, numerical
truncations. The third reason is the most fundamental. In many inverse problems the
model that one aims to determine is a continuous function of the space variables. This
means that the model has infinitely many degrees of freedom. However, in a realistic
experiment the amount of data that can be used for the determination of the model is
usually finite. A simple count of variables shows that the data cannot carry sufficient
information to determine the model uniquely. In the context of linear inverse problems
this point has been raised by Backus and Gilbert (1967, 1968) and more recently by
Parker (1994). This issue is equally relevant for nonlinear inverse problems.
The model obtained from the inversion of the data is therefore not necessarily
equal to the true model that one seeks (Snieder and Trampert, 1999). This implies that for
realistic problems, inversion really consists of two steps. Let the true model be denoted
by m and the data by d. From the data d one reconstructs an estimated model m̂ , this is
called the estimation problem (Fig. 3.1). Apart from estimating a model m̂ that is
consistent with the data, one also needs to investigate what relation the estimated model
61
m̂ bears to the true model m. In the appraisal problem one determines what properties of
the true model are recovered by the estimated model and what errors are attached to it.
Therefore, inversion = estimation + appraisal. It does not make much sense to make a
physical interpretation of a model without acknowledging the presence of errors and
limited resolution in the model (Trampert, 1998).
There are many excellent textbooks on inversion problems (e.g. Aster et al., 2005;
Tarantola, 2005; Snieder and Trampert, 1999; Park, 1994; Menke, 1989). Before
discussing the case of the surface wave data inversion, some of the basic concepts of the
inversion will be briefly presented to introduce the tools that will be used later.
3.1 Linear model estimation
Suppose that we have a discrete n-point model m and discrete m-point data
vector d that in practice are contaminated with errors e. The recorded data and the model
are related through some fundamental physics by a linear system of equation
d = Gm + e
(3.1)
where G is the data kernel matrix.
From the recorded data one makes an estimate of the model m̂ . There are many
ways for designing an inverse operator that maps the data on the estimated model (e.g.
Menke, 1989; Tarantola, 1987; Parker, 1994). Whatever estimator one may choose, the
most general linear mapping from data to the estimated model can be written as:
ˆ = G−gd
m
(3.2)
The operator G-g is called the generalized inverse of the matrix G. In general, the
number of data is different from the number of model parameters. For this reason G is
62
usually a non-square matrix, and hence its formal inverse does not exist. Later we will
show how the generalized inverse G-g may be chosen, but for the moment G-g does not
need to be specified. The relation between the estimated model m̂ and the true model m
follows by inserting (3.1) in expression (3.2):
ˆ = G − g Gm + G − g e
m
(3.3)
The matrix operator G-gG, called the resolution kernel R, gives
R ≡ G−gG
(3.4)
Equation (3.3) can be interpreted by rewriting it in the following form:
−g
ˆ = m + (G − g G − I )m + G
m
e
ErrorN
Pr opagation
(3.5)
lim ited Re solution
ˆ =m ,
In the ideal case, the estimated model equals the true model vector: m
meaning that our chosen parameters, specified in vector m, may be estimated
independently from each other. The last two terms in equation (3.5) account for
“blurring” and artifacts in the estimated model. The term (G − g G − I )m describes the fact
that components of the estimated model vector are linear combinations of different
components of the true model vector. We only retrieve averages of our parameters and
“blurring” occurs in the model estimation as we are not able to map out the finest details.
In the ideal case this term vanishes; this happens when G-gG is equal to the identity
matrix (the perfect resolution: R ≡ I ). The trace of R ( Tr(R) )is often used as a simple
quantitative measure of the resolution. If Tr(R) is close to the number of layer of models,
then R is relatively close to the identity matrix.
The last term in equation (3.5) describes how the errors e are mapped onto the
estimated model. These errors are not known deterministically (otherwise they could be
63
subtracted from the data). A statistical analysis is needed to describe the errors in the
estimated model due to the errors in the data. When the data dj are uncorrelated and have
standard deviation σdj , the standard deviation σmi in the model estimate m̂ , resulting
from the propagation of data errors only, is given by
σ mi2 = ∑ (Gij− gσ dj2 )2
(3.6)
j
where i = 1, 2, …, n and j = 1, 2, …, m.
Ideally, one would like to obtain both: a perfect resolution and no errors in the
estimated model. Unfortunately this cannot be achieved in practice. The error
propagation is, for instance, completely suppressed by using the generalized inverse G-g =
ˆ = 0 which is indeed not affected by errors.
0. This leads to the (absurd) estimated model m
However, this particular generalized inverse has a resolution matrix given by R=0, which
is far from the ideal resolution matrix. Hence in practice, one has to find an acceptable
trade-off between error-propagation and limitations in the resolution.
3.2 Solving a linear system
Considering a discrete linear problem we begin with a data vector d of m
observations and a model vector m of n model parameters that we wish to determine. The
inversion problem can be written as a linear system of equations
Gm = d
(3.7)
For this simple case, the solutions to equation (3.7) can be obtained with different
approaches from different viewpoints: the length method, the generalized inverse, the
64
maximum likelihood method (Menke, 1989). The follow are the most commonly used
methods.
3.2.1 Least-squares estimation
In some problems the number of unknowns is more than the number of
parameters (over-determined problems). A common way to estimate a model is to seek
the model m̂ that gives the best fit to the data in the sense that the difference, measured
ˆ is made as
by the L2-norm, between the data vector d and the recalculated data Gm
small as possible. This means that the least-squares solution is given by the model that
minimizes the following cost function
S =|| d − Gm ||2
(3.8)
This quantity is minimized by the following model estimate (Aster et al., 2005)
ˆ = (G T G ) −1 G T d
m
(3.9)
3.2.2 Minimum norm estimation
In some problems the number of unknowns is less than the number of parameters
(under-determined problems). The minimum norm solution is defined as the model that
fits the data exactly, Gm = d, and that minimizes || m ||2 . As shown in detail by Menke
(1989) the minimum-norm solution is given by:
ˆ = G T (GG T ) −1 d
m
(3.10)
3.2.3 Mixed determined problems
In the least-squares estimation, we assumed that we had enough information to
evaluate all model parameters, even though contradictions occurred due to measurement
errors. The problem is then purely over-determined and as a consequence GTG is regular,
65
which means its inverse matrix always exists. In the minimum norm solution, we
assumed no contradictions in the available information, but we don't have enough
equations to evaluate all model parameters. This is the case of a purely under-determined
problem and here GGT is regular. The most common case, however, is that we have
contradictory information on some model parameters, while others cannot be assessed
due to a lack of information. Then neither GTG nor GGT can be inverted and the problem
is ill-posed. Even if the inverse matrices formally exist, they are often ill-conditioned
meaning that small changes in the data vector lead to large changes in the model
estimation. This means that errors in the data will be magnified in the model estimation.
Clearly a trick is needed to find a model that is not too sensitive on small changes in the
data. To this effect, Levenberg (1944) introduced a damped least-squares solution
ˆ = (G T G + α I) −1 G T d
m
(3.11)
2
Equation (3.11) minimizes the cost function G (m) − d + α m
2
(3.12)
3.3 Regularization
The initial system of equation that we are solving is equation (3.7) is not quite
correct. We ignore the error e which will always be present. To compensate errors, we
transform the model parameters through matrix L
and the data vector through Q
G
d = Qd
G
m = Lm
(3.13a )
(3.13b)
Assume that L has an inverse. The transformed system of equations can be
derived by substituting equation (3.13) into equation (3.7)
G G
QGL−1m = d
(3.14)
66
Damped least-squares solution for equation (3.14) is then given as
ˆ = (G T QT QG + α LT L) −1 G T QT Qd
m
(3.15)
Let QT Q = Wd and LT L = Wm , equation (3.15) is then given by
ˆ = (G T Wd G + α Wm ) −1 G T Wd d
m
(3.16)
This solution minimizes the following cost function:
S = (d − Gm)T Wd (d − Gm) + α mT Wmm
(3.17)
This expression shows that in general the weight matrices Wd and Wm can be
anything. Written in this way, α may be seen as a trade-off parameter which compromises
between two characteristics of the model: its size and its disagreement with the data. Both
independent properties of the model cannot be arbitrary small simultaneously. Hence
there is a need for a balance. The choice of an optimum, however, is not an easy job.
There is no reason to favor the damped least-squares solution (3.11) over the more
general least-squares solution (3.16) (Snieder and Trampert, 1999). In fact, most inverse
problems are ill-posed (partly underdetermined and partly over-determined) and illconditioned (small errors in the data causes large variations in the model) which goes
hand in hand with large null-spaces and hence non-unique solutions. Regularization is
thus needed, but there is a large ambiguity in its choice (Scales and Snieder, 1997). This
reflects the fundamental difficulty that one faces in solving inverse problems: solving the
system of equations is a minor problem, compared to choosing the regularization.
One approach is to define the misfit function in such a way that it favors models
with given properties (for example the smoothest model) (Parker, 1994). As an example
of the choice of the weighting matrices Wm, the use of Occam's inversion is quite
67
common (Constable et al., 1987) where one seeks the smoothest model that is consistent
with the data. Instead of putting a constraint on the model length, one seeks the square of
its gradient to be as small as possible.
Smoothing model is implemented through the Tikhonov regularization. For
simplification we assume Q = I, the equations (3.16) and (3.17) could be written as
ˆ = (G T G + α LT L) −1 G T d
m
2
S = d − Gm +α 2 Lm
(3.18) and
2
(3.19)
where L is called the roughening (smoothening) matrix and α is the damping factor.
L could take several forms.
L=I
(3.20a ) called the zero-order Tikhonov regulization
0 0

 −1 1




L=
−1 1


"



−1 1 n×n
0 0 0
0 0 0

L = 1 -2 1

"


(3.20b) the first-order Tikhonov regulization






1 −2 1 n×n
(3.20c) the second-order Tikhonov regulization
3.4 Singular value decomposition (SVD)
Conveniently, every matrix has a singular value decomposition (SVD). An m by
n matrix G could be factored into
G m×n = U m×mS m×n VnT×n
(3.21)
68
where U is an m by m orthogonal matrix with columns that are unit basis vectors
spanning the data space Rm. V is an n by n orthogonal matrix with columns that are unit
basis vectors spanning the model space Rn. S is an m by n diagonal matrix with
nonnegative diagonal elements called singular values and S1 ≥ S2 ≥ " ≥ S min ≥ 0 . If only
S
the first p singular values are nonzero, we can partition S as S =  p
0
and G as G =  U p
S
U 0   p
0
0
 Vp
0  
V0 
T
0
0 
(3.22b)
We can simplify the SVD of G into its compact form G = U p S p VpT
The
generalized
G † = Vp S −p1UTp
inverse
of
G,
(3.22a)
called
Moore-Penrose
(3.23)
pseudoinverse,
is
(3.24)
And the pseudoinverse (generalized inverse) solution is
m † = Vp S −p1UTp d
(3.25)
The pseudoinverse solution is a least squares solution. But, unless p=n, the solution is
biased. The covariance of the solution is cov(m ) = σ
†
The model resolution matrix is R m = G †G = Vp VpT
The data resolution matrix is R d = GG † = U p UTp
2
p
V.,iV.,Ti
i =1
si2
∑
(3.26)
(3.27 a )
(3.27b)
3.5 Proposed linear inversion algorithm
This dissertation uses Occam’s inversion technique (Constable et al., 1987) with
a higher-order Tikhonov regulization. The proposed inversion procedure contains two
69
fundamental components. First, an algorithm (forward engine) is required to construct a
theoretical dispersion curve based on the properties of an assumed profile. Second, an
algorithm is required to minimize the objective function which is usually the error
between the theoretical and experimental dispersion curves plus a damping term. Table
3.1 summarized the linear inversion methods of shallow surface waves used by major
research groups. Different groups used different forward engines. But all of them used
the generalized linear inversion algorithm by Wiggins (1972) with different damping
terms, iteration methods, and methods to calculate the partial derivative matrix (Jacobian
matrix). The advantage and disadvantage of these methods will be discussed in the next
sections.
Table 3.1 Linearized inversion methods of surface waves used by major research groups
Reference
Horike
(1985)
Ganji et al.
(1998)
Xia et al.
(1999)
This study
Forward
engine
Numerical
integral
Stiffness
matrix
Tranfer
matrix
R/T
method
Dispersion
method
Inversion method
Partial
derivatives
Data
errors
FK
Newton
Analytic
Neglected
Numerical
Neglected
Numerical
Neglected
Analytic
Modelled
SASW
MASW
Combined Newton
and quasi-Newton
LevenbergMarquardt (LM)
ReMi
Occam
Suppose that we have a discrete n-point model m and discrete m-point data
vector d that are related by a nonlinear system of equation G (m) = d
(3.28)
We can formulate this problem as a damped least squares problem. In another words, we
2
minimize G (m) − d 2 + α 2 Lm
2
2
(3.29)
where L is the roughening matrix and α is the damping factor.
70
As the first step of inversion process, we need linearize the nonlinear equation
(3.28). Given a trial model mk, Taylor’s theorem is applied to obtain the local
approximation
G (m k + ∆m) ≈ G (m k ) + J (m k )∆m = d
(3.30)
where J (m k ) is the Jacobian matrix J (m k ) =


k
∂G1 ( m k ) 
 ∂G1 ( m )
"


∂m1
∂mn




#
%
#


 ∂G ( m k )
k
∂G m ( m ) 
 m
"


∂m1
∂mn


(3.31)
Using the new R/T method described in Chapter 2, the Jacobian matrix is analytically
calculated (equations (2.57) and (2.58) ).
Using equation (3.30), the damped least squares problem (equation (3.29) ) is to
minimize
2
2
2
2
G (m k ) + J (m k )∆m − d + α 2 L(m k + ∆m)
(3.32)
where the variable is ∆m and m k is constant. Reformulating this as a problem in which
the variable is m k +1 = m k + ∆m
(3.33)
and letting d (m k ) = d − G (m k ) + J (m k )m k
(3.34)
2
2
2
2
give J (m k )(m k + ∆m) − (d − G (m k ) + J (m k )m k ) + α 2 L(m k + ∆m)
2
or minimize J (m k )m k +1 − d (m k ) + α 2 Lm k +1
2
2
2
(3.35)
(3.36)
Because J (m k ) and d (m k ) are constant, equation (3.36) is a damped least squares
problem J (m k )m k +1 = d (m k )
(3.37)
The solution is given as m k +1 = (J (m k )T J (m k ) + α 2 LT L) −1 J (m k )T d (m k )
(3.38)
71
The damped least squares solution given by equation (3.38) assumes Q = I
(equation (3.14) to (3.17) ). To incorporate the data standard deviation σ dj into a solution,
I use a weighting matrix Q = diag (1/ σ 1 ,1/ σ 2 ," ,1/ σ m )
(3.39)
The damped least squares solution (3.38) would be in the form of
ˆ k +1 = ((QJ (m k ))T (QJ (m k )) + α 2 LT L) −1 (QJ (m k ))T Qd (m k )
m
(3.40)
Fig. 3.2 shows the flowchart of the Occam’s inversion which implemented as
follows:
1). Set an initial model (S-wave and P-wave velocities, thickness, and density);
2). Calculate the theoretical dispersion curves G(mk) and Jacobian matrix J(mk) of the
current model;
3). Set a range of damping parameters (α0, α1, α2, …, αN), usually logarithmic;
4). For each damping parameters αi
a) calculate and save a new model mk+1(αi) based on equation (3.40);
b) and the corresponding RMS errors RMSk+1(αi);
5). Select the model mk+1(αj) based on the following criterions: If two or more trial
models give RMS errors below a user-specific threshold ε1, the largest α is preferred.
If no such value exists, then pick a value of αj that minimizes the RMSk+1(αj);
6). Check the model mk+1(αj) for convergence, in another words, check that the
RMSk+1(αj) is smaller than a user-specific threshold ε2.
7). If the model does not converge, pass the model to next iteration (Step2) until
maximum iterations reached.
72
3.6 Model appraisal method
In a surface wave survey (like ReMi survey), uncertainty is related to the
dispersion calculation (theory errors), the manual picking of dispersion curves, and the
data measurements (Table 3.2). The first is caused by improper application of dispersion
theory from 1D model into real 3D earth geologies. The second is due to transformation
from time-offset domain to frequency-slowness domain and the human bias during
dispersion curve picking from ReMi p-f image. The human bias may be avoided by
averaging dispersion curve picks from several analysts. The last category (measurement
uncertainties) is due mainly to noise in the recorded signals and to instrument
uncertainties.
Table 3.2 Sources of uncertainty in surface wave dispersion measurements
Theoretical errors
Picking errors
Instruments
Measurement errors
Signal noise
Correlated
Uncorrelated
The instrument uncertainties are related to limited frequency range of geophones,
orientation and spatial spacing of the array, limited recording length, and others. Using
numerical simulations, O’Neill (2003) reports minimal influence from geophone tilt and
coupling, while adding noise in the recorded signals introduce larger uncertainties in the
experimental dispersion. By the data gathered at two sites in Italy, Lai et al. (2005)
experimentally verified that the experimental dispersion data are normally distributed.
Uncertainties in recorded signals are associated with coherent noise and
uncorrelated noise. The latter is externally generated noise (environmental noise) and can
73
be studied via the statistical distribution of the recorded signals if many repetitions of the
test in a given configuration are available. Coherent noise is due to events generated by
the seismic source (e.g., near-field effects) or coherent noise associated with ambient
noise. The proposed approach by this study can be used to estimate the effect of
uncorrelated noise on surface wave measurements and its propagation in inversion.
It can be shown that if an inverse problem is nonlinear, a Gaussian distribution of
the errors in the data will in general be mapped into a distribution of the uncertainty of
the model parameters that is non-Gaussian (Tarantola, 1987; Menke, 1989). However, if
the inverse problem is not too non-linear, especially around the point of maximum
likelihood in the model space (i.e., the probability distribution of model parameters
around the point of maximum likelihood is not too different from a Gaussian function), it
is possible to use a simplified approach to estimate the uncertainty of the model
parameters (Lai et al., 2005).
If in solving the non-linear inverse problem G (m) = d , it is assumed that: a) the
uncertainty of the experimental dispersion curve follows a Gaussian distribution; b) the
inverse problem (equation (3.37) ) is only moderately non-linear around its solution, and
c) the relation G (m) = d is inverted using the method of maximum likelihood, then the
uncertainty associated with the expected shear wave velocity profile can be approximated
by the following formula (Tarantola, 1987; Menke, 1989):
ˆ ) = cov(Hd ) = H cov(d )HT
cov(m
(3.41)
From equation (3.40), we obtain the
H = ((QJ (m k ))T (QJ (m k )) + α 2 LT L) −1 (QJ (m k ))T Q
(3.42)
74
ˆ ).
The standard deviations of each model parameter are the root of diagonal of cov(m
Assuming a normal distribution associated with model parameter mi, the 95% confidence
ˆ ± 1.96 diag (cov(m
ˆ ))
intervals are given by m
(3.43)
The concept of model resolution is an important way to characterize the bias of
the linear inversion. For a linear system (equation (3.7) ), the model resolution matrix R
is defined as equation (3.4). The rows of R are the resolution kernels for each parameter.
The difference from the identity matrix illustrates the degree of non-uniqueness of each
parameter. The R would be the identity matrix if all parameters are perfectly resolved
(uniquely determined). The trace of R is often used as a simple quantitative measure of
the resolution. If the trace of R is close to total number of parameters estimated, R is
relatively close to the identity matrix.
Substituting d (m k ) of equation (3.37) into equation (3.40) yields
ˆ k +1 = ((QJ (m k ))T (QJ (m k )) + α 2 LT L) −1 (QJ (m k ))T QJ (m k )m k +1
m
(3.44)
Therefore, the model resolution matrix of the linearized equation (3.37) is
R = ((QJ (m k ))T (QJ (m k )) + α 2 LT L) −1 (QJ (m k ))T QJ (m k )
(3.45)
3.7 Test data sets and numerical tests
Sheriff (1991) sets out four canonical types of earth resistivity profiles. By
analogy we can generalize the velocity profiles into the four similar types. An A-type
velocity profile is a section where Vs increases with depth at all interfaces; H-type has a
low-velocity layer; K-type has a high-velocity layer; and a Q-type where Vs decreases
with depth at all interfaces. Q-type profiles may be encountered in evaporate settings.
75
However, in our collecting of shallow Vs profiles at hundreds of sites, we have not yet
observed a Q-type profile. Thus we do not test that type here.
The proposed Occam’s linearized inversion method described above was tested
on a suite of nine synthetic models and two field data sets. Nine synthetic models were
designed based on data collected in southern Nevada. Seven models are A-type sections,
one is H-type, and one is K-type. These synthetic models have been used to test the ReMi
technique (Heath et al., 2006). The synthetic trace data were generated using the finite
difference code, E3D (Larsen and Grieger, 1998), and recorded on a virtual 185m-long
linear array with a 15m-spacing and a 200 sample-per-second rate. The E3D is an explicit
2D/3D elastic finite-difference wave propagation code that has been successfully used for
the modeling of seismic waves (Larsen and Grieger, 1998; Liu et al., 2005). Flat-layered
models and a Ricker wavelet source placed at the free surface were used. The wavefield
was simulated on a 2-D grid (dt = 0.0001 s and dh = 0.0005 km) with the Courant
condition and Clayton-Engquist absorbing boundary condition (Clayton and Engquist,
1977, 1980) satisfied. The shortest wavelengths in the calculation cover 25 grid points to
completely avoid grid dispersion. As suggested by Park et al. (1999), near-offsets
(distance between source and first receiver) are given to be greater than half the
maximum desired wavelength to allow the development of surface waves. No noise is
added to the trace data. Figure 3.3 shows an example of a synthetic seismic record.
Clearly Rayleigh waves dominate the record. The ReMi dispersion curves are
independently picked from 2.9 Hz to 20.5 Hz in frequency. Figure 3.4 shows a typical
ReMi dispersion picks on the slowness-frequency (p-f) image.
76
The linearized inversion of the synthetic Rayleigh-wave phase velocity dispersion
curve picks is carried out by inverting for S-wave velocity and fixing density and
Poisson’s ratio at the original values (2.0 g/cm3 and 0.25 of the original synthetic models,
respectively) even though the Poisson’s ratio is low for soils. It is well-accepted that Vs
has the most dominant effect on the Rayleigh wave dispersion followed by layer
thickness. This has been shown by numerical partial derivatives at both crust-mantle
scale (Burkhard and Jackson, 1976) and local site scale (Xia et al., 1999). Xia et al. (1999)
showed that a 25% error in Vp or density of all layers only induces a 10% perturbance of
the fundamental-mode dispersion curves. Therefore constant density and Poisson’s ratio
will not significantly affect results of the inversion.
Due to unavailability of any a prior geological information for the synthetic
simulation, initial models are blindly constructed in the following way for all synthetic
test data. The number of layers (N), Poisson ratio, the maximum depth (Dmax), the
maximum (Vmax) and the minimum (Vmin) S-wave velocities are users-specified
quantities. The velocity gradient is defined as
∇V = (Vmax - Vmin ) / Dmax
(3.46)
The initial model is composed of layers of equal thickness of Dmax/N. The calculated
velocity from the velocity gradient at the middle point of each layer is taken as S-wave
velocity of that layer. P-wave velocity is updated from the corresponding S-wave velocity
based on the given Poisson ratio. Density is fixed at 2.0 g/cm3.
Based on the initial model described above and assuming no data noise (equation
(3.38) ), the inverted S-wave velocity profiles are plotted atop nine original synthetic
models (left column, Fig. 3.5). The 95% confidence intervals of S-wave velocity of each
77
layer are calculated from equation (3.43) and plotted as horizontal error bars. Fig. 3.5 also
shows the resolution matrix and calculated dispersion of the corresponding model against
ReMi dispersion picks. The resolution matrix R describes how the term G − g of equation
(3.5) smears out the original true model, m, into an estimated model, m̂ . In these
synthetic tests, if the trace of R ( Tr(R) ) is close to three (the number of layer of the
synthetic models), then R is relatively close to the identity matrix (Fig. 5.3).
The matches between the inverted models and original synthetics are not bad.
There are some bad-match cases (for example model 5, Fig. 3.5). Almost all of the first
layers are resolved as shown on left column of Fig. 3.5 as well as the resolution matrix
whose first elements are almost units for all nine synthetics. The H-type and K-type
models are generally recovered. High velocity layer of model 5 clearly is an inversion
artifact probably due to bad assignment of layer thickness.
On the contrast, the traces of R ( Tr(R) ) of almost all inverted models are close to
three, indicating a good model resolution. The discrepancy between bad model matches
and good model resolutions is caused by the inversion errors (from equation (3.5) ),
caused by linearization, picking, instruments, and correlated and uncorrelated signal
noise sources (Table 3.2). The last section of this chapter shows that incorporation of the
uncorrelated signal noise into inversion cause a little change in final models, indicating
that errors by linearization, picking, instruments, and correlated signal noise sources
causes the major uncertainties in the final inverted models.
The inverted synthetic models can also be evaluated by how well they recover the
original values of Vs averaged to 30 (V30), 50 (V50), and 100 (V100) m as shown in Fig.
3.6 and tabulated in Table 3.3 in which these values have been rounded to the one
78
decimal place. V30 is a major factor in prediction of earthquake amplification and site
response in sedimentary basins (Field et al., 1992). These values are well recovered.
Average V30 error is 5% over all nine synthetic models. Average V50 and V100 errors are
7% and 6%, respectively, over all nine synthetic models.
Table 3.3. Depth-averaged velocities in m/s for Occam’s inverted
models and percentage difference from known profiles in parentheses
Models
V30
V50
V100
model 1
479 (2%)
539 (1%)
607 (1%)
model 2
557 (4%)
630 (1%)
700 (1%)
model 3
609 (6%)
651 (11%)
838 (1%)
model 4
943 (7%)
1096 (14%)
1340 (13%)
model 5
519 (6%)
599 (7%)
622 (1%)
model 6
471 (4%)
502 (7%)
544 (10%)
model 7
873 (10%)
997 (8%)
1228 (9%)
model 8
449 (6%)
449 (8%)
491 (10%)
model 9
912 (1%)
1070 (6%)
1127 (10%)
Newhall
320 (19%)
365 (4%)
464 (3%)
CCOC
216 (5%)
258 (6%)
336 (11%)
Two field data sets were used to test the linearized inversion. One of them was
obtained at the Los Angeles County Fire Station in Newhall, California by the Resolution
of Site Response Issues from the Northridge Earthquake (ROSRINE) project
(http://geoinfo.usc.edu/rosrine). Another is from the Coyote Creek borehole (CCOC) in
Santa Clara Valley, California (Stephenson et al., 2005). Both data sets include OYO
suspension S-wave velocity logs (see Chapter 1 for OYO technique).
79
The Newhall ReMi experiment was performed using a 200m-long array of
twenty-four 8-Hz vertical geophones centered 4 m from the hole. The geophones
recorded the vertical component of microtremors generated by nearby trains and street
traffic. The picks are independently made from 2.44 Hz to 22.22 Hz. The ReMi survey at
CCOC was conducted by USGS (Wentworth and Tinsley, 2005; Asten et al., 2005c)
using a 220m-long array of forty-five, 4.5 Hz vertical geophones, which is located 200m
southwest of the CCOC borehole. The ReMi picks are independently made from 1.22 Hz
to 24.66 Hz (Fig. 3.7). In both data sets, all acquired records were used in the p-f analysis
unless amplitudes within a given record were clipped.
Inversion starts with an initial model generated in the same way as the synthetic
cases described above. The maximum modeling depths were estimated using the halfmaximum-wavelength discussed by Park et al. (1999). This criterion has been used by
Xia et al. (1999), and Stephenson et al. (2005). Another critical issue in dispersion
inversion is the number of layers. The more layers, the more computation effort, and the
more realistic the inverted models (Dal Moro et al., 2007). Based on synthetic experience,
the number of layers for both tests is set as 6.
Fig. 3.8 displays the inverted models for both the Newhall and CCOC data. The
inverted S-wave velocity for the Newhall data follows the OYO suspension S-wave
velocity log. The significant increase of the S-wave velocity at a depth of about 20 m is
shown on the inverted model. Fits of the calculated dispersion curve against the ReMi
picks is generally good (row 1 column 2 in Fig. 3.8). The traces of R ( Tr(R) ) of the
inverted model is 3.96 out of 6.0. Average Vs errors of the Newhall data are 7% for V30,
4% for V50, and 5% for V100 (Fig. 3.6 and Table 3.3).
80
The maximum depth of the inverted profile for the Coyote Creek data is over two
times deeper than that of the Newhall data. The inverted S-wave velocity of the Coyote
Creek data loosely follows the OYO suspension S-wave velocity log. Fits of the
calculated dispersion curve against the ReMi picks is good at low frequencies and bad at
high frequencies (row 2 column 2 in Fig. 3.8). The traces of R ( Tr(R) ) of the inverted
model is 4.16 out of 6.0. Average Vs errors of the CCOC data are 10% for V30, 5% for
V50, and 1% for V100 (Fig. 3.6 and Table 3.3).
3.8 Initial model dependence
The initial model has an important role on linearized inversions (Tarantola,
1987). A final inverted model determined by linearized inversions inherently depends on
an assumed initial model due to the existence of locally optimal solutions (Yamanaka and
Ishida, 1996). When an appropriate initial model can be generated using a priori
information about subsurface structure, linearized inversions may find an optimal
solution that is the global minimum of a misfit function. If a priori information is either
scant or unavailable, the inversion may find a local optimal solution. Luke et al. (2003)
showed that linear inversion yields excellent dispersion results for simple profiles.
However, for more complex profiles multiple solutions with equally good data fits are
possible.
As far as the initial model is concerned, two aspects have to be considered: the
number of layers and the S-wave velocity of each layer. Using Newhall data, I design two
cases to exam the initial-model-dependence: the effect of number of layers and the effect
of layer S-wave velocities.
81
Fig. 3.9 shows the effect of the number of layers using Newhall dispersion data.
The initial model is composed of equal thickness layers. All layer velocities are set to a
fixed value. Inversion with a few layers (for example 4 layers) cannot fit OYO
suspension S-wave log well. The initial models with the limited number of layers yields
high impedance contrasts between the layers (for example the first and second layer of
the four-layer-model, Fig. 3.9). When the number of layers is increased, the site
characteristics are better identified. The inversion with 10 layers (Fig. 3.9) shows a better
fit. This inverted model shows a quite smooth increase of shear velocity with depth. As
the number of layers is further increased, the uncertainties are generally increased
(horizontal bars is widening). There may be a risk of fitting the noise (for example, large
95% confidence interval for Newhall inversion with layer number more than 16, Fig.
3.9) . When the number of layers becomes high (20 layer, Fig. 3.9), the reliability of the
single shear velocities is decreased (due to larger uncertainty) and a trend between Swave velocity and depth is identified.
The effect of layer S-wave velocity is shown in Fig. 3.10. All initial models have
the same thickness but different S-wave velocities, yielding three different profiles
through linearized inversion. Both effects of the number of the layers and the S-wave
velocity of each layer indicate that the inverted profiles depend on the initial models.
3.9 The effect of minimum and maximum depth
The minimum depth, able to be modeled, depends on the minimum sampled
wavelength. If the acquisition high frequencies are reliably acquired, the minimum
wavelength that can be observed depends on the spatial sampling, and equals the
82
geophone spacing. The shortest propagating wavelength carries the information related to
the shallowest layer, and gives the mean properties of the layers above its investigation
depth. These mean properties can still be influenced by a thinner layer that is not
investigated: in this case it is necessary to incorporate a priori information about this
layer, or to acquire shorter wavelengths.
The maximum depth, able to be modeled, depends on the maximum reliably
estimated wavelength. As a rule of thumb, it is limited to one half of the maximum
wavelength (Park et al., 1999), but sometimes it is possible to go deeper, down to one
wavelength (Herrmann and Al-Eqabi, 1991). Actually the maximum investigated depth
depends on the site. A rigorous approach should give a result with its uncertainties, and
should also compare the results of different inversions with different numbers of layers
and depths.
The maximum depth has an important effect on inversion (Fig. 3.11). Using the
initial model that has equal layer thickness and the same velocities for all layers,
inversion with a shallower maximum depth yields a good match (left, Fig. 3.11). The
maximum penetration depth of the data is deeper than the user-specified value of the
maximum depth. If the user-specified maximum depth is deeper than the maximum
penetration depth of the data, the inverted models might not reflect the subsurface
geology (right, Fig. 3.11).
3.10 The effect of number of dispersion picks
The number of dispersion picks heavily influences the final model. Fig. 3.12
shows the Occam’s inverted results on Newhall data sets with different number of picks.
83
The inversion is performed assuming an exact initial model as one used in row 1 column
2 in Fig. 3.9. We randomly deleted 2 out of 14 total original picks (except the minimum
and maximum frequency picks) to make a 12-pick dispersion data; 6 to make a 8-pick
data; 8 to make a 6-pick data. Among three inverted models, the inversion on 12-pick
data (left, Fig. 3.12) yields the best fitting model in terms of how close the inverted
model follows the OYO S-wave log. It is followed by the inversion on the 8-pick data
(middle, Fig. 3.12) and the inversion on the 6-pick data (right, Fig. 3.12). Each pick of
the data contains the geological information on the subsurface S-wave structure. The
inversion of the less pick data sets yields models that are far away from the true profile
than the inversion of the more pick data sets. This means that more picks provide more
information on the underground geology.
3.11 The effect of frequency density of dispersion picks
In the different frequency bands the information density is different. This time,
we randomly deleted 4 out of 14 total original Newhall dispersion picks (except the
minimum and maximum frequency picks) to make a 10-pick dispersion data (called data
A). Then we delete the 4 lowest frequencies to make another 10-pick dispersion data
(called data B). The last 10-pick dispersion data (called data C) is made by deleting the 4
highest frequencies out of a total of 14 original picks.
Fig. 3.13 shows the Occam’s inverted results on these three data sets. Among
three inverted models, the inversion on data B (middle, Fig. 3.13) yields the worst fitting
model that the inverted S-wave velocities are far below the OYO log while the inversion
on data C produces the best model. This means that in the low frequency range the picks
84
are more important to the inversion. Each of them significantly increases the information
content of the dispersion picks. In the high frequency range, on the contrary, the picks are
less important. They actually replicate the same information. The implication is that more
picks are needed in the low frequency range than in the high frequency range.
3.12 The effect of the weighting matrix
The inverted models of all above tests are calculated following equation (3.38),
assuming an identity weighting matrix. This time we re-run the inversion on nine
synthetic data sets under the same condition as before except that the weighting matrix
are set to 10% of data value (σi=0.1*Vsi). The inverted models for this run are calculated
following equation (3.40). The inverted S-wave velocity profiles are plotted atop of the
nine original synthetic models and the inverted model using the identity weighting matrix
(Fig. 3.14). Clearly adding the weighting matrix of 10% of data value does not cause
much changes of the inverted model.
The less important effect of weighting matrix on the inverted models indicates
that the inverted model is not sensitive to the weighting matrix. In the phase velocity
inversion of surface waves, there are errors caused by in-exact forward modeling and
linearization, by manually picking of phase velocity dispersion curves, and by
measurement errors (Table 3.2). These errors may have more important effect than the
weighting matrix.
85
Forward problem
Data d
True model m
Appraisal problem
Estimation problem
Estimated model m
Figure 3.1 The inverse problem viewed as a combination of an estimation problem plus
appraisal problem.
86
starting m0
Forwarding G(m0)
and partials J(m0)
damping parameter
αi
Solve occam’s
solution
m0 = m1
All α tested?
i=0, 1, ..., N
N
Y
Select the best
damped model m1
N
Converged?
Y
m1 is the final solution.
Done!
Figure 3.2 Flow chart showing the Occam’s inversion procedure. After trialling all α, the best model is chosed based on the
following criterion: If two or more trial models give errors
below the threshold, the largest α is preferred. If no such value
exists, then pick a value of α that minimizes the RMS.
87
0.0
0
5
Trace Sequence
10
15
20
Rayleigh waves
1.0
Time (s)
2.0
3.0
4.0
Figure 3.3 A typical synthetic seismic
record with strong Rayleigh waves.
0.0
0.0
5.0
Frequency (Hz)
10.0
15.0
20.0
25.0
Slowness (s/m)
0.001
0.002
0.003
0.004
0.005
0.0
ReMi Spectral Ratio
2.5
Figure 3.4 Slowness-frequency spectrum (p-f) image
with ReMi dispersion picks of a typical synthetic
seismic record. The small squares indicate analyst’s
picks of phase velocities of the fundamental-mode
Rayleigh waves.
88
Model1
0
Model 1 w/ Tr(R)=2.97
Model1
600
20
30
500
40
2
450
400
50
3
350
60
200
400
600
Vs (m/s)
800
0.05
0.1
0.15
Period (s)
1
600
Vs (m/s)
30
40
550
2
500
450
50
400
600
800
Vs (m/s)
0.05
1000
0.1
Period (s)
0.15
Vs (m/s)
750
700
2
650
600
550
3
500
0.4
0.2
0
3
1
800
60
0.6
2
Model 3 w/ Tr(R)=2.99
850
40
0.8
1
Model3
Model3
20
Depth (m)
3
400
0
1
3
Model 2 w/ Tr(R)=2.96
650
20
80
400
2
700
10
60
1
0.2
Model2
Model2
0
Depth (m)
1
550
Vs (m/s)
Depth (m)
10
600
800 1000 1200 1400
Vs (m/s)
0.05
0.1
0.15
Period (s)
0.2
1
2
3
Figure 3.5 Linearized inverted S-wave velocities against the original synthetic models
for nine synthetic data sets. Left column lists the inverted the S-wave velocities (thick
solid lines) with 95% confidence interval (horizontal bars). The dashed lines represent
the original synthetic models. Middle column shows the original ReMi dispersion picks
(circles) against the calculated dispersion curves (thick solid lines) of the inverted
model listed in left column. Right column contains resolution matrix (scale is at the
bottom). The first seven profiles are A-type sections. The eighth and the ninth profiles
are H-type and K-type sections, respectively.
89
Model4
0
Model4
Model4
1500
1
1400
20
1300
Vs (m/s)
Depth (m)
40
60
1200
2
1100
1000
900
80
3
800
100
1000
1500
Vs (m/s)
2000
0.05
3
Model5
1
550
20
Vs (m/s)
Depth (m)
2
600
10
30
500
2
450
40
400
50
350
60
200 400 600 800 1000 1200 1400
Vs (m/s)
3
0.06 0.08 0.1 0.12 0.14
Period (s)
Model6
0
1
520
Vs (m/s)
30
3
1
500
20
2
Model6
Model6
10
Depth (m)
1
0.15
Model5
Model5
0
480
2
460
40
440
50
420
60
0.1
Period (s)
3
400
400
600
Vs (m/s)
800
0.1
0.15
0.2
Period (s)
0.25
1
2
3
Figure 3.5 Continue.
90
Model7
0
Vs (m/s)
Depth (m)
40
60
80
1000
1500
Vs (m/s)
3
0.05
0.1
0.15
Period (s)
1
0.2
20
30
40
1
440
2
420
400
50
400
600
Vs (m/s)
800
3
0.1
1000
Model9
0.2
Period (s)
0.3
1
2
3
Model9
Model9
1100
1
20
1000
Vs (m/s)
Depth (m)
3
Model8
460
200
2
Model8
Model8
Vs (m/s)
Depth (m)
2
1000
2000
10
0
1200
800
0
60
1
1400
20
100
Model7
Model7
40
60
80
800
2
900
800
1000 1200 1400 1600 1800
Vs (m/s)
3
0.1
0.2
0.3
Period (s)
0.4
1
2
3
Figure 3.5 Continue.
91
1200
Original Vs30, m/s
Inverted Vs30, m/s
1000
800
600
400
200
0
M1
M2
M3
M4
M5
M6
M7
M8
M9
1200
Newhall
CCOC
Original Vs50, m/s
Inverted Vs50, m/s
1000
800
600
400
200
0
M1
M2
M3
M4
M5
M6
M7
M8
M9
Newhall
CCOC
1600
Original Vs100, m/s
1400
Inverted Vs100, m/s
1200
1000
800
600
400
200
0
M1
M2
M3
M4
M5
M6
M7
M8
Figure 3.6 The depth-averaged velocities in m/s against the
known values for Occam’s inverted models.
M9
Newhall
CCOC
92
5.0
Frequency (Hz)
10.0
15.0
20.0
25.0
0.0
0.0
0.002
0.002
0.004
0.004
Slowness (s/m)
Slowness (s/m)
0.0
0.0
0.006
0.008
5.0
Frequency (Hz)
10.0
15.0
20.0
25.0
0.006
0.008
0.01
0.01
0.0
ReMi Spectral Ratio
2.9
0.0
ReMi Spectral Ratio
2.5
Figure 3.7 Dispersion picks on the slowness-frequency spectrum (p-f) images of Newhall
(left) and Coyote Creek (right) data. The small squares on both images indicate analyst’s picks
of phase velocities of the fundamental-mode Rayleigh waves.
93
Newhall
0
1
500
20
40
Vs (m/s)
Depth (m)
Newhall w/ Tr(R)=3.96
Newhall
60
2
400
3
300
4
5
80
200
6
100
200 400 600 800 1000 1200
Vs (m/s)
0.1
Vs (m/s)
Depth (m)
50
100
200
600
1
500
2
400
3
300
4
200
5
100
1000
Vs (m/s)
1500
2
3
4
5
6
CCOC w/ Tr(R)=4.16
150
500
1
0.4
CCOC
CCOC
0
0.2
0.3
Period (s)
6
0
0.2
0.4
0.6
Period (s)
0.8
1
2
3
4
Figure 3.8 Linearized inverted profiles of S-wave velocity against the OYO suspension S-wave logs of Newhall and CCOC data. Left column lists the inverted the S-wave
velocities (thick solid lines) with 95% confidence interval (horizontal bars). The wiggle
lines represent the OYO suspension S-wave velocity log. Middle column shows the
original ReMi dispersion picks (circles) against the calculated dispersion curves (thick
solid lines) of the inverted model listed in left column. Right column contains resolution matrix (scale is at the bottom)
5
6
1
0.8
0.6
0.4
0.2
0
94
Newhall 4 layers
20
40
40
40
60
0
Depth (m)
20
100
60
80
Newhall 10 layers
60
100
200 400 600 800 1000 1200
Vs (m/s)
Newhall 12 layers
0
0
20
20
40
40
40
80
100
0
Depth (m)
20
60
60
80
100
200 400 600 800 1000 1200
Vs (m/s)
Newhall 16 layers
0
0
500
1000
Vs (m/s)
100
1500
Newhall 18 layers
0
40
40
Depth (m)
40
Depth (m)
20
100
60
80
200 400 600 800 1000 1200
Vs (m/s)
100
Newhall 14 layers
80
20
80
200 400 600 800 1000 1200
Vs (m/s)
60
20
60
Newhall 8 layers
80
100
200 400 600 800 1000 1200
Vs (m/s)
Depth (m)
Depth (m)
0
20
80
Depth (m)
Newhall 6 layers
0
Depth (m)
Depth (m)
0
200 400 600 800 1000 1200
Vs (m/s)
Newhall 20 layers
60
80
200 400 600 800 1000 1200
Vs (m/s)
100
200 400 600 800 1000 1200
Vs (m/s)
Figure 3.9 Inverted S-wave velocity against the OYO S-wave log of Newhall data, showing the
effect of the number of layers. The thick solid lines are inverted the S-wave velocities with 95%
confidence interval (horizontal bars). The thin wiggle lines represent the OYO suspension S-wave
velocity log. The dash lines are the initial models.
95
20
20
20
40
40
40
60
80
Depth (m)
0
Depth (m)
0
60
100
200 400 600 800 1000 1200
Vs (m/s)
60
80
80
100
100
200 400 600 800 1000 1200
Vs (m/s)
0
1000
2000
Vs (m/s)
3000
Figure 3.10 Inverted S-wave velocity against the OYO S-wave log of Newhall data, showing the
effect of the layer S-wave velocity. The legends are the same as Figure 3.9.
0
20
20
40
Depth (m)
10
30
40
50
Max depth of 100 m
50
60
400
600 800 1000 1200
Vs (m/s)
100
100
150
80
200
Max depth of 200 m
0
Depth (m)
Max depth of 50 m
0
Depth (m)
Depth (m)
0
200 400 600 800 1000 1200
Vs (m/s)
200
0
500
1000
Vs (m/s)
1500
Figure 3.11 Inverted S-wave velocity against the OYO S-wave log of Newhall data, showing the
effect of the maximum depth. The legends are the same as Figure 3.9.
96
Using 12 picks
Using 8 picks
0
20
40
40
40
60
80
80
100
Depth (m)
20
60
0
500
Vs (m/s)
100
1000
Using 6 picks
0
20
Depth (m)
Depth (m)
0
60
80
0
500
Vs (m/s)
100
1000
0
500
Vs (m/s)
1000
Figure 3.12 Inverted S-wave velocity against the OYO S-wave log of Newhall data, showing
the effect of the number of the picks. The legends are the same as Figure 3.9.
Randomly delete 4 picks
0
Delete 4 highest frequency picks
0
Delete 4 lowest frequency picks
20
20
40
40
40
60
80
100
Depth (m)
20
Depth (m)
Depth (m)
0
60
80
80
0
500
Vs (m/s)
1000
100
60
0
500
Vs (m/s)
1000
100
0
500
Vs (m/s)
1000
Figure 3.13 Inverted S-wave velocity against the OYO S-wave log of Newhall data, showing
the effect of the frequency density of the picks. The legends are the same as Figure 3.9.
97
Model1
10
20
20
30
40
50
400
600
Vs (m/s)
400
60
80
1000
1500
Vs (m/s)
2000
80
400
1000
Model5
10
20
20
30
40
60
200 400 600 800 1000 1200 1400
Vs (m/s)
60
0
10
Depth (m)
40
60
80
400
600
Vs (m/s)
800
Model9
20
20
Depth (m)
20
Model6
40
50
Model8
800 1000 1200 1400
Vs (m/s)
30
50
0
600
0
10
Model7
0
600
800
Vs (m/s)
Depth (m)
40
40
60
0
Depth (m)
Depth (m)
40
60
800
20
Depth (m)
30
Model4
0
100
20
50
60
200
Model3
0
Depth (m)
10
100
Model2
0
Depth (m)
Depth (m)
0
30
40
40
60
50
1000
1500
Vs (m/s)
2000
60
200
400
600
Vs (m/s)
800
1000
80
800
1000 1200 1400 1600 1800
Vs (m/s)
Figure 3.14 Effects of the weighting matrix on the inverted models. Each graph shows the final
S-wave velocities model inverted using an identity weighting matrix (the vertical thick solid line) with
95% confidence interval (the horizontal solid bar), the final model using a weighting matrix of 10%
data value (the vertical thick dash line) with 95% confidence interval (the horizontal dash bar), and the
original synthetic models (the vertical dash line).
98
Chapter 4 Non-linear inversion of surface-wave dispersion
based on simulated annealing optimization
One of primary goals of inversion is to find the minimum of an error function.
Due to the fact that such an error function may have several minima of different heights
(known as multimodal problems), the linearized inversions may become trapped in one
of these local minima. In section 3.8 of Chapter 3, I demonstrated that a final inverted
model determined by the linearized inversions inherently depends on an assumed initial
model. The initial-model-dependence is evaluated in two aspects: the number of layers
and the velocity of each layer. The more layers, the more computation effort, and the
more realistic the inverted models. However, increasing number of layers alone does
not guarantee the escape from these local minimum points.
The multimodal problems of surface waves dispersion curves are highlighted by
Dal Moro et al. (2007) through a simple example. They calculated the synthetic
fundamental mode dispersion curve for a model consisting of six layers. The objective
functions (RMS error, equation (3.8) ) are calculated by varying only thickness and Swave velocity of the second and third layers and by fixing the values of the other
parameters to their known values. In other words, as a surface-wave dispersion curve is
a function of S-and P-wave velocities, density, and thickness, they free only four out of
twenty three possible variables (the bottom layer is semi-infinite). Figure 4.1a shows a
3D plot of objective functions (calculated for the synthetic curve) for 4000 random
models. The abscissas report the ratio between S-wave velocity in the second and third
99
layers (VS ratio) and the ratio between the thickness of the second and third layer (THK
ratio).
The complex result from this extremely simple and constrained example
illustrates that the local minima would cause a trap of searching path in cases where
linearized methods are used to invert the data. To further highlight the problem, Figure
4.1b shows a 2-D plot for objective functions versus the VS ratio, by keeping constant
the layer thickness to the proper values (thickness ratio equal to 1). The distribution of
points (models) characterized by low-value objective functions is large even in the
surrounding of the global-minimum area. This gives evidence of the extreme
nonlinearity and multimodality of the problem.
4.1 Global searching optimization
Several algorithms tackle this problem with the main goal of sampling a wide
search space to detect the global minimum (or maximum) of a given function (e.g. Sen
and Stoffa, 1995). Heuristic optimization schemes can be divided into enumerative or
grid search, random search (uniform distribution of the search space sampling, such as
Monte Carlo methods), and “importance sampling” (the search space is non-uniformly
sampled because some function drives the search, such as Simulated Annealing (SA)
and Evolutionary or Genetic Algorithms (GA) ).
Enumerative or grid search methods of inversion involve the systematic search
through each point in a pre-defined model space to locate the best fit models. For most
geophysical problems, the model space is very large and the forward calculation is
slow. For example, there are twenty three possible variables (the bottom layer is semi100
infinite) for a model consisting of six layers. Assuming 10 discrete grids for each
variable, the model space would be 1023 possible points (models), which is extremely
large. Another drawback of enumerative search is that grid increment may be large so
that the best fit model would be skipped. In the case of 10 grids for S-wave velocity
ranging from 100 m/s to 1000 m/s, the grid increment is 100 m/s, which is too coarse
for surface wave dispersion inversion.
Random search overcomes the discreteness problem. A random number is
drawn from a uniform distribution U[0, 1] and is then mapped into a model parameter.
As an example assume a random number (rn), the new model parameter value can be
given as
minew = mimin + rn × (mimax − mimin )
(4.1)
where mimin ≤ mi ≤ mimax
A new random modal vector can be generated by random perturbation of a
specific number of model parameters in the model vector. Synthetic data are then
generated for the new model and compared with observations. The model is accepted
deterministically based on an acceptance criterion which determines how well the
synthetic data compare with the observations. The generation-acceptance/rejection
process is repeated until a stopping criterion is satisfied. A commonly used stopping
criterion is given by the total number of accepted models.
The random search methods have been successfully used to solve highly
nonlinear inversion problems (Sambridge and Mosegaard, 2002). However, they are
computational expensive. Several different schemes have to be adopted. One approach
is to use prior information to define a prior probability density function (pdf) to sample
101
the models, such that the search space is narrowed. Another is “importance sampling”
such as Simulated Annealing (SA) and Genetic Algorithms (GA).
Genetic algorithms (GAs) are used to invert seismic velocities (Louis et al.,
1999), seismic waveform (Stoffa and Sen, 1991), and shallow elastic parameters
(Rodriguez-Zuniga et al., 1997). In GAs, a series of “genetic operations” (namely
selection, crossover and mutation) acts along various successive steps (generations)
with the aim of working out a solution able to minimize (or maximize) a certain fitness
function that measures how good a certain model is with respect to a desired
characteristic. The final solution is a model that shows the best fitness value. This kind
of procedure, or at least its basic form, does not provide any evaluation of accuracy or
uncertainty of the proposed final solution and the problem is actually seldom considered
in several optimization or inversion schemes. Recently, GA has been applied on surface
wave dispersion inversion (Yamanaka and Ishida, 1996; Dal Moro et al., 2007).
The basic GA is quite robust (Yamanaka and Ishida, 1996). The method is not
sensitive to the starting models. However, there are several problems with the basic
algorithm that are needed to be addressed (Sen and Stoffa, 1995). First, there is no
guarantee that the optimum solution will be found. The algorithm proceeds towards a fit
population. In other words, it converges toward the best model of the population.
Second, the convergence toward the global maximum may be slow because some model
parameters have minor impact on the fitness. Third, the basic algorithm is also
unsatisfying in that once the models have been selected and reproduction occurs, other
previously acquired information is not usually exploited. Recently improvements have
102
been made to overcome these disadvantages (Reeves and Rowe, 2003; Dal Moro et al.,
2007).
This dissertation employs the simulated annealing (SA) method as an alternative
for inversion of phase velocity dispersion curves of the high-frequency fundamentalmode Rayleigh waves contained in microtremors. The implementation is validated by
blind tests against a suite of synthetic data sets. SA is a Monte Carlo process for finding
the global minimum of a non-linear error function and has been successfully used in
many geophysical inverse problems (Pullammanappallil and Louie, 1993, 1994; Sen
and Stoffa, 1995) since the work of Metropolis et al. (1953) and of Kirkpatrick et al.
(1983). Recently, SA has been used to invert Rayleigh wave dispersion data for a Swave velocity profile at both crustal (Martinez et al., 2000) and local-site (Beaty et al.,
2002) scales. Martinez et al. (2000) used random numbers drawn from a uniform
distribution in the interval (-1, 1) to perturb the model during their SA inversion, while
Beaty et al. (2002) used a Cauchy-like distribution. In this implementation, the SA
algorithm draws random numbers from a standard Gaussian distribution. After testing
the implementation on twelve synthetic data sets, I applied it to two shallow dispersion
data sets collected using the ReMi technique. The inversion results from both field data
sets are compared against borehole logs.
4.2 Simulated annealing optimization method
Briefly summarized, simulated annealing is a generalization of a Monte Carlo
method for examining the equations of state and frozen states of n-body systems
(Metropolis et al., 1953). The concept is based on the manner in which liquids freeze or
103
metals re-crystallize in the process of annealing (Fig. 4.2). In an annealing process a
melt, initially at high temperature and disordered, is slowly cooled so that the system at
any time is approximately in thermodynamic equilibrium. As cooling proceeds, the
system becomes more ordered and approaches a "frozen" ground state at T=0. Hence
the process can be thought of as an adiabatic approach to the lowest energy state. If the
initial temperature of the system is too low or cooling is done insufficiently slowly the
system may become quenched forming defects or freezing out in metastable states (i.e.
trapped in a local minimum energy state).
The original Metropolis scheme uses an initial state of a thermodynamic system
that is chosen at energy E and temperature T. Holding T, the initial configuration is
perturbed and the change in energy ∆E is calculated. If the change in energy is zero or
negative, the new configuration is accepted. If the change in energy is positive, the new
configuration may still be accepted with a conditional probability given by the
Boltzmann factor exp(-∆E/T). This process is then repeated sufficient times to give
good sampling statistics for the current temperature. Then the temperature is
decremented and the entire process repeated until a frozen state is achieved at zero
temperature.
By analogy the generalization of the above Metropolis Monte Carlo process to
the inversion of dispersion curves is straightforward (Fig. 4.3). The current state of the
thermodynamic system is analogous to the calculated phase velocities based on the
current model m which involves the shear and compression wave velocities, thickness,
and density of each layer. The energy of the thermodynamic system is analogous to the
104
root mean square (RMS) error E between the calculated and the observed phase
velocities, defined as
E=
1
N
N
obs
cal 2
∑ (Vri
− Vri )
i =1
(4.2)
where N is the total number of the phase velocity picks of the fundamental Rayleigh
waves, Vriobs is the observed phase velocity for picks i, and Vrical is the calculated phase
velocity for picks i given a model vector m. The frozen state is analogous to the global
minimum of the RMS error E. The temperature T is analogous to a free parameter
which controls the whole process.
The conditional acceptance is very important for the simulated annealing. In a
local search method, we start with a reference model. A new model is accepted if and
only if ∆E ≤ 0 , in another words, it always searches in the downhill direction (Fig. 4.4).
However, in SA, every model has a finite probability of acceptance even though
∆E > 0 . Thus local methods can get trapped in a local minimum that may be in the
close neighborhood of the starting model while SA has a finite probability of jumping
out of local minima. As the temperature approaches zero, however, only the moves that
show improvement over the previous trial are likely to be accepted.
The Monte Carlo process at a constant temperature T can be modeled by
generating homogeneous Markov chains of finite length for a finite sequence (Sen and
Stoffa, 1995). A starting model m0 is selected and the corresponding misfit E(m0) is
determined. The shear velocity and the thickness of an arbitrary layer of the initial
model are perturbed by random numbers drawn from the standard Gaussian distribution
N(0, 1) as
105
Vs j = Vs j + aVs j
and
h j = h j + bh j
(4.3)
where a and b are random numbers drawn from the standard Gaussian distribution, Vs j
and h j are the S-wave velocity and the thickness of layer j, respectively. We keep
densities constant and update P-wave velocities (Vp) based on the assumed Poisson’s
ratio which is fixed during the inversion. A new misfit E is calculated for the perturbed
model. If the new misfit has been improved then we accept the perturbed model and
update the starting model m0 into m1. If the misfit has become larger then the perturbed
model is conditionally accepted with an acceptance probability exp(( E1 − E0 ) T ) . The
next iteration is based on the updated model m1. This process is repeated at a constant
temperature until the system arrives at an equilibrium state.
In order to find the global minimum, the homogeneous Markov chain of finite
length must be in an equilibrium state before temperature drops (Aarts and Korst,
1989). Mathematically, it is shown that such an equilibrium state can be obtained at a
constant temperature T after a large number of iterations (Aarts and Korst, 1989;
Geman and Geman, 1984). The equilibrium distribution at the temperature T is given by
the Gibbs’ probability density function
G (m) = exp(
− E (m)
)
T
∑ exp(
M
− E (m)
)
T
(4.4)
where the sum is taken over all models in the model space M.
Once the equilibrium has been achieved, we drop the temperature (annealing)
and repeat the Monte Carlo process at a lower constant temperature. After a large
number of iterations, another new equilibrium state is established for this lower
temperature. Then we repeat the temperature annealing until T is close to zero and the
106
misfit is smaller than a user-specified value ε. The whole iteration is called the
annealing process.
One advantage of the annealing process is that the results are independent of the
starting model (Rothman, 1985; Pullammanappallil and Louie, 1993, 1994). This is
rooted in the limiting probability theory of the Markov chain (Ross, 2003, p.200).
Another advantage is the conditional acceptance probability, which allows a conditional
acceptance of models having higher errors. This will give the annealing process an
opportunity to escape local minima. A disadvantage of the annealing process is that it
can be more expensive computationally than linearized methods.
A set of parameters, called the cooling schedule, govern the convergence of the
annealing process. Careful searching for them can guarantee the achievement of an
equilibrium state for each temperature drop. Different cooling schedules require
different probability distributions from which a random number is drawn to perturb
models. I use a polynomial-time cooling schedule that guarantees the convergence of
the annealing process and leads to a polynomial-time execution of the simulated
annealing (Aarts and Korst, 1989). It consists of four components: an initial temperature
T0, a decrement of the temperature, a stop criterion, and the length of the homogeneous
Markov chains.
The initial temperature T0 is obtained from the requirement that at this value
virtually all proposed models should be accepted. It is measured by the acceptance ratio
which is the ratio of the number of models accepted to the number of models proposed.
In practice this can be achieved by starting off at a small positive value of temperature
107
and multiplying it with a constant factor, larger than 1, until the corresponding
acceptance ratio is close to unity.
The temperature drop should be set carefully. A large drop will lead to the loss
of the equilibrium state. A small drop will cost more computational time. Aarts and Van
Korst (1989) showed
Tk +1 = α Tk
0.8 ≤ α ≤ 0.99
k = 1, 2,"
(4.5)
where k is the step in temperature drop. Our tests show that the value of 0.95 for α is
effective for our dispersion curve inversion.
The stop criterion of annealing process is arbitrary. Reasonable constraints are
that the misfit remains the same for a number of iterations for T close to zero, and that
the misfit is smaller than a user-specified value ε.
The length of the homogeneous Markov chain is based on the requirement that
at each constant temperature, equilibrium is to be restored. It is dynamically determined
during the execution of the annealing process.
4.3 Model appraisal
In linearized inversions, the quality of the inverted results can be investigated by
calculating the resolution matrix (equation (3.45) ). The deviation of the inverted
parameters can be estimated from the deviation of the observed data. However, such
information can not be derived when using the SA inversion. One way to approach the
problem is the Gibbs sampler. If the SA runs for a large number of iterations at the final
temperature, it may be used as a Monte Carlo importance sampling technique, weighted
108
by the problem’s posteriori probability density function, to evaluate quantities such as
mean and variance.
The reliability of the model resulting from the annealing can be evaluated by the
uncertainty estimation process, during which the annealing process is repeated at the
final temperature for at least 500 iterations (Martinez et al., 2000). The expected model
<m> can be expressed as
m = ∫ mG (m)dM
M
(4.6)
and the covariance matrix as
Cm = ∫ mmT G (m)dM - m m
T
M
(4.7)
where G (m) is the Gibbs probability density function for the different models m of the
model space M. In practice the integration in the above equations can be replaced by
finite sums for all accepted models.
Both Vs and thickness of each layer of the models are varied during the
annealing process. However, one of these has to be fixed during the uncertainty
estimation process. Under fixed thicknesses, Equations (4.6) and (4.7) yield a mean
profile with the standard deviations of Vs of each layer; under fixed shear velocities, a
mean profile with the standard deviations of thickness of each layer.
4.4 Inversion results
The SA inversion method described above was tested on a suite of nine
synthetic models and two field data sets. Chapter 3 described the acquisition of seismic
records and extraction of surface wave dispersion.
109
The initial models were generally set to be a uniform half-space. The maximum
modeling depths were estimated using the half-maximum-wavelength discussed by Park
et al. (1999). This criterion has been used by Xia et al. (1999), Beaty et al. (2002), and
Stephenson et al. (2005). Another critical issue in dispersion inversion is the number of
layers. The more layers, the more computation effort, and the more realistic the inverted
models (Dal Moro et al., 2007). From the mathematical point of view, we determine the
number of layers by sign change of the second derivatives of the picked dispersion
curves. All initial S-wave velocities were set to a visually inspected average of the
picked phase velocities. P-wave velocities are updated based on the assumed Poisson’s
ratio. Densities are set at a reasonable value (e.g. 2.0 g/cm3) and kept constant during
the inversion. The maximum modeling depth and number of layers can also be
manually set if known independently. Other a priori information that could be manually
incorporated into the SA optimization process includes Poisson’s ratio, density,
thickness, and S-wave velocity of any layers (at least two layers are free to allow
perturbations).
Inversion on the synthetic ReMi Rayleigh-wave phase velocity dispersion curve
picks is carried out by varying both S-wave velocity and layer thickness and fixing
density and Poisson’s ratio at the original values (2.0 g/cm3 and 0.25 of the original
synthetic models, respectively).
Figure 4.5 shows the mean of the inverted S-wave velocity profiles plotted atop
nine original synthetic models. Under fixed thicknesses, the standard deviations of Swave velocity of each layer are calculated from equation (4.7) and plotted as horizontal
error bars. Among the total twenty-seven layers of nine synthetic models, velocities of
110
twenty-one layers (78% of trial cases) fall within the error bars. Statistics on the
inverted models show that the average errors for Vs are 5.5% for the top layer, 11% for
the second, and 7.9% for the lower half space. The top layer Vs errors for H-type and Ktype models are 14.3% and 0.2%, respectively. The Vs inversions for the second layer
for H-type (0.1% Vs error) and K-type (5.8% Vs error) models are better. Inverted
velocities for the half space of the H-type and K-type models are off 17.9% and 15.3%,
respectively.
The Vs value averaged to 30 (V30), 50 (V50), and 100 (V100) m shown in Fig. 4.6.
Table 4.1 tabulates these values which have been rounded to the one significant figure.
Average V30 error is 3.5% over all nine synthetic models. Only one (Model 4) yields a
large V30 error of 11%. Average V50 and V100 errors are 4.6% and 6.1%, respectively,
over all nine synthetic models.
Table 4.1 SA-inverted depth-averaged velocities in m/s and
percentage difference from known profiles in parentheses
V50
V100
Models
V30
model 1
480 (2%)
540 ( 1%)
610 ( 1%)
model 2
550 ( 3%)
620 ( 0%)
720 ( 1%)
model 3
620 ( 6%)
660 (10%)
800 ( 6%)
model 4
980 (11%)
1070 (11%)
1310 (11%)
model 5
490 ( 1%)
560 ( 0%)
630 ( 2%)
model 6
480 ( 1%)
520 ( 3%)
570 ( 6%)
model 7
920 ( 5%)
1020 ( 6%)
1240 ( 8%)
model 8
430 ( 1%)
460 ( 6%)
490 (11%)
model 9
930 ( 1%)
1040 ( 4%)
1110 ( 9%)
model 7a
920 ( 5%)
1110 ( 3%)
1400 ( 3%)
model 8a
460 ( 9%)
470 ( 4%)
500 (10%)
model 9a
1010 (10%)
1140 (13%)
1230 (20%)
Newhall
300 (11%)
380 ( 7%)
490 ( 4%)
CCOC
240 (14%)
270 (11%)
340 (14%)
111
Under fixed shear velocities, equations (4.6) and (4.7) yield the mean profiles
with the standard deviations of thickness of each layer (Fig. 4.7). The models 7a, 8a,
and 9a were inverted using the same picks as the models 7, 8, and 9 but with a different
uncertainty estimation process. The average Vs values (V30, V50, and V100 ) of models
7a, 8a, and 9a do not differ significantly off from those of models 7, 8, and 9 (Table
4.1).
Inversion of two field data sets (Newhall and Coyote Creek) starts with an initial
model generated in the way described above. Both S-wave velocity and thickness are
allowed to vary for a finer fit. P-wave velocities are updated based on the assumed
Poisson’s ratio of 0.35 (Dal Moro et al., 2007). Densities are set at 2.0 g/cm3 and kept
constant during the inversion.
Figure 4.8 displays the inverted models for both the Newhall and Coyote Creek
data. The mean of the inverted S-wave velocity for the Newhall data follows the OYO
suspension S-wave velocity log well. The significant increase of the S-wave velocity at
a depth of about 26 m is clear on the inverted model. Ninety percent of the OYO
suspension S-wave velocity log values fall within the standard deviations of the inverted
model. The velocity uncertainty increases with depth as expected (resolution of
dispersion curves decreases as depth). The maximum depth of the inverted profile for
the Coyote Creek data is over two times deeper than that of the Newhall data. The mean
of the inverted S-wave velocity of the Coyote Creek data follows the OYO suspension
S-wave velocity log. The rapid increases of Vs at the depths of 20 and 60 m are
effectively inverted. Figure 4.9 compares the calculated dispersion curves and ReMi
dispersion picks. Both match well. Average Vs errors of the Newhall data are 11% for
112
V30, 7% for V50, and 4% for V100 (Table 4.1). Average Vs errors of the Coyote Creek
data are 14% for V30, 11% for V50, and 14% for V100.
4.5 Comparison with linearized inversion results
The same suite of nine synthetic models and two field data sets are used to test
both linearized and SA inversions. Visual inspections on nine synthetic data sets (Fig.
3.5 and 4.5) reveal that the SA inverted models have an equal or better fit than the
linearized inverted models. For example, in the case of model 1, SA-inverted profile
almost perfectly fits the original synthetic model (Fig. 4.5) while the linearized
inversion fit for model 1 is far less perfect. Another example is model 5. The high
velocity layer of the inverted model shows an artifact of linearized inversion (Fig. 3.5)
while SA-inverted final model follows the original model very well (Fig. 4.5). Visual
inspections on Newhall and CCOC data sets also show a significant fitting improvement
that is achieved by the SA inversion. Unlike the linearized inverted model of CCOC
data set (Fig. 3.8), the SA-inverted model follows the OYO log very well and does not
show the low velocity zone. The calculated dispersion curve of CCOC fits the ReMi
picks excellently, especially near period 0.3 second (right, Fig. 4.9) while the curve
misfits the picks near 0.3 second on the linearized inversion (Fig. 3.8).
There are some cases where both linearized inversion and SA inversion yield the
models that fit the data equally well. For example, both inversions on model6 6 and 7
yield almost same final model (Fig. 3.5 and Fig. 4.5).
The difference in the inverted final models by both methods is probably related
to the layer thickness. The S-wave velocities are inverted while the layer thickness is
113
kept the same as these in the initial model during the linearized inversion. But both Swave velocities and layer thickness are varied during SA inversion. A dispersion curve
is a function of S-wave velocity, layer thickness, layer density, and P-wave velocity of
models. Varying layer thickness increases the searching model space, thus, increases the
possibility to find a global minimum solution.
4.6 Difference from previous implementation
The SA begins with Metropolis et al. (1953) and was introduced into geophysics
area by Kirkpatrick (1983) and others. Since then, it has been used for many studies
(Pullammanappallil and Louie, 1993, 1994; Sen and Stoffa, 1995; Martinez et al., 2000;
Beaty et al., 2002). Basic implementation of SA consists in three parts: 1) initial
temperature T0; 2) annealing process; and 3) uncertainty estimation. The annealing
process could be subdivided into a) probability distribution that a random number is
drawn from; b) the length of the homogeneous Markov chains; c) decrement of
temperature ∆T; d) stop criterions. Due to these options, researchers are able to
determine the best combination that fit their problem. Even for the same proplem, there
could be various implementations. For example, this study is for inversion of phase
velocity dispersion curves of the high-frequency fundamental-mode Rayleigh waves
contained in microtremors, which is the similar purpose of Beaty et al. study (2002).
Table 4.2 lists the implementation difference between this study and Beaty’s study.
Beaty et al. (2002) used a fast simulated annealing algorithm (FSA), which is
identical to the Metropolis Monte Carlo process with the exception that model
parameters are drawn from a Cauchy-like distribution (Sen and Stoffa, 1995). Unlike
114
a)
b)
c)
d)
e)
f)
Table 4.2 Implementation difference of SA from previous study
Beaty’s study
This study
Cauchy-like distribution.
A standard Gaussian distribution.
One iteration for each temperature Many iterations until equilibrium state
drop.
is achieved. Then temperature is
dropped.
T(i) = T(0) / i
T(i+1) = αT(i)
Stop the SA when the misfit
1) The misfit remains the same for a
remains the same for a number of
number of iterations for T close to zero,
iterations.
and 2) the misfit is smaller than a userspecified value ε.
Invert for coefficients of
Directly invert for S-wave velocities
Chebyshev polynomials that
represent velocity profiles
The SA inversion employing only The SA (this implementation) works
fundamental-mode dispersion
well for S-wave velocity inversion on
curves does not work well. Higher- dispersion curves of fundamental-mode
mode dispersion data have to be
Rayleigh waves.
used for an eligible SA inversion.
this algorithm that requires the equilibrium state before the temperature drops, the FSA
does not consider whether the current state is in equilibrium. For FSA the temperature is
dropped after each individual iteration. Further, instead of directly inverting for the
velocities of each layer, Beaty et al. (2002) use a sum of Chebyshev polynomials to
represent velocity profiles, and invert the coefficients of these polynomials. Beaty et al.
(2002) stop the SA when the misfit remains the same for a number of iterations. This
stop criterion may be satisfied while the temperature is still high, possibly causing a
high probability of non-global-minimum solutions.
Tests on both field and synthetic Rayleigh dispersion data show that our
implementation of SA works well for S-wave velocity inversion on dispersion curves of
high-frequency fundamental-mode Rayleigh waves. Using the idea of SA with a
115
different implementation, Beaty et al. (2002) suggested that SA inversion employing
only fundamental-mode dispersion curves does not work well. The disparity with our
successful blind tests might be due to their different implementation of SA and the
small number of tests they conducted. Only one synthetic and one field data set were
used for testing in Beaty’s study. In this implementation of SA, temperature is not
dropped until an equilibrium state is achieved through many iterations at the current
temperature. Twelve blind synthetic and two field data sets were used to demonstrate
the validity of the implementation.
4.7 A hybrid inversion approach: simulated
annealing followed by the linearized inversion
The convergence of SA is governed by a cooling schedule. At the final
temperature stage, the annealing process is repeated for hundreds of iterations, sampling
at least a hundred models that have misfits to the dispersion data within a pre-set
tolerance. Then the mean and the standard deviation of either S-wave velocity or depth
can be calculated based on mathematical formula (equations (4.6) and (4.7) ). In most
cases, these model samples are close to each other (standard deviation of the samples is
close to 0) such that they are virtually one model. In other cases the standard deviation
of these samples is significant.
The hybrid algorithm utilizes the output of SA as the input to the Occam’s
linearized inversion that has been described in Chapter three (Fig. 4.10). Although SA
practically produces more than one model samples that fit data equally well, they are
independent of the initial model and closer to the global minimum one than the initial
116
model. If we choose the mean of these model samples as our initial model for the
linearized inversion, the possibility of obtaining the global minimum model would be
higher.
The hybrid inversion method described above was tested on two experimental
data sets, Newhall and CCOC. The acquisition of the seismic records and the extraction
of surface wave dispersion curves at both sites have been described in Chapter 3.
The SA inversion is implemented as described above except that the layer
thickness is fixed. In order to exam the improvement of the hybrid inversion, only Swave velocities are allowed to vary during iteration. P-wave velocities are updated
based on an assumed Poisson’s ratio of 0.35. Densities are set at 2.0 g/cm3 and kept
constant during the SA inversion. These SA-inverted models are taken as the starting
models for the Occam’s inversion. The results from the Occam’s inversion are the final
hybrid-inverted models.
For the Newhall data, the SA-inverted model (left, Fig. 4.11) generally follows
the OYO suspension S-wave velocity log with a RMS error (equation (4.2) ) of 2.4 m/s.
The hybrid inversion yields a model (right, Fig. 4.11) that is the exactly same one as the
SA-inverted model with the same RMS error and the layer S-wave velocities. This
indicates that the further Occam’s inversion does not improve the SA-inverted model
and the SA inversion obtained the global minimum.
The similar conclusion could be drawn from the CCOC data. Both SA-inverted
(left, Fig. 4.12) and the hybrid-inverted models (right, Fig. 4.12) follow the OYO
suspension S-wave velocity log. The hybrid inversion produces a model that has a
smaller RMS error (10.7 m/s) than the SA-inverted model (RMS = 11.8 m/s). However,
117
the decrease of RMS error (7.6%) is insignificant. The further Occam’s inversion does
not significantly improve the SA-inverted model, indicating that the SA-inverted model
is reliable.
For the Occam’s inversion, the inverted model can be evaluated by computing
resolution matrix (equation (3.45) ) and covariance matrix (equation (3.42) ). However,
they can not be computed for some cases of the hybrid inversion algorithm. For the
Occam’s inversion, the direction of descent of the cost function is linear. In another
words, the model in iteration (i) must have a smaller cost function than the model in
iteration (i-1). Otherwise, the model in iteration (i) would be the final model. The
hybrid algorithm utilizes the output of SA as the initial model to the Occam’s linearized
inversion (Fig. 4.10). In some cases, this initial model has a smaller cost function than
models produced by equation (3.40) (for instant the Newhall data set). Then the initial
model is simply taken as the final model. Therefore, there are no resolution and
covariance matrix for this final model is given not calculated.
118
Figure 4.1 Multimodality of the surface wave dispersion curve inversion problem. a) In abscissa: the ratios between the shear-wave velocities (VS) of two overlaying strata and their thicknesses (THK), in ordinate the objective functions for 4000 models. The white arrow indicates
the position of the real model. b) A 2-D plot of 4000 objective functions
versus the VS ratio, by keeping constant the THK ratio at the proper
value. The black arrow indicates global minimum point (From Dal
Moro et al., 2007)
119
Fast drop
Glass
Slow drop
Hot metal block
T=1000 oC
Crystal
Figure 4.2 A cartoon showing an annealing process.
E0
h[0] Vs[0]
h[1] Vs[1]
h[2] Vs[2]
Initial model
∆E ≤ 0
Perturb
E1
Accept model
Pc ≥ Pr
else
else
Accept model
h[1] Vs[1]
Refect model
h[2] Vs[2]
Perturbation
where ∆E = E1 − E0 , Pc = e
−(
h[0] Vs[0]
∆E
)
T
Perturbed model
, and Pr ⊂ U [0, 1]
Figure 4.3 A flowchart showing the annealing process on inversion of dispersion curve
of surface waves.
120
Pc = e
−(
∆E
)
T
Figure 4.4 A cartoon showing the role of the conditional acceptance.
A local search mothod searches the minimum model always in the
downhill direction and can get trapped in a local minimum/valley
(black solid dots) that may be in the close neighborhood of the starting model. In SA, every model has a finite probability of acceptance
(Pc). In other words, it has a finite probabability of jumping out of
local minimum/valley. As temperature gradually decreases, only
moves that show improvement over the previous trial are likely to be
accepted. As temperature closes to zero, the global minimum/valley
(open circle) will be found.
121
Model 1
10
10
20
20
40
10
Depth (m)
30
40
400
600
60
800
400
600
800
80
400
1000
60
80
10
10
20
20
30
40
60
200
2000
400
Model 7
600
800
40
60
1000
400
600
800
Vs (m/s)
Model 8
0
Model 9
0
10
10
20
1200
30
Vs (m/s)
Vs (m/s)
0
1000
50
50
1500
800
Model 6
0
Depth (m)
Depth (m)
40
1000
600
Vs (m/s)
Model 5
0
20
Depth (m)
50
Vs (m/s)
Model 4
0
20
40
60
Depth (m)
Depth (m)
20
Depth (m)
40
70
Vs (m/s)
30
40
30
40
50
60
80
100
30
60
50
50
100
Model 3
0
20
30
60
200
Model 2
0
Depth (m)
Depth (m)
0
50
1000
1500
Vs (m/s)
2000
60
70
200
400
600
Vs (m/s)
800
1000
80
800
1000
1200
1400
1600
Vs (m/s)
Figure 4.5 Inverted profiles with standard deviation of S-wave velocity against the original
synthetic models. The thick solid lines are means of the profile inverted by the simulated annealing algorithm. The horizontal bars are standard deviations associated with the solid lines. The
dashed lines represent the original synthetic models. The first seven profiles are A-type sections.
The eighth and ninth profiles are H-type and K-type sections, respectively.
122
1200
Original Vs30, m/s
Inverted Vs30, m/s
1000
800
600
400
200
C
O
CC
ha
ll
N
ew
9a
M
8a
M
7a
M
9
M
8
M
7
M
6
M
5
M
4
M
3
M
2
M
M
1
0
1200
Original Vs50, m/s
Inverted Vs50, m/s
1000
800
600
400
200
C
O
CC
ha
ll
N
ew
9a
M
8a
M
7a
M
9
M
8
M
7
M
6
M
5
M
4
M
3
M
2
M
M
1
0
1600
Original Vs100, m/s
1400
Inverted Vs100, m/s
1200
1000
800
600
400
200
C
O
CC
ha
ll
N
ew
9a
M
8a
M
7a
M
9
M
8
M
7
M
6
M
5
M
4
M
3
M
2
M
M
1
0
Figure 4.6 The depth-averaged velocities in m/s against the known values for SA inverted models.
123
Model 7a
0
10
20
40
60
Depth (m)
20
Depth (m)
Depth (m)
Model 9a
0
10
20
30
40
30
40
50
60
80
100
Model 8a
0
50
1000
1500
Vs (m/s)
2000
60
70
200
400
600
Vs (m/s)
800
1000
80
800
1000
1200
1400
1600
Vs (m/s)
Figure 4.7 Inverted profiles with standard deviation of layer thickness against the original
synthetic models. The thick solid lines are means of the profile with standard deviation of layer
thickness shown as the vertical bars. The dashed lines represent the original synthetic models.
124
0
0
OYO Vs log
Inverted model
OYO Vs log
Inverted model
10
50
20
30
Depth (m)
Depth (m)
40
50
60
100
150
70
200
80
90
100
250
0
200
400
600
Vs (m/s)
800
1000
0
200
400
600
Vs (m/s)
800
1000
1200
Figure 4.8 Comparison of the OYO suspension S-wave velocity logs and the inverted
models for the Newhall (left) and the Coyote Creek data (right). The thin lines are the
OYO suspension logs and the thick lines are means of the optimized S-wave velocity
profiles with standard deviations shown as horizontal error bars.
700
700
600
600
500
Vs (m/s)
Vs (m/s)
500
400
400
300
300
200
200
100
100
0
0.1
0.15
0.2
0.25
Period (s)
0.3
0.35
0
0.2
0.4
Period (s)
0.6
0.8
Figure 4.9 Calculated dispersion curves (lines) of fundamental-mode Rayleigh waves
plotted atop the ReMi dispersion picks (circles) for the Newhall (left) and the Coyote
Creek data (right).
125
An
arbitrary
initial
model
SA
inversion
Occam’s
linearized
inversion
A final
model
Figure 4.10 The flow chart of the hybrid inversion algorithm
126
0
0
OYO Vs log
75.7 260.3
2.4 295.8
10
20
30
30
40
40
Depth (m)
Depth (m)
20
50
60
50
60
70
70
80
80
90
90
100
200
400
600
800
Vs (m/s)
1000
OYO Vs log
2.4 295.8
10
100
1200
200
400
600
800
Vs (m/s)
1000
1200
Figure 4.11 Final inverted models for Newhall data using the simulated annealing method (left) and the
hybrid inversion method (right). Dash lines are the initial S-wave velocity models and solid lines are
the inverted S-wave velocity models by SA (left) and the hybrid inversion (right). Wiggles are OYO
suspension shear wave velocity log. Numbers in the legend are RMS errors in m/s followed by average
shear wave velocity (m/s) in upper 30 meters.
0
0
OYO Vs log
68.1 206.6
11.8 235.6
20
40
60
60
80
80
Depth (m)
Depth (m)
40
100
120
100
120
140
140
160
160
180
180
200
200
220
200
400
600
800
Vs (m/s)
1000
OYO Vs log
10.7 229.9
20
1200
220
200
400
600
800
Vs (m/s)
1000
1200
Figure 4.12 Final inverted models for CCOC data using the simulated annealing method (left) and the
hybrid inversion method (right). The legends are the same as Figure 4.11.
127
Chapter 5
A joint SA inversion using both Rayleigh and
Love surface-wave dispersions
In Chapter 4 I have shown that in the absence of any a priori information or
other constraints, the SA inverted models, along with the uncertainties, give the analyst
a good idea of the underlying S-wave velocity structure. However, dispersion curves of
fundamental mode Rayleigh waves alone do not contain sufficient information to
uniquely determine a model. The velocity-depth trade-off gives rise to model nonuniqueness.
The effect of the layer thickness on surface wave dispersion curves can be
compensated by altering the layer S-wave velocities. Thin layers with a given S-wave
velocities may have the same effect on surface wave dispersion curves as thick layers
with lower S-wave velocities. This ambiguity tends to increase with depth. Figure 5.1
shows two models consisting of five layers. Changing the thickness of the fourth and
the fifth layers of the model A makes the model B. Properly adjusting the S-wave
velocities of the model B yields almost identical dispersion curves. The model B is
equivalent to the model A in the sense that this is not enough resolution for the finer
distinction between both dispersion curves.
The effect of the trade-off between the layer thickness and the S-wave velocity
also can be seen for models with different number of layers. A complicated stratigraphy
can be simplified to find an equivalent model with fewer layers. Figure 5.2 shows two
models, one with five layers (model C) and the other with eleven layers (model D), and
128
corresponding phase velocity curves (right, Fig. 5.2). Both curves are in practice
identical. Thus, the two models are equivalent.
Using other geophysical data is one way to help resolve this non-uniqueness
problem in estimating subsurface structures using the fundamental-mode Rayleigh wave
dispersion curves. Studies using receiver functions (e.g., Last et al., 1997; Özalaybey et
al., 1997; Du and Foulger, 1999; Julia et al., 2000; Chang et al., 2004), gravity (Hayashi
et al., 2005), and the reflection travel times (Dal Moro and Pipan, 2007) have all shown
that added information results in more robust models.
In this Chapter, I explore the possibility of using the fundamental-mode Love
wave dispersion curves to constrain the Rayleigh wave inversion by simulated
annealing optimization. The SA has been successfully adapted to minimize multiple
cost functions involving disparate parameters (Pullammanappallil and Louie, 1997).
Using Love waves as a constraint presents several advantages. First Love wave, like
Rayleigh waves, are dominant in either seismic or microtremor records. Second, there is
no significant increase in field acquisition effort. Both Rayleigh and Love waves could
be recorded simultaneously using a three component geophone. In the absence of 3components, Love wave can be recorded by simply swapping the vertical geophones
used for Rayleigh waves with horizontal ones. Third, forward modeling of Love waves
is much faster than that of Rayleigh waves. Lastly, Love wave dispersion curves can be
extracted using the same procedure as the standard Rayleigh ReMi data (Louie, 2001).
Both can be done simultaneously.
129
5.1 Equalized cost function
The SA inversion, described in Chapter 4, is performed on extracted
fundamental-mode dispersion picks of both Rayleigh and Love waves. Rayleigh waves
are caused by interference between P and SV waves while Love waves are SH waves
only. Thus, they provide two mutually exclusive sampling of the subsurface properties.
Both data sets are said non-commensurable.
Each data set is characterized by its own forward modeling and number of data
points. For the Rayleigh wave data set consisting of M phase velocity picks of the
fundamental Rayleigh waves, the cost function Er is defined as
Er =
1
M
m
obs
cal 2
∑ (Vri
− Vri )
i =1
(5.1)
where M is the total number of the phase velocity picks of the fundamental-mode
Rayleigh waves, Vriobs is the observed phase velocity for picks i, and Vrical is the
calculated phase velocity for picks i given a model vector m.
For the Love wave data set consisting of N phase velocity picks of the fundamentalmode Love waves, the cost function El is defined as
El =
1
N
n
obs
cal 2
∑ (Vl j − Vl j )
j =1
(5.2)
where N is the total number of the phase velocity picks of the fundamental-mode Love
cal
waves, Vl obs
j is the observed phase velocity for picks j, and Vl j is the calculated phase
velocity for picks j given a model vector m.
The simultaneous inversion of both data sets without equalization may yield
solutions that are dominated by one set. To perform the joint inversion, I define a joint
equalized cost junction E as
130
E = γ Er + (1 − γ ) El
(5.3)
where γ ⊆ [0, 1] is an influence/weighting factor, valued between zero and unit. The
factor is an a priori value that trades off between the relative influences of each data set.
ˆr
The minimization of one data set only would yield an estimate for instance m
(the case of γ = 1.0 in equation (5.3) ), while the minimization of the other data set
ˆ l (the case of γ = 0.0 in equation (5.3) ). It
would yield another estimate for instance m
is very important to realize that two estimates generally are different. For any other
value of γ, the minimization of the equalized cost junction E would yield an estimate for
ˆ r and m
ˆl .
ˆ γ that is between m
instance m
The role of the influence factor γ is illustrated by Figure 5.3 which shows that no
improvement in one cost function can be achieved without producing a simultaneous
degradation in another cost function. Zero γ corresponds to point A in Fig. 5.3 which is
the minimum solution solely based on Love wave dispersion data. Unit γ corresponds to
point B in Fig. 5.3 which is the minimum solution solely based on Rayleigh wave
dispersion data. The influence factor γ should set to a value (for example point C in Fig.
5.3) that corresponds to the point closest to the origin of the Cartesian coordinate
system. If a priori geological information is available or one set of picks is more reliable
than the other, the influence factor γ could set to a value that corresponding to this priori
knowledge. If no such knowledge is available, γ usually is set to 0.5, giving the same
weight to each of data sets.
131
5.2 Synthetic tests
Three synthetic data sets are borrowed from Noise Blind Test Project of
Grenoble, France (http://esg2006.obs.ujf-grenoble.fr/BENCH1/B1-Noise.html). The
noise synthetics are computed using 1D structures. Noise sources were approximated by
surface or subsurface forces with random force orientation and amplitude. Distribution
of sources is random in time. In space, distribution is such that around two-third of total
number of sources is randomly distributed, while one-third is spatially localized. The
source time function is either a delta-like signal or a pseudo-monochromatic signal.
Computation of the associated wave field is performed using wavenumber-based
method of Hisada (1994, 1995) for 1D horizontally layered structures. The same names
(N102, N103, and N104) are used as the Noise Blind Test Project of Grenoble, France
for each synthetic data set.
5.3 Inversion results
The non-linear SA inversion, and joint inversion methods described in Chapter 4
and 5 were tested on three synthetic models. The inverted models, along with the
original synthetics and the inverted depth-averaged velocities, are shown from Fig. 5.4
to 5.13. Table 5.1 tabulates these values which have been rounded to the one significant
decimal.
The initial models for all tests were generally set to be a 12-layer uniform halfspace with equal layer thickness. The maximum modeling depths were user-specified.
The S-wave velocities of the initial models are set from depth assuming a velocitydepth slope.
132
Table 5.1 Joint-inverted depth-averaged velocities in m/s and
percentage difference from known profiles in parentheses
V50
V100
Models*
V30
N102R
290 (34)
299 (10)
443 (19)
N102L
219 (1)
238 (13)
298 (20)
N102J
310 (43)
226 (17)
308 (17)
N103R
349 (0)
349 (0)
349 (0)
N103L
353 (1)
353 (1)
353 (1)
N103J
357 (2)
357 (2)
357 (2)
N104R
320 (5)
446 (2)
651 (15)
N104L
312 (7)
397 (9)
511 (10)
N104J
315 (6)
403 (8)
515 (9)
* Last letter of the model name R represents SA inversion on Rayleigh data, L SA
inversion on Love data, and J joint inversion on both
The SA inversion on the synthetic dispersion curve picks is carried out by
varying both S-wave velocity and layer thickness and fixing density and Poisson’s ratio
at the reasonable values (2.0 g/cm3 and 0.25, respectively). For comparison, we fix the
number of layer at 12 for all tests. The inverted models are plotted without uncertainty.
This is a blind test. No any prior geological information is available. Thus, the
simultaneous SA inversion on both Rayleigh and Love wave dispersion data is preferred
by setting the influence factor γ to 0.5 for the joint inversion.
The original synthetic model N102 is a complex shallow structure with strong
impedance contrast and complex layering including low velocity zones. This model is
chosen for verifying the capacity of inversion methods to resolve fine layering. Both
features of strong impedance contrast and low velocity zones appears on models
inverted by SA and the joint inversion methods (Fig. 5.4 and Fig. 5.5), The high
velocity layer below 150 m is better inverted by the joint method (Fig. 5.5).
133
Unfortunately the fine layers in the shallow depth are poorly resolved by either of the
inversion method. Joint inverted model shows a larger RMS error (21.3 m/s). The larger
error is due to the incorrect low velocity zone mapped between 150 m and 190 m (left,
Fig. 5.5) due to lack of sufficient picks at low frequency (right, Fig. 5.5).
The ideal shape of the cost functions should be like one in Figure 5.3. The
closer to that shape, the more reliable the model to the true profile. The evolution of the
cost functions Er (equation (5.1) ) and El (equation (5.2) ) of N102 are shown in Fig.
5.6. The long distance of the distribution envelope from the origin shows that the
control parameters of the joint inversion should be adjusted. These parameters include
the cooling schedule of the SA process, the maximum depth, the number of layers,
density, and Passion’s ratio.
The Vs value averaged to 30 (V30), 50 (V50), and 100 (V100) m for all tests are
shown in Fig. 5.7 and tabulated in Table 5.1. Compared with the next two tests, the
inverted models of N102 have the large errors of V30, V50, and V100.
The N103 is a deep model with strong impedance contrast in order to check the
ability of inversion methods to resolve deep layers. SA inversion on Rayleigh wave
dispersion data produces low velocity zones which is an inversion artifact (left, Fig.
5.8). The final model from the joint inversion matches the original synthetic model best
in both shallow and deep parts (left, Fig. 5.9). The larger RMS (12.1 m/s) of the joint
inverted model is due to bad picks of Love dispersion data above 10 Hz (right, Fig. 5.9).
Very small errors of V30, V50, and V100 (Fig. 5.7) and the relatively shorter distance of
the distribution envelope from the origin (Fig. 5.10) show that the joint inversion of
N103 is better than N102.
134
The N104 is a simple model involving shallow and deep layers. The shallow
layer is clearly inverted by the SA method. However, the deep layer is poorly defined
(left, Fig. 5.11). Layers in both shallow and deep depth are clearly resolved by the joint
inversion except a relatively low S-wave velocity near 320 m (left, Fig. 5.12). The
shape of the cost functions (Fig. 2.13) is close to that in Fig. 5.3 showing that the joint
inversion on N104 is good.
135
0
1000
10
900
20
800
700
40
Vs (m/s)
Depth (m)
30
50
60
500
400
70
300
80
Model A
Model B
90
100
600
200
400
From model A
From model B
200
600
800
Vs (m/s)
1000
0
1200
0.1
0.2
0.3
Period (s)
0.4
Figure 5.1 Two different models with same number of layers (left) and corresponding
dispersion curves (right). They are equivalent in the sense that there is not enough resolution for the finer distinction.
0
10
900
20
800
700
40
Vs (m/s)
Depth (m)
30
50
60
500
400
70
300
80
Model C
Model D
90
100
600
200
400
From model C
From model D
200
600
800
Vs (m/s)
1000
1200
0
0.1
0.2
Period (s)
0.3
0.4
Figure 5.2 Two different models with different number of layers (left) and corresponding dispersion curves (right). Both models yield almost identical dispersion curves.
136
El
C
(0, 0)
Er
Figure 5.3 A theoretical distribution of the value of the cost
function for the joint inversion. Point A and B are the minimum
points of the cost function El and Er, respectively. Point C represents the minimum value of the joint cost function E. Point C
would be closer to A for γ<0.5 and to B for γ>0.5
137
0
1500
Picked Love
Calculated Love
Picked Rayleigh
Calculated Rayleigh
Synthetic N102
Love RMS=19.4
Rayleigh RMS=9.2
50
100
150
Vs (m/s)
Depth (m)
1000
200
500
250
300
350
0
500
1000
1500
Vs (m/s)
2000
2500
0
3000
0
5
10
15
Frequency (Hz)
20
Figure 5.4 SA inversion results of the data N102. Inverted models based on Rayleigh and
Love dispersion data are plotted atop of the original true synthetic model (solid thin line) as
the dash thick line and the solid thick line on the left graph, respectively. The calculated Rayleigh dispersion curve is based on the model in dash thick line on the left graph; the calculated Love dispersion curve is based on the model in solid thick line on the left graph. Both
calculated curves are plotted atop of the observed dispersion data on the right graph.
0
1500
Picked Love
Calculated Love
Picked Rayleigh
Calculated Rayleigh
Synthetic N102
Final RMS=21.3
50
100
150
Vs (m/s)
Depth (m)
1000
200
500
250
300
350
0
500
1000
1500
Vs (m/s)
2000
2500
3000
0
0
5
10
15
Frequency (Hz)
20
Figure 5.5 Joint inversion results of the data N102. The inverted model and the original true
synthetic model are plotted as the solid thick line and the solid thin line on the left graph,
respectively. The calculated dispersion curves of Rayleigh and Love waves based on the joint
inverted model (solid thick line on the left) are plotted atop of the observed dispersion data
on the right graph.
138
30
25
El (m/s)
20
15
10
5
0
0
10
20
30
Er (m/s)
40
50
60
Figure 5.6 The distribution of the value of cost functions for joint inversion on N102. X-axis is the cost
function Er calculated from equation (5.1) in the text
on Rayleigh wave dispersion picks. Y-axis represents
the cost function El calculated from equation (5.2) on
Love wave dispersion picks.
139
400
Original Vs30, m/s
Inverted Vs30, m/s
350
300
250
200
150
100
50
0
N102R
N102L
N102J
N103R
N103L
N103J
N104R
N104L
N104J
Original Vs50, m/s
500
Inverted Vs50, m/s
450
400
350
300
250
200
150
100
50
0
N102R
N102L
N102J
N103R
N103L
N103J
N104R
N104L
700
N104J
Original Vs100, m/s
Inverted Vs100, m/s
600
500
400
300
200
100
0
N102R
N102L
N102J
N103R
N103L
N103J
N104R
N104L
Figure 5.7 The depth-averaged velocities in m/s against the known
values for both SA and joint inverted models.
N104J
140
0
1400
Synthetic N103
Love RMS=10.7
Rayleigh RMS=8.3
Picked Love
Calculated Love
Picked Rayleigh
Calculated Rayleigh
1200
1000
Vs (m/s)
Depth (m)
500
800
600
1000
400
200
1500
0
1000
2000
Vs (m/s)
3000
4000
0
5
10
15
Frequency (Hz)
Figure 5.8 SA inversion results of the data N103. The legends are the same as figure 5.4.
0
1400
Synthetic N103
Final RMS=12.1
Picked Love
Calculated Love
Picked Rayleigh
Calculated Rayleigh
1200
1000
Vs (m/s)
Depth (m)
500
800
600
1000
400
200
1500
0
1000
2000
Vs (m/s)
3000
4000
0
5
10
15
Frequency (Hz)
Figure 5.9 Joint inversion results of the data N103. The legends are the same as figure 5.5.
141
25
20
El (m/s)
15
10
5
0
0
10
20
30
40
50
Er (m/s)
Figure 5.10 The distribution of the value
of cost functions of the joint inversion
on N103. The legends are the same as
figure 5.6.
142
0
Synthetic N104
Love RMS=2.8
Rayleigh RMS=10.2
50
Picked Love
Calculated Love
Picked Rayleigh
Calculated Rayleigh
1600
1400
1200
150
Vs (m/s)
Depth (m)
100
200
1000
800
600
250
400
300
350
200
0
1000
2000
Vs (m/s)
3000
0
4000
0
5
10
15
Frequency (Hz)
20
Figure 5.11 SA inversion results of the data N104. The legends are the same as figure 5.4.
0
Synthetic N104
Final RMS=18.8
50
Picked Love
Calculated Love
Picked Rayleigh
Calculated Rayleigh
1600
1400
1200
150
Vs (m/s)
Depth (m)
100
200
1000
800
600
250
400
300
350
200
0
1000
2000
Vs (m/s)
3000
4000
0
0
5
10
15
Frequency (Hz)
20
Figure 5.12 Joint inversion results of the data N104. The legends are the same as figure 5.5.
143
160
140
El (m/s)
120
100
80
60
40
20
0
0
20
40
60
Er (m/s)
80
100
Figure 5.13 The distribution of the
value of cost functions of joint inversion on N104. The legends are the same
as figure 5.6.
144
Chapter 6 Summary and suggestions
6.1 Summary
The dispersion curves of surface waves have been successfully used for the
characterization of the shallow subsurface for decades. Three steps are involved in
utilizing dispersion curves of surface waves for imaging geological profiles:
1) acquire high-frequency broadband ground roll,
2) create efficient and accurate algorithms organized in a basic data processing
sequence designed to extract surface wave dispersion curves from the ground roll, and
3) develop stable and efficient inversion algorithms to obtain shear wave
velocity profiles.
This dissertation focuses on the third step, the inversion of the dispersion curves
of surface waves, with the aim of searching the best procedure to get a more accurate
and reliable estimate of the geological material properties. The inversion actually is
comprised of two sub-steps:
3a) estimate a model employing the theory of surface wave propagation and
mathematical optimization;
3b) appraise the model for its accuracy, either deterministically or statistically.
This study uses surface waves contained in ambient seismic noise. The
extraction of dispersion curves of surface waves is achieved by the refraction
microtremor (ReMi) technique (Louie, 2001), licensed as SeisOpt ReMi (©, Optim Inc.)
software.
145
Forward modeling is very important for the accuracy of inversion (Chapter 2).
Based on the R/T method, this study achieved a faster algorithm (called RTgen) to
compute dispersion curves of surface waves while keeping the stability of the R/T
method. The improvements by this study include 1) computation of the generalized
reflection and transmission coefficients without calculation of the modified reflection
and transmission coefficients; 2) presenting an analytic solution for the inverse of the
4X4 layer matrix E. Compared with traditional R/T methods, RTgen, when applied on
Rayleigh waves, significantly improves the speed of computation, cutting the
computational time at least by half while keeping the stability of the traditional R/T
method.
One of major goals of this study is to find the shallow S-wave velocity structure
that explains the observed dispersion curves of surface waves. This is achieved by
geophysical inversion that involves the minimization of the cost/objective function that
characterizes the differences between observed and calculated dispersion data.
However, due to discrete nature of geophysical inversion problems, the model obtained
from the inversion of the data is therefore not necessarily equal to the true model that
one seeks. This implies that for realistic problems, inversion really consists of model
estimation followed by model appraisal.
General speaking, there are three catalogs of inversion techniques based on the
internal physical principle of the geophysical problems: linear inversions, non-linear
inversions, and trial and error methods. In this dissertation, I employed a linear
inversion technique, a non-linear inversion method, and a joint method on the
dispersion data of surface waves.
146
Chapter 3 explores the Occam’s linear inversion technique with a higher-order
Tikhonov regulization. The proposed inversion procedure contains two fundamental
components. First, an algorithm (forward engine) is required to construct a theoretical
dispersion curve based on the properties of an assumed profile. Second, an algorithm is
required to minimize the objective function which is usually the error between the
theoretical and experimental dispersion curves plus a damping term. The advantages of
the application of the Occam’s inversion are 1) the analytic form of the Jacobian matrix
given by RTgen improves the accuracy of the inversion comparing with other linear
method implementation during which the Jacobian matrix is numerically given; 2) the
model in the current iteration directly yields the model in the next iteration without
model update; 3) data noise is incorporated in the inversion process. The blind tests on a
suite of nine synthetic models and two field data sets show that the final model is
heavily influenced by a) the initial model (in terms of the number of layers and the
initial S-wave velocity of each layer); b) the minimum and the maximum depth of
profiles; c) the number of dispersion picks; d) the frequency density of dispersion picks;
and e) other noise. When these effect factors are appropriately set either known or
generated using a priori information about subsurface structure, the Occam’s inversion
finds an optimal solution that is the global minimum of a misfit function. If a priori
information is either scant or unavailable, the inversion may find a local optimal
solution.
One way to help resolve this initial-model-dependence of the Occam’s inversion
is non-linear inversion techniques due to the non-linear nature of the surface wave
dispersion phenomenon. The simulated annealing (SA) inversion technique is explored
147
in Chapter 4. Following previous developments I modified the SA inversion yielding
one-dimensional shallow S-wave velocity profiles from high frequency fundamentalmode Rayleigh dispersion curves and validated the inversion with blind tests. Unlike
previous applications of SA, this study draws random numbers from a standard
Gaussian distribution. The numbers simultaneously perturb both S-wave velocities and
layer thickness of models. The annealing temperature is gradually decreased following a
polynomial-time cooling schedule. Phase velocities are calculated using the reflectivitytransmission method. The reliability of the model resulting from our implementation is
evaluated by statistically calculating the expected values of model parameters and their
covariance matrices. Blind tests on the same data sets as these in Chapter 3 show that
the SA implementation works well for S-wave velocity inversion of dispersion curves
from high-frequency fundamental-mode Rayleigh waves. Blind estimates of layer Swave velocities fall within one standard deviation of the velocities of the original
synthetic models in 78% of cases. A hybrid method is also tried in Chapter 4. The
hybrid idea is that the models obtained by the SA can used as input to the Occam’s
inversion. Tests show that the hybrid method does not always provide better results.
In the absence of any a priori information or other constraints, the SA inverted
models, along with the uncertainties, give the analyst a good idea of the underlying Swave velocity structure. However, dispersion curves of fundamental mode Rayleigh
waves alone do not contain sufficient information to uniquely determine a model. The
velocity-depth trade-off gives rise to model non-uniqueness.
To reduce the problem of the model non-uniqueness, in Chapter 5 I explore the
possibility of using the fundamental-mode Love wave dispersion curves to constrain the
148
Rayleigh wave inversion by the SA optimization. The SA technique described in
Chapter 4 is applied on the dispersion data of both fundamental-mode Love and
Rayleigh waves with equal weighting factor. Three synthetic tests show that Love wave
constraints result in significant improvement of inverted model in terms of resolution of
low velocity zones and high velocity contrasts.
6.2 Suggestions
A natural extension of the joint inversion method will be to add both
fundamental Love wave and higher-mode Rayleigh wave as constraints on Rayleigh
wave inversion. Higher modes of surface wave dispersion curves are independent from
the fundamental-mode phase velocities. Using higher modes as constraints presents
several advantages. First, higher modes have faster phase velocities, thus can penetrate
a greater depth. They exist under a specific frequency condition (Aki and Richards,
2002). It has been reported that the generation of higher modes has been associated with
presence of a velocity reversal (Stokoe et al., 1994) and that higher mode surface
waves, when trapped in a layer, are much more sensitive to the fine structure of the Swave velocity field (Kovach, 1965). Second, in some situations higher modes take more
energy than the fundamental mode does in a higher frequency range, which means the
fundamental mode data may not be available in the higher frequency range and higher
modes are the only choice (Xia et al., 2003). Third, reliable dispersion curves of higher
modes have been observed with ReMi techniques (personal communication with
Pullammanappallil, 2007). Fourth, there is no significant increase in field acquisition
149
effort with ReMi surveys. Finally, current forward code (RTgen) is able to calculate the
dispersion curves of higher modes.
Beaty et al. (2002) used a different implementation of SA to invert fundamental
and higher-mode Rayleigh wave dispersion curves for an S-wave velocity profile. Xia
et al. (2003) used a linear inversion method to invert high frequency surface waves with
fundamental and higher modes of Rayleigh waves. Their tests showed that higher mode
data stabilize the inversion procedure and increase the resolution of the inverted S-wave
velocities. However, errors of the inverted S-wave velocities are large (Xia et al., 2003).
Using both higher modes of Rayleigh waves and fundamental-mode Love waves may
solve the problem.
Two dimensional S-wave velocity structures are desirable for seismic-resistant
design of crucial buildings. This presents more challenges on data acquisition,
dispersion curve extraction, and 2D model inversion.
3D acquisition of microtremors is usually made on arrays of instruments placed
in two-dimensional configurations such as triangle and circle. Extraction of the phase
velocities can be done using beam-forming or frequency-wavenumber (f-k) methods
(e.g., Horike, 1985; Liu et al., 2000), or by using the SPAC method first proposed by
Aki (1957) and now experiencing a resurgence of interest (e.g., Okada, 2003; Asten,
2005a, 2005b).
Optim acquires 3D ambient noise records by successively deploying
independent ReMi linear arrays in two-dimensional configurations such as cross and
star shape. Assuming a consistent noise wavefield, successively recording of surface
waves on each linear line rather than simultaneously on the two-dimensional field
150
configurations for analysis is sufficient. The phase velocities are independently
extracted for each line using ReMi processing routine and independently inverted. The
final 2D model is obtained by extrapolating these independently inverted 1D profiles.
However, these independently inverted profiles are significantly different even for an
area where the lateral S-wave velocity heterogeneity is small.
One possible way to reduce the problem is 3D τ-p transformation from time
domain (x, y, t), to τ-p domain (px, py, τ) followed by 1D Fourier transformation to f-p
domain (px, py, ω). This technique has been tested on synthetic data (Donati and Matin,
1995). The new 3D τ-p transformation provides opportunity for simultaneous inversion
of a 2D S-wave velocity model.
It is possible to implement the SA in a parallel computer. Under a fixed
temperature, a parent model is perturbed on a master node to generate a serial of child
models which is then distributed to nodes for forward calculation of dispersion curves.
These calculated dispersion curves are returned to the master node. Only one model,
which is the basis for next iteration, is selected by comparing these curves and the
conditional probabilities at the temperature. The whole process is repeated until the stop
criterion is satisfied. The forward calculations of dispersion curves, which is the most
CPU-consuming part of SA, are performed simultaneously on each node. The
implementation of SA in a parallel computer will significantly speed up the inversion.
151
REFERENCES
Aki, K., 1957, Space and time spectra of stationary stochastic waves, with special reference
to microtremors, Bulletin of Earthquake Research Institute 25, 415-457.
Aki, K., 1993, Local site effects on weak and strong ground motion: Tectonophysics 218,
93-111.
Aki, K., and P. G. Richards, 2002, Quantitative Seismology: University Science Books.
Anderson, J. G., 1991, Strong motion seismology: Reviews of Geophysics 29, 700-720.
Arai, H. and K. Tokimatsu, 2004, S-wave velocity profiling by inversion of microtremor
H/V spectrum: Bulletin of the Seismological Society of America 94, 53-63.
Arai, H. and K. Tokimatsu, 2005, S-wave velocity profiling by joint inversion of
microtremor dispersion curve and horizontal-to-vertical (H/V) spectrum: Bulletin of
the Seismological Society of America 95, 1766-1778.
Asten, M. W., 2005a, An assessment of information on the shear-velocity profile at Coyote
Creek, San Jose, from SPAC processing of microtremor array data, Blind
comparisons of shear-wave velocities at closely-spaced sites in San Jose, California,
M. W. Asten and D. M. Boore (Editors), U. S. Geological Survey Open-File Report
2005-1169, part 2, 21 pp. [available on the World Wide Web at
http://pubs.usgs.gov/of/2005/1169/].
Asten, M. W. and D. M. Boore, 2005b, Comparison of shear-velocity profiles of
unconsolidated sediments near the Coyote borehole (CCOC) measured with
fourteen invasive and non-invasive methods, Blind comparisons of shear-wave
velocities at closely-spaced sites in San Jose, California, M. W. Asten and D. M.
Boore (Editors), U. S. Geological Survey Open-File Report 2005-1169, part 1, 35
pp. [available on the World Wide Web at http://pubs.usgs.gov/of/2005/1169/].
Asten, M. W., W. R. Stephenson, and P. Davenport, 2005c, S-wave velocity profile for
Holocene sediments measured from microtremor array studies, SCPT, and seismic
refraction: Journal of Engineering and Environmental Geophysics, 10, 235-242.
Aster, R. C., B. Borchers, and C. H. Thurber, 2005, Parameter Estimation and Inverse
Problems, Elsevier, New York.
ASTM (American Society for Testing and Materials) D4428/D4428M-00, 2003, Standard
Test Methods for Crosshole Seismic Testing, Vol 04.08 Soil and Rock (I).
Backus, G., and J. F. Gilbert, 1967, Numerical applications of a formalism for geophysical
inverse problems: Geophysical Journal of the Royal Astronomical Society 13, 247276.
152
Backus, G., and J. F. Gilbert, 1968, The resolving power of gross earth data: Geophysical
Journal of the Royal Astronomical Society 16, 169-205.
Bard, P. Y., 1998, Microtremor measurements: a tool for site effect estimation?, State-ofthe-art paper, Second International Symposium on the Effects of Surface Geology
on Seismic Motion, Yokohama, Balkema 3, 1251-1279.
Beaty, K. S., D. R. Schmitt, and M. Sacchi, 2002, Simulated annealing inversion of
multimode Rayleigh wave dispersion curves for geological structure: Geophysical
Journal International 151, 622-631.
Bonnefoy-Claudet, S.; C. Cornou; P. Bard; F. Cotton; P. Moczo; J. Kristek; D. Fah, 2006,
H/V ratios; a tool for site effects evaluation; results from 1-D noise simulations:
Geophysical Journal International 167, 827-837.
Boore, D. M. and M. N. Toksšz, 1969, Rayleigh wave particle motion and crustal structure:
Bulletin of the Seismological Society of America 59, 331-346.
Boore, D. M., 1972, Finite difference methods for seismic wave propagation in
heterogeneous materials, in Methods in computational physics, ed. B. A. Bolt,
Academic Press, 1-37.
Boore, D. M., 2006, Determining subsurface shear-wave velocities: A review, Third
International Symposium on the Effects of Surface Geology on Seismic Motion (P.Y. Bard, E. Chaljub, C. Conrou, F. Cotton, and P. Gueguen, Editors), Grenoble,
France, http://quake.wr.usgs.gov/~boore/pubs_online.php
Boore, D. M., and E. M. Thompson, 2007, On using surface-source downhole-receiver
logging to determine seismic slownesses, Soil Dynamics and Earthquake
Engineering 97, in press http://quake.wr.usgs.gov/~boore/pubs_online.php
Borcherdt, R. D., and G. Glassmoyer, 1994, Influences of local geology on strong and weak
ground motions in the San Francisco Bay region, California, and their implications
for site-specific code provisions, in The Loma Prieta Earthquake of October 17,
1989 – Strong Ground Motion, R.D. Borcherdt, (ed.), U.S. Geological Survey
Professional Paper 1551-A, A77-A108.
Brown, L. T., D. M. Boore, and K. H. Stokoe, II , 2002, Comparison of shear-wave
slowness profiles at ten strong-motion sites from noninvasive SASW measurements
and measurements made in boreholes, Bulletin of the Seismological Society of
America 92, 3116-3133.
Brune, J. N. and J. Oliver, 1959, The seismic noise of the earth's surface: Bulletin of the
Seismological Society of America 49, 349-353.
153
BSSC (Building Seismic Safety Council), 2000, NEHRP Recommended Provisions for
Seismic Regulations for New Buildings and Other Structures, Part 1: Provisions:
FEMA 368, Federal Emergency Management Agency, Washington, D. C.
Burkhard, N. R., and D. D. Jackson, 1976, Density and surface wave inversion:
Geophysical Research Letters 3, 637-638.
Campillo, M., and A. Paul, 2003, Long-range correlations in the diffuse seismic coda:
Science 299, 547–549.
Chang, S. J., C. E. Baag, and C. A. Langston, 2004, Joint analysis of teleseismic receiver
functions and surface wave dispersion using the genetic algorithm: Bulletin of the
Seismological Society of America 94, 691-704.
Chapman, C. H., 2003, Yet another elastic plane-wave, layer-matrix algorithm:
Geophysical Journal International 154, 212-223.
Chen, X., 1993, A systematic and efficient method of computing normal modes for
multilayered half-space: Geophysical Journal International 115, 391-409.
Clayton, R. W., and B. Engquist, 1977, Absorbing boundary conditions for acoustic and
elastic wave equations: Bulletin of the Seismological Society of America 67, 15291540.
Clayton, R. W., and B. Engquist, 1980, Absorbing boundary conditions for wave-equation
migration: Geophysics 45, 895-904.
Constable, S. C., R. L. Parker, and C. G. Constable, 1987, Occam's inversion: a practical
algorithm for generating smooth models from electromagnetic sounding data:
Geophysics 52, 289-300.
Cuellar, V., 1997, Geotechnical applications of the spectral analysis of surface waves: in
Modern geophysics in enginering geology, eds. D. M. MaCann, M. Eddleston, P. J.
Fenning, and G. M. Reeves, (Geological Society Engineering Geology Special
Publication No. 12) 53-62.
Dal Moro, G., M. Pipan, and P. Gabrielli, 2007, Rayleigh wave dispersion curve inversion
via genetic algorithms and marginal posterior probability density estimation: Journal
of Applied Geophysics 61, 39-55.
Dorman, J., and M. Ewing, 1962, Numerical inversion of surface-wave dispersion data and
crust-mantle structure in the New York-Pennsylvania area: Journal of Geophysical
Research 67, 5227-5241.
Donati, M. S., and N. W. Matin, 1995, Seismic reconstruction using a 3-D tau-p transform:
http://www.crewes.org/Reports/1995/1995-11.pdf
154
Du, Z. J., and G. R. Foulger, 1999, The crustal structure beneath the northwest fjords,
Iceland, from receiver functions and surface waves: Geophysical Journal
International 139, 419-432.
Field, E. H., K. H. Jacob, and S. E. Hough, 1992, Earthquake site response estimation: a
weak-motion case study: Bulletin of the Seismological Society of America 82,
2283-2306.
Forbriger, T., 2003a. Inversion of shallow-seismic wavefields. Part I: Wavefield
transformation: Geophysical Journal International 153, 719-734.
Forbriger, T., 2003b. Inversion of shallow-seismic wavefields. Part II: Inferring subsurface
properties from wavefield transforms: Geophysical Journal International 153, 735752.
Geman, S. and D. Geman, 1984, Stochastic relaxation, Gibbs distributions, and the
Bayesian restoration of images: IEEE Transactions On Pattern Analysis And
Machine Intelligence 6, 721-741.
Haskell, N. A., 1953, The dispersion of surface waves on multilayered media: Bulletin of
the Seismological Society of America 43, 17-34.
Hayashi, K., T. Matsuoka, and H. Hatakeyama, 2005, Joint analysis of a surface-wave
method and micro-gravity survey: Journal of Environmental & Engineering
Geophysics 10, 175-184.
Heath, K., J. N. Louie, G. Biasi, A. Pancha, and S. K. Pullammanappallil, 2006, Blind tests
of refraction microtremor analysis against synthetics and borehole data: CD-ROM
Proceedings of the 100th Anniversary Earthquake Conference.
Herrmann, R. B., G. I., Al-Eqabi, 1991, Surface wave inversion for shear velocity: in J. M.
Hoven et al. (eds), Shear waves in marine sediments, Kluwer Academic Publisher,
545-556
Herrmann, R. B., and C. J. Ammon, 2002, Computer Programs in Seismology version 3.20:
Surface Waves, Receiver Functions, and Crustal Structure, St. Louis University,
Missouri.
Hisada, Y., 1994, An efficient method for computing Green's functions for a layered
halfspace with sources and receivers at close depths: Bulletin of the Seismological
Society of America 84, 1456-1472.
Hisada Y., 1995, An efficient method for computing Green's functions for a layered
halfspace with sources and receivers at close depths (Part 2): Bulletin of the
Seismological Society of America 85, 1080-1093.
155
Holzer, T. L., M. J. Bennett, T. E. Noce, and J. C. Tinsley III, 2005, Shear-wave velocity
of surface geologic sediments in northern California: Statistical distributions and
depth dependence, Earthquake Spectra 21, 161-177.
Horike, M., 1985, Inversion of phase velocity of long-period microtremors to the S-wavevelocity structure down to the basement in urbanized areas: Journal of Physics of
the Earth 33, 59-96.
Horike, M., 1988, Analysis and simulation of seismic ground motions observed by an array
in a sedimentary basin: Journal of Physics of the Earth 36, 135-154.
Julia, J., C. J. Ammon, R. B. Herrmann, and A. M. Correig, 2000, Joint.inversion of
receiver function and surface wave dispesion observations: Geophysical Journal
International 143, 99-112.
Kausel, E. A., and J. M. Roesset, 1981, Stiffness matrices for layered soils: Bulletin of the
Seismological Society of America 71, 1743-1761.
Kennett, B. L. N., 1983, Seismic wave propagation in stratified media, Cambridge
University Press, Cambridge, 207 pp.
Kirkpatrick, S., C. D. Jr., and M. P., Vecchi, 1983, Optimization by simulated annealing,
Science 220, 671-680.
Kennett, B. L. N., 1983, Seismic wave propagation in stratified media: Cambridge
University Press.
Kobayashi, N., and K. Nishida, 1998, Continuous excitation of planetary free oscillations
by atmospheric disturbances: Nature 395, 357-360
Kovach, R. L., 1965, Seismic surface waves: some observations and recent developments.
In: L.H. Ahrens, Frank Press, S.K. Runcorn and H.C. Urey, Editors, Physics and
Chemistry of the Earth 6, Pergamon Press, Oxford, 251-314.
Lai, C. G., S. Foti, and G. J. Rix, 2005, Propagation of data uncertainty in surface wave
inversion: Journal of Engineering and Environmental Geophysics 10, 219-228.
Larsen, S., and J. Grieger, 1998, Elastic modeling initiative, Part III: 3-D computational
modeling: 68th Annual International Meeting, SEG, Expanded Abstracts, 18031806.
Last, R. J., A. A. Nyblade, and C. A. Langston, 1997, Crustal structure of the East African
plateau from receiver functions and Rayleigh wave phase velocities: Journal of
Geophysical Research 102, 24,469-24,483.
156
Lay, T. and Wallace, T.C., 1995, Modern global seismology: Academic Press.
Levenberg, K., 1944, A method for the solution of certain nonlinear problems in least
squares: Quarterly of Applied Mathematics 2, 164-168.
Liu, H. P., R. E. Warrick, R. E. Westerlund, J. B. Fletcher, and G. L. Maxwell, 1988, An
air-powered impulsive shear-wave source with repeatable signals: Bulletin of the
Seismological Society of America 78, 355-369
Liu, H. P., D. M. Boore, W. B. Joyner, D. H. Oppenheimer, R. E. Warrick, W. Zhang, J. C.
Hamilton, and L. T Brown, 2000, Comparison of phase velocities from array
measurements of Rayleigh waves associated with microtremor and results
calculated from borehole S-wave velocity profiles: Bulletin of the Seismological
Society of America 90, 666-678.
Liu, Y., B. Luke, S. K. Pullammanappallil, J. N. Louie, and J. Bay, 2005, Combining
active- and passive source measurements to profile S-wave velocities for seismic
microzonation: GeoFrontiers, American Society of Civil Engineers Geotechnical
Special Publications 133, 977-990.
Louie, J. N., 2001, Faster, better: Shear-wave velocity to 100 meters depth from refraction
microtremor arrays: Bulletin of the Seismological Society of America 91, 347-364.
Louis, S. J., Q. Chen, S. Pullammanappallil, 1999, Seismic velocity inversion with genetic
algorithms, CEC99, 1999 Congress on Evolutionary Computation, Mayflower Hotel,
Washington D.C., 855 – 861.
Luco, J. E. and R. J. Apsel, 1983, On the Green’s function for a layered half-space, Part I:
Bulletin of the Seismological Society of America 73, 909-929.
Luke, B., C. Calderón, R. C. Stone, and M. Huynh, 2003, Non-uniqueness in inversion of
seismic surface-wave data: the Proceedings of Symposium on the Application of
Geophysics to Engineering and Environmental Problems (SAGEEP), Environmental
and Engineering Geophysical Society, 1342-1347.
Lysmer, J., and L. A. Drake, 1972, A finite element method for seismology, in Methods in
computational physics, ed. B. A. Bolt, Academic Press, 181-216.
Martinez, M. D., X. Lana, J. Olarte, J. Badal, and J. A. Canas, 2000, Inversion of Rayleigh
wave phase and group velocities by simulated annealing: Physics of the Earth and
Planetary Interiors 122, 3-17.
McNamara, D. E. and R. P. Buland, 2004, Ambient Noise Levels in the Continental United
States, Bulletin of the Seismological Society of America 94, 1517-1527.
157
Menke, W., 1989, Geophysical Data Analysis - Discrete Inverse Theory, Academic Press,
New York.
Metropolis, N., A. Rosenbluth, M. Rosenbluth, A. Teller, and E. Teller, 1953, Equation of
state calculations by fast computing machines: Journal of Chemical Physics 21,
1087-1092.
Nazarian, S., and K. H. II. Stokoe, 1985, Near-surface soil profiling with Rayleigh waves:
Geophysics 50, 1204-1205.
Nighbor, R.L., and T. Imai, 1994, The suspension P-S velocity logging method:
Geophysical characterization of sites: International Science Publisher, New York, p.
57-61.
Okada, H., 2003, The microtremor survey method, Geophysical Monograph Series, no. 12:
Society of Exploration Geophysicists.
O’Neill, A., 2003, Full-waveform reflectivity for modeling, inversion and appraisal of
surface wave dispersion in shallow site investigations: Ph.D. dissertation, University
of Western Australia, Perth.
Ozalaybey, S., M. K. Savage, A. F. Sheehan, J. N. Louie, and J. N. Brune, 1997, Shearwave velocity structure in the northern Basin and Range province from the
combined analysis of receiver functions and surface waves: Bulletin of the
Seismological Society of America 87, 183-199.
Park, C. B., R. D. Miller, and J. Xia, 1998, Imaging dispersion curves of surface waves on
multi-channel record: [Expanded Abstract]: Society of Exploration Geophysics,
1377-1380.
Park, C. B., R. D. Miller, and J. Xia, 1999, Multi-channel analysis of surface waves:
Geophysics 64, 800-808.
Park, C. B., R. D. Miller, J. Xia, and J. Ivanov, 2004, Imaging dispersion curves of passive
surface waves: SEG Expanded Abstracts: Society of Exploration Geophysics (NSG
1.6).
Park, C.B., R. D. Miller, N. Ryden, J. Xia, and J. Ivanov, 2005, Combined use of active and
passive surface waves: Journal of Engineering and Environmental Geophysics 10,
323-334.
Parker, R. L., 1994, Geophysical inverse theory, Princeton University Press, Princeton,
New Jersey.
Peterson, J., 1993, Observations and modelling of background seismic noise: Open-file
report 93-322, U. S. Geological Survey, Albuquerque, New Mexico.
158
Phillips C., G. Cascante, D. J. Hutchinson, 2004, Evaluation of horizontal homogeneity of
geomaterials with the distance analysis of surface waves: Canadian Geotechnical
Journal 41, 212-226.
Press, W. H., S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, 1992, Numerical
recipes in C: Cambridge University Press.
Pullammanappallil, S. K. and J. N. Louie, 1993, Inversion of seismic reflection traveltimes
using a nonlinear optimization scheme: Geophysics 58, 1607-1620.
Pullammanappallil, S. K. and J. N. Louie, 1994, A generalized simulated-annealing
optimization for inversion of first-arrival times: Bulletin of the Seismological
Society of America 84, 1397-1409.
Pullammanappallil, S. K., and J. N. Louie, 1997, A combined first-arrival travel time and
reflection coherency. optimization approach to velocity estimation: Geophysical
Research Letters 24, 511-514.
Reeves, C. R., J. E. Rowe, 2003, Genetic algorithms - principles and perspectives, Kluwer,
Norwell, Kluwer Academic Publisher.
Rhie, J., and B. Romanowicz, 2004, Excitation of Earth's continuous free oscillations by
atmosphere-ocean-seafloor coupling: Nature 431, 552-556.
Rodriguez-Zuniga, J. L., C. Ortiz-Aleman, G. Padilla, R. Gaulon, 1997, Application of
genetic algorithms to constrain shallow elastic parameters using "in situ" ground
inclination measurements: Soil Dynamics and Earthquake Engineering 16, 223-234.
Ross, S. M., 2003, Introduction to probability models: Academic Press, Inc.
Rothman, D. H., 1985, Nonlinear inversion, statistical mechanics, and residual statics
estimation: Geophysics 50, 2784-2796.
Sabra, K. G., P. Gerstoft, P. Roux, W. A. Kuperman, and M. C. Fehler, 2005, Extracting
time-domain Greens function estimates from ambient seismic noise, Geophysical
Research Letter 32, L03310, doi:10.1029/2004GL021862.
Saito, M., 1988, DISPER80: a subroutine package for calculation of seismic normal-mode
solutions: in Seismological algorithms; computational methods and computer
programs, ed., D. J. Doornbos, Academic Press, 293-320.
Sambridge, M., Mosegaard, K., 2002. Monte Carlo Methods in Geophysical Inverse
Problems: Reviews of Geophysics 40, 3.1 -3.29.
Scales, J., and R. Snieder, 1997, To Bayes or not to Bayes?: Geophysics 62, 1045-1046.
159
Scherbaum F., K. G. Hinzen and M. Ohrnberger, 2003, Determination of shallow shear
wave velocity profiles in the Cologne, Germany area using ambient vibrations:
Geophysical Journal International 152, 597-612.
Scott, J. B., M. Clark, T. Rennie, A. Pancha, H. Park and J. N. Louie, 2004, A shallow Swave velocity transect across the Reno, Nevada area basin: Bulletin of the
Seismological Society of America 94, 2222-2228.
Scott, J. B., T. Rasmussen, B. Luke, W. Taylor, J. L. Wagoner, S. B. Smith, and J. N. Louie,
2006, Shallow shear velocity and seismic microzonation of the urban Las Vegas,
Nevada basin: Bulletin of the Seismological Society of America 96, 1068-1077.
Sen, M., and P. L. Stoffa, 1995, Global optimization methods in geophysical inversion:
Elsevier Science Publishing Company.
SESAME, 2004, Guidelines for the implementation of the H/V spectral ratio technique on
ambient vibrations - Measurements, processing and interpretation: SESAME
European research project WP12 – Deliverable D23.12, http://sesame-fp5.obs.ujfgrenoble.fr/SES_Reports.htm
Shapiro, N. M., M. Campillo, L. Stehly, and M. H. Ritzwoller, 2005, High-resolution
surface wave tomography from ambient seismic noise: Science 307, 1615–1617.
Shearer, P., 1999, Introduction to seismology: Cambridge University Press.
Sheriff, R. E., 1991, Encyclopedic dictionary of exploration geophysics: Society of
Exploration Geophysicists.
Snieder, R., and J. Trampert, 1999, Inverse problems in geophysics, in Wave field
Inversion, Ed. A. Wirgin, Springer Verlag, New York, 119-190.
Stephenson, W. J., J. N. Louie, S. K. Pullammanappallil, R. A. Williams, and J. K. Odum,
2005, Blind S-wave velocity comparison of ReMi and MASW results with
boreholes to 200 m in Santa Clara Valley: Implications for earthquake ground
motion assessment: Bulletin of the Seismological Society of America 95, 2506-2516.
Stoffa, P. L., and M. K. Sen, 1991, Nonlinear multiparameter optimisation using genetic
algorithms: inversion of plane wave seismograms: Geophysics 56, 1794-1810.
Stokoe, K. H., S. G. Wright, J. A. Bay, and J. M. Roësset, 1994, Characterization of
geotechnical sites by SASW method. In: R.D. Woods, Editor, Geophysical
Characterization of Sites, ISSMFE Technical Committee #10, Oxford Publishers,
New Delhi (1994), 15-25.
160
Takeuchi, H., and M. Saito, 1972, Seismic surface waves: in Methods in computational
physics, ed. B. A. Bolt, Academic Press, 217-294.
Tanimoto, T., 2005, The oceanic excitation hypothesis for the continuous oscillations of the
Earth: Geophysical Journal International 160, 276-288.
Tanimoto, T., 2007, Excitation of normal modes by non-linear interaction of ocean waves:
Geophysical Journal International 168, 571-582.
Tarantola, A., 2005, Inverse Problem Theory and Methods for Model Parameter estimation,
Society of Industrial and Applied Mathematics, Philadelphia, available online at
http://www.ipgp.jussieu.fr/~tarantola/Files/Professional/Books/index.html
Tarantola, A., 1987, Inverse problem theory; methods for data fitting and model. parameter
estimation, Elsevier.
Thelen, W. A., M. Clark, C. T. Lopez, C. Loughner, H. Park, J. B. Scott, S. B. Smith, B.
Greschke, and J. N. Louie, 2006, A transect of 200 shallow shear velocity profiles
across the Los Angeles Basin: Bulletin of the Seismological Society of America 96,
1055-1067.
Thomson, W. T., 1950, Transmission of elastic waves through a stratified solid medium:
Journal of applied Physics 21, 89-93.
Toksoz M. N. and R. T. Lacoss, 1968, Microseisms: mode structure and sources: Science
159, 872-873.
Trampert, J., 1998, Global seismic tomography: the inverse problem and beyond: Inverse
Problems 14, 371-385, 1998.
Wentworth, C. M., and J. C. Tinsley, 2005, Geologic setting, stratigraphy, and detailed
velocity structure of the Coyote Creek borehole, Santa Clara Valley, California, in,
M. W. Asten, and D. M. Boore, eds., Blind comparisons of S-wave velocities at
closely spaced sites in San Jose, California: U.S. Geological Survey Open-File
Report 2005-1169.
Wiggins, R. A., 1972, The general linear inverse problem: implication of surface waves and
free oscillations for earth structure: Reviews of Geophysics and Space Physics 10,
251-285.
Xia, J., R. D. Miller, C. B. Park, and G. Tian, 2003, Inversion of high frequency surface
waves with fundamental and higher modes: Journal of Applied Geophysics 52, 4557.
Xia, J., R. D. Miller, and C. B. Park, 1999, Estimation of near-surface S-wave velocity by
inversion of Rayleigh waves: Geophysics 64, 691–700.
161
Yamanaka, H., M. Dravinski, and H. Kagami, 1993, Continuous measurements of
microtremors on sediments and basement in Los Angeles, California: Bulletin of the
Seismological Society of America 83, 1595-1609.
Yamanaka, H., M. Takemura, H. Ishida, and M. Niwa, 1994, Characteristics of long-period
microtremors and their applicability in exploration of deep sedimentary layers:
Bulletin of the Seismological Society of America 84, 1831-1841.
Yamanaka, H., and H. Ishida, 1996, Application of genetic algorithms to an inversion of
surface-wave dispersion data: Bulletin of the Seismological Society of America 86,
436-444.
Yoon, S. and G. J. Rix, 2005, Active and passive surface wave measurements at the
William Street Park site, using f-k methods, Blind comparisons of shear-wave
velocities at closely-spaced sites in San Jose, California, M. W. Asten and D. M.
Boore (Editors), U. S. Geological Survey Open-File Report 20051169, part 2, 15 pp.
[available on the World Wide Web at http://pubs.usgs.gov/of/2005/1169/].
Zeng, Y., and J. G. Anderson, 1995, A method for direct computation of the differential
seismograms with respect to the velocity change in a layered elastic solid: Bulletin
of the Seismological Society of America 85, 300-307.
162
APPENDIX A: Matrices for Rayleigh waves
Rayleigh waves consist of P and SV-waves. We give the explicit expression of
the layer matrices, the inverse layer matrices, and downgoing- and upgoing-waves
matrices (equation (2.31) ) for Rayleigh waves here. The expression is derived by Luco
and Apsel (1983) and Hisada (1994). The explicit expression clearly shows that equation
(2.31) is completely free from the exponential growth term with depth.
The motion-stress vectors of the jth layer are expressed by the downgoing and
upgoing P and SV waves in the following matrix form (the same as equation (2.31) ):
0  Cdj 
 D j ( z )   E11j E12j   Λ dj ( z )
j
j
j
=
 j =E Λ C
Sj z   j
j 
j
C
(
)
0
(
)
E
E
Λ
z

  21
u
22  
 u
( A1)
where
 D pj ( z ) 
D ( z ) =  j  ( A2)
 Ds ( z ) 
j
are the motion/displacement vectors for P- and SV-waves at depth z.
S j ( z)
S j ( z ) =  pj  ( A3)
 Ss ( z)
are the stress/traction vectors for P- and SV-waves at depth z.
E
Ej = 
E
j
11
j
21
 −1

j
E12   −γ 1 j
=
E22j   2µ jγ j
 j j
 χ 1 µ
ν1j
1
− χ1 j µ j
−2µ jν j


−1 
( A4)
χ1 j µ j 

−2µ jν j 
ν1j
−1
γ 1j
−2 µ j γ j
χ1 j µ j
is the jth layer matrix, representing the effects of the elastic properties and the vertical
wavenumber in the jth layer.
 e −γ ( z − z
Λ ( z) = 

0
j
j
d
j −1
)


j
j −1
e −ν ( z − z ) 
0
 e −γ ( z
Λ ( z) = 
 0
j
j
u
j
−z)
0
e−ν
j
(z
j


−z)

( A5)
163
are the phase delay matrices for downgoing and upgoing P and SV-waves.
Cdpj 
C = j 
 Cds 
j
d
Cupj 
C = j 
 Cus 
j
u
( A6)
are the amplitude vector matrices for downgoing and upgoing P and SV-waves.
ν j = k 2 − (ω β j ) 2 with Re {ν j } ≥ 0
ν1j = ν j k
γ j = k 2 − (ω α j )2 with Re {γ
γ 1j = γ
j
j
}≥0
k
χ1 j = 2k − (ω β j )2 k
( A7)
where k is wavenumber, ω is frequency, βj is shear velocity of the jth layer, αj is
compressional velocity of the jth layer.
Analytical solution of the 4X4 layer matrix E in equation A4 is
 E11j
 j
 E21

j
 −2k µ

 µ j χ1 j
−
1
−
E12j 
1
ν1j

=

µ j (4k − 2 χ1 j ) 
E22j 
j
 −2k µ

 µ j χ1 j
−
 ν1j
µ j χ1 j
γ 1j
1
γ 1j
2k µ j
1
µ j χ1 j
−
γ 1j
1
− j
γ1
−2 k µ j
−1

−1 

1 
− j
ν1 

−1 

1 
− j
ν1 
( A8)
164
APPENDIX B: Matrices for Love waves
Love waves consist of SH-waves only. The motion-stress vectors of the jth layer
are expressed by the downgoing and upgoing SH waves in the following matrix form (the
same as equation (2.45) ):
 D j ( z )   E11j
S j z  =  j
 ( )   E21
0  Cdj 
E12j   Λ dj ( z )
j
j
j
 j = E Λ C
j 
j
Λ u ( z )  Cu 
E22   0
( B1)
Where D j ( z ) , S j ( z ) , Cdj and Cuj have the same physical meaning as these in Appendix
A but only for SH-waves.
 E11j
 j
 E21
E12j   1
=
E22j   − µ jν j
1 
( B 2)
µ jν j 
and
 Λ dj ( z )
0  e −ν ( z − z

=
Λ uj ( z )  
0
 0
j
j −1
)


j
j
e−ν ( z − z ) 
0
( B3)
The explicit solutions for generalized reflection/transmission coefficients are
given as
Tˆdj =
Λ dj ( z j )
a + bΛ uj +1 ( z j ) Rˆ duj +1
Rˆ duj = (b + aΛ uj +1 ( z j ) Rˆ duj +1 )Tˆdj
( B 4a )
( B 4b)
where
a = 1 2 + µ j +1ν j +1 2 µ jν j
( B5a )
b = 1 2 − µ j +1ν j +1 2 µ jν j
( B5b)
165