Download Wave equation inversion of skeletalized geophysical data

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Earthscope wikipedia , lookup

Surface wave inversion wikipedia , lookup

Transcript
Geophys. J . Inf. (1991) 105,289-294
Wave equation inversion of skeletalized geophysical data
Yi Luo and Gerard T. Schuster
Department of Geology and Geophysics, University of Utah, Salt Lake City, UT 84112, USA
Accepted 1990 September 19. Received 1990 July 27; in original form 1990 March 16
SUMMARY
Inversion methods based on gradient optimization techniques require the directional
derivative of the data with respect to the model parameters. Unfortunately, the data
(e.g., pressure seismograms) are usually restricted to that explicitly given in the
fundamental governing equation (e.g., wave equation). This limited choice of data
type may lead to misfit functions that are pathologically non-linear with respect to
the model parameters. We present a methodology which allows for the calculation
of directional derivatives for skeletalized data sets, yet still uses the fundamental
governing equations without the need for approximations. Skeletalized data are
defined as a reduced data set derived from the original data which retains the
important information about the model parameter of interest. The motivation for
working with skeletalized data rather than raw data is that the skeleton data may be
strongly influenced by only one type of model parameter and lead to a quasi-linear
misfit function.
Examples of skeletalized data sets include first arrival traveltimes picked from
CDP seismograms for velocity inversion, amplitudes of transmitted earthquake
SH-waves for earthquake moment inversion, or pulse width measured from first
arrivals for attenuation inversion. As a n example we devise a traveltime inversion
method based on the wave equation and free of any high-frequency approximations.
Results show that wave equation traveltime (WT) inversion is superior to ray traced
traveltime inversion for complicated velocity models. It is also shown that WT
inversion converges quickly for starting velocity models that are far from the actual
model.
Key words: seismic, skeletal, tomography, traveltime, wave equation inversion.
INTRODUCTION
Inversion methods are usually limited to those in which
there exists a fundamental governing equation which
explicitly relates the fundamental model parameters, m, to
the fundamental data deal, i.e.,
(la)
A@, d C d= 0 ,
where A is an operator and dca,can be considered a function
of m. An example might be to assign the homogeneous
wave equation as A , the calculated pressure seismograms as
deal, and acoustic velocities as m.
The optimization problem can be defined as finding
the optimal model m* in R"' which minimizes the misfit
function
E
E(m) = 1/2(dcal(m) - dabs, dcal(m) - dabs)
9
(Ib)
assume that d belongs to R" and that ( d , d ) represents the
squared Euclidean norm of d. A popular optimization
procedure is a gradient technique, which searches for m* in
directions parallel to the gradient vector
-6E(m)6mi
(y
, dcal(m)-
dabs),
where Sdca,(m)/Gmiis the directional derivative of d along
the ith component of m and dca,(m)-dobs is the data
residual vector. Assuming A is linear with respect to the
data and perturbing the fundamental governing equation
with respect to model perturbations
Sm,
dcal(m)+ A(m) 6dcal(m) - 0,
6mi
~
we can derive the directional derivative operator
where dobs is the observed data and ( ) denotes the inner
product. For convenience, the following discussion will
289
290
Y. Luo and G. T. Schuster
a
Sciamogramc for 20% Velocity C o n t r a i t Modal
I
I00
80
b
60
40
20
0
200
0
600
400
(ms)
Time
C
q
,
5
0
2
2
I3
I
,
I
40
0
20
Percentage of Velocity Contrast
Figure 1. (a) Acoustic cylinder velocity model with a cylinder
diameter equal to about five source wavelengths. The homogeneous
background velocity is 2.5 km s-' which is also that for the starting
models for all W T inversions. (b) Pressure seismograms for a source
on one edge and receivers on the opposite edge. (c) Misfit functions
for transmission traveltimes (solid line) and pressure seismograms
(dashed lines) as a function of velocity change in the Fig. l(a)
cylinder. The fundamental and skeletal data are, respectively,
pressure seismograms and traveltimes of first arrivals.
Examples of geophysical inversion which employ
equations similar to equations (1) include; (a) full wave
seismic inversion (Tarantola 1987; Johnson & Tracy 1983)
which assigns the wave equation as A to relate the, say
pressure field, to the unknown velocity parameters, (b)
standard traveltime inversion which associates the traveltime
integral with A to relate the observed body wave traveltimes
(Dines & Lytle 1979; Lines 1988) to the velocities, and (c)
electromagnetic inversion methods which assign Maxwell's
equations as A to relate the conductivity parameters to the
electric or magnetic field data (Nabighian 1987).
A problem with the above approach is that the misfit
function in equation (lb) can be a highly non-linear function
of several types of model parameters, such as velocities and
densities. An example is given in Fig. l(c) (dashed line)
which depicts the logarithm of the misfit function E of
pressure seismograms (Fig. lb) as a function of cylinder
velocity contrast for receivers and sources surrounding an
acoustic cylinder model (Fig. la). In this case, the pressure
seismograms deal are calculated for a homogeneous model,
subtracted from the actual seismograms for a cylinder
model, and the normed difference is given by E. Fig. l(c)
suggests that the seismogram misfit function E is non-linear
for moderate ( > l o per cent) to large perturbations in the
cylinder velocity (Gauthier, Virieux & Tarantola 1986). In
this case a gradient method may tend to get stuck in local
minima if the starting model is moderately far from the
actual model. Unlike traveltime data, the seismogram
amplitudes are strongly dependent on density variations.
Are there reduced or skeletal features in the observed
data which strongly depend on only one parameter type and
might also yield a more linear misfit function? If these
skeletal data, $, are extant why not use them to form the
misfit function? The benefit might be a quasi-linear (rather
than non-linear) misfit function which is decoupled from all
but one parameter type. Unfortunately, there is usually no
fundamental governing equation, such as the wave equation,
which relates these skeletal data
to a single type of model
parameter. This means that the directional derivative
S 4 1 S r n i in equation (Id) cannot be easily calculated.
An example of skeletalization is from traveltime
tomography where first arrival times are picked or
skeletalized from a common shot point (CSP) gather. This is
beneficial because a traveltime misfit function is decoupled
from densities and depends on velocities in a quasi-linear
manner (e.g., solid line in Fig. lc). The penalty, however, in
ray-based traveltime inversion is a high-frequency approximation to the data, which can restrict its utility.
This paper describes a methodology for constructing
directional derivatives 6d,,/Smi of important features or
skeletalized features of the data with respect to the most
influential parameters of the model, without using any
approximations (except for iterative linearization). The key
idea is that a Connective functional (or equation) is
constructed which relates the important or skeletal data, $,
to the fundamental data d,
F ( 4 , d) = F ( 4 , d(m)) = 0,
(le)
which will define d,, as a function of m. The functional F
might be constructed with a statistically robust pattern
matching operator which matches the observed data with
the calculated data (e.g., cross-correlation). For example, if
d represents a calculated seismogram, then d,, can be the lag
time which maximizes the cross-correlation F(d,,, d(m))
between the observed and synthetic seismogram. In this
case, 4 can also be interpreted as the traveltime difference
between the first arrivals of the observed and synthetic
seismograms.
From equation (le) the directional derivative of the
skeletalized data Sd,,lSmi can be calculated using the rule
for an implicit function derivative (Gillespie 1960)
(If)
where 6d/Srni in the numerator can be computed from the
fundamental governing equation (equation Id) and the
denominator 6 F ' / 6 d , can be computed from the connective
equation (le). For traveltime inversion, this means that the
directional derivative of traveltimes with respect to velocity
perturbations can be computed using the wave equation; no
high-frequency approximations are needed. This methodology can also be applied to other geophysical data types as
Inversion of skeletal data
long as equations (Id), (le) and (If) can be formulated, and
the conditions for the implicit function theorem are satisfied
(Gillespie 1960). This paper will illustrate this new
methodology in the context of wave equation traveltime
(WT) inversion.
291
where fi = 3p(x, t ; x , ) / d t . Equation (3) serves as the
connective function in equation (le) which connects the
fundamental data (pressure seismograms) with the skeletal
data (traveltime residuals).
Misfit function
THEORY
In this section the new inversion methodology will be
applied in inverting traveltime data with the wave equation,
henceforth designated as W T inversion. The key steps are to
(1) define a connective function (equation l e ) which defines
the traveltime residual (skeletal data) as a function of the
pressure seismograms (fundamental data). This step allows
for the derivation of the directional derivative (equation If),
(2) define a misfit function (hopefully, quasi-linear) of
traveltime residuals [squared normed difference between
observed and synthetic transmission traveltimes, equation
(lb)] which mainly depends on velocity, not density,
parameters, (3) determine the gradient of the misfit function
for use in a gradient optimization algorithm.
The following analysis assumes that the propagation of
seismic waves honours the 2-D acoustic wave equation
The WT method attempts to determine the velocity model
C ( X ) which predicts seismograms p ( x , , t ; xs)calthat minimize
the following traveltime misfit function:
(4)
where the summation is over source and receiver locations,
At is defined by equation (2b) and the factor 112 is
introduced for subsequent simplifications. This criterion, of
course, can be generalized to take into account the
estimated observation errors or a priori information in
model space.
Misfit gradient
A gradient method can be used to solve equation (4)for the
c ( x ) which minimizes S. For simplification we will discuss
(24
where p ( x , , t; x , ) , ~ ~denotes the observed pressure seismograms at receiver location x , due to a line source excited at
time t = 0 and at location x s , p(x,) is the density, s ( t ; x,) is
the source function and c(x,) is the material velocity.
Connective hrnction
The degree in which the synthetic and observed
seismograms match each other can be estimated by the
cross-correlation function:
the steepest descent method. To update the velocity model,
the steepest descent method gives
c(x)k+I
= -C@)k
+4X)kY(X)k,
(5)
where y(x), is along the gradient direction of the misfit
is a step length and k denotes the kth
function S,
iteration. The central problem is how to get y ( x ) based on
the wave equation. To obtain y ( x ) , construct the directional
derivative of S with respect to the velocity model c ( x ) :
where we have replaced the deltas with partial differentials.
Using equation (3) and the rule for an implicit function
derivative we get
where A ( ~ , ; x , ) is, ~the
~ maximum absolute amplitude of
p ( x , , t;x,)obs and t is the time shift between synthetic and
real seismograms. The divisor A@,;x , ) , ~ ~normalizes the
observed seismograms to a maximum amplitude of 1.
We seek a t in which a synthetic seismogram must be
shifted so that it 'best' matches the observed seismogram.
The criteria for 'best' match is defined as the traveltime
residual A t which maximizes the cross-correlation function,
f(xr, t;xS), i.e.,
f ( x , , A t ; x,) = max
W,, t;x,)
1 t E [-T,
TI),
(2b)
where T is the estimated maximum traveltime difference
between the observed and calculated seismograms. It is easy
to see that the derivative of f(x,, q x , ) with respect to t
should be zero at At unless its maximum is at an endpoint
At=TorAt=-T:
(3)
where, g(x, t; x ' , t') is the Green's function for equation
(2a); that is, the pressure field at point x and time t due to
the impulse line source 6 ( x - x') 6(r - r'). The symbol *
represents time convolution. Substituting equation (7c) into
292
Y.Luo and G. T. Schuster
equation (7b),
5%
- 2600
%
>
a f t e r 5 iterations
pyq
2400
I00
2 00
2400
u o
100
200
2000
- o
I00
200
10%
after 5 iterotions
20%
ofter 7 iterations
40%
one can rewrite equation (8a) as
(9)
where
p ’ ( x , t ; x,) =
C g ( x , -2;
r
x,, 0) * 6 t ( x , , t ; xs)t
and p ( x , t ; ~ , is) the
~ ~ pressure
~
field calculated (using a
finite difference method) for the current velocity model c ( x )
and p ’ ( x , t ; x , ) is the field computed by reverse time
propagation of the quasi residual 6t(x,, t ; x , ) acting as a
source at receiver location x,. This result is similar to that of
full wave inversion except 6t is used instead of Sp. In full
wave inversion Sp is defined as
SP =P(xr,
t ; ~ s ) o b s - ~ ( ~ t;xs)cal,
r,
(10)
where the subscripts ‘obs’ and ‘cal’ denote, respectively,
o f t e r 10 i t e r a t i o n s
Figure 2. Velocity models reconstructed by the WT method (using
transmission traveltimes) for different velocity perturbations in the
Fig. l(a) cylinder. The percentages indicate the per cent velocity
contrast of the cylinder from the background velocity. The left
column of figures depict the final reconstructed velocity model, and
the right column of figures depict the reconstructed velocity profile
(dashed lines) along a vertical cross-section through the model’s
centre.
Cylinder model
Gauthier et al. (1986) presented a cylinder model (Fig. l a )
composed of a cylindrical velocity perturbation five
wavelengths in diameter superimposed on a homogeneous
medium with velocity 2500ms-’. The starting model is
assumed to be a homogeneous medium with velocity
2500 m s-l, the dominant source frequency is 25 Hz, and
there are a total of 380 receivers and eight sources equally
observed and calculated pressure seismograms. Combining
spaced along the model edges.
equations (9) and (5) yields an iterative method to invert for
a velocity model c ( x ) from traveltime residuals. It can be
shown that in the high-frequency limit (Luo & Schuster
1991) equation (9) reduces to that of ray traced traveltime
tomography.
Gauthier et al. (1986) show that the full wave inversion
method will fail for cylinder velocity perturbations larger
than 10 per cent. Fig. 2 shows the success of WT inversion
for velocity cylinder perturbations up to 40 per cent. The
reason for this success is suggested in Fig. l(c) which
demonstrates that the normed traveltime misfit function has
a quasi-linear behaviour with respect to increasing velocity
contrast. On the other hand, the normed pressure
seismogram misfit function will be non-linear (Jannane et al.
1989).
NUMERICAL EXAMPLES
The WT method is tested for three different models; a
cylinder model, the Langan velocity model (Langan et al.
1988) and a fault model having steam injection. The cylinder
model is used to verify that WT inversion is more robust
than full wave inversion for incorrect starting models. The
Langan velocity model and the fault model are used to show
that WT inversion succeeds when ray tracing fails.
Langan velocity model
Figure 3(b) shows a sonic log velocity profile from Kern
River, California (Langan et al. 1988) in which ray tracing
(subject to a high-frequency approximation) fails to
Figure 4. (a) Earth velocity models, with the pre-steam injection model vpre(x,z ) at the left, the post-steam injection model vFst(x, z ) in the
middle, and the subtracted vposr(x,z ) - vp&, z) at the right. The steam injection lowers the velocity by 10 per cent. Coldest (warmest)
colour represents a velocity of 2300 m s-l (4000 m s-l) and the wells are 90 m apart and 200 m deep. (b). Reconstructed velocities using the
wave equation traveltime method are given directly below each model in Fig. 4(a).
"
01
-
-
I
5 200
a
D
True Velocity Model
30001
I
11
Inversion of skeletal data
I
2 2000
-
>
:
4ooo
o
o
o
l
100
200
300
Horizontal Distance (m)
0
200
Depth (m)
400
b
0
Cross-Well Seismograms f o r Longon Velocity Model
293
compute the first arrival times for energy travelling from the
source well to the receiver well. The observed data are
taken to be first arrival traveltimes from the synthetic
seismograms (Fig. 3c) computed by a finite difference
method (200 by 150 point grid and 700 time
steps/seismogram) for the acquisition system shown in Fig.
3(a) and the I-D velocity profile in Fig. 3(b). Eight sources
are equally distributed in the source well, and 83 receivers
with a 4.8m spacing are located in the receiver well. Figs
3(e) and (f) show the successful velocity reconstructions
obtained by the WT inversion method using a steepest
descent method. Each iteration consumed less than 10 CPU
min on a Stardent 2000 computer.
Some practical details that expedited rapid convergence
included application of a window function which suppresses
all energy but the transmitted arrivals. This mitigates
problems with multiple stationary points associated with the
cross-correlation function in equation (3).
Faulted steam injection model
A fault model is shown in Fig. (4a) and the crosswell
I00
200
Time (ms)
geometry consists of 21 sources evenly distributed in the
source well and 32 receivers distributed in the receiver well.
The leftmost model shows the pre-steam injection model,
the middle model is the post-steam injection model (steam
with 10 per cent velocity contrast injected at left well) and
the rightmost model represents the difference between these
two models. The dominant source frequency of the Ricker
wavelet is 80 Hz and the reconstructed velocity model after
20 iterations is given in Fig. (4b). The rightmost figure
shows that the steam injection zone is faithfully
reconstructed.
300
C
A f t e r 10 i t e r a t i o n s
I n i t i a l Velocity Model
3000 I
1000
I
0
200
3000 I
400
200
0
Depth ( m )
Oepth ( m )
d
e
400
A f t e r 20 I t e r a t i o n s
CONCLUSION
A methodology is described which shows how to compute
directional derivatives of skeletalized data (data types not
explicitly contained in the fundamental governing equations)
with respect to the model perturbations. This is achieved by
forming a connective equation which connects the skeletal
data with the fundamental data and using the rule for an
implicit function derivative. The connective equation can be
constructed via some pattern matching operator, such as
cross-correlation. No approximations are employed, and the
benefits include an enhanced ability to work with
skeletalized data sets and noisy data, and the capability to
match the important parts of the data (e.g., traveltimes) that
are mainly influenced by one model parameter (e.g.,
velocities).
This methodology is applied to develop a new seismic
inversion method which reconstructs velocities from
traveltimes based on the wave equation. No high-frequency
approximations are needed, velocities are decoupled from
densities, and the computer time is about the same as full
wave inversion. Compared to a ray based inversion code,
the WT computer code is much more general, and demands
less effort to write. For complicated velocity structures, the
CPU time required by the WT method is competitive with
that of our ray trace inversion code. Synthetic tests show
that successful reconstructions can be achieved with model
velocity contrasts greater than 2:l. This is an improvement
7
-.3000
2600
"7
E
2200
21
c
-
.-
5
I800
>
1400
1000
0
100
200
Oepth ( m )
300
4 00
f
(a) 2-D acquisition cross-well geometry for the Langan
acoustic velocity model. (b) 1-D Langan velocity profile. (c)
Acoustic seismograms for a source at depth 300 m in the source well
with offset 50 m, and receivers in the receiver well with offset 250 m.
Due to shadow zones and waveguide effects, ray tracing cannot
correctly compute all of the first arrival times at the receiver well.
(d) Initial velocity model (dashed line) employed by the WT
method. (e-f). The velocity model reconstructed by the WT
method (dashed line) compared to the true velocity model (solid
line).
Figure 3.
294
Y. Luo and G. T. Schuster
over full wave inversion which can fail for similar models
with little more than 10 per cent velocity contrast and over
ray based traveltime tomography when the velocity
distribution is too complex for the use of ray theory. In
addition, traveltime picking or event identification may
sometimes not be necessary, and diffraction, reflection and
refraction traveltimes can be incorporated into the inversion
process. This procedure is naturally suited for hybrid
inversion schemes (Luo & Schuster 1990).
Possible applications of this methodology include the
following.
(1) Radar tomography where the fundamental data are
the time-domain radargrams, the model parameter is the
electromagnetic propagation velocity distribution and the
skeletal data might be the time lag between the observed
and computed first arrivals. The connective functional is the
time cross-correlation between the observed and computed
time-domain radargrams. In addition, wavelet broadening
or amplitude might serve as skeletal data for conductivity
inversion in both EM and radar inversion.
(2) Surface wave inversion in the slowness-frequency
domain where the fundamental data consist of the
transformed seismograms s ( p , o),the model parameter is
the shear velocity distribution and the skeletal data are the
slowness lags between the observed and computed values of
the dispersion curve for a specified mode. The connective
equation is the slowness cross-correlation between the
observed and computed dispersion curves.
(3) Post-critical reflection inversion in the t-p domain
where the fundamental data consist of the transformed
seismograms s ( t , p ) , the model parameter is the compressional velocity distribution and the skeletal data are the
slowness lags between the observed and computed values of
the post-critical reflection envelope in the t - p domain. The
connective equation is the slowness cross-correlation
between the observed and computed post-critical reflection
envelopes.
ACKNOWLEDGMENTS
We thank the Gas Research Institute for sponsoring this
research under contract 5089-215-1872. Neither GRI,
members of GRI, nor any person acting on behalf of GRI
assumes any liability in the use of information in this report.
We are also grateful for the support of the 1989 University
of Utah tomography consortium members; Amoco, Arco,
British Petroleum, Conoco, GRI, Marathon, Mobil,
Phillips, and Texaco. The authors are very grateful to
Robert G. Keys for his comments and insights about
directional derivatives.
REFERENCES
Dines, K. & Lytle, R., 1979. Computerized geophysical
tomography, Proc. IEEE, 67, 1065-1072.
Gauthier, O., Virieux, J. & Tarantola, A., 1986. Two-dimensional
nonlinear inversion of seismic waveforms: numerical results,
Geophysics, 51, 1387-1403.
Gillespie, R. P., 1960. Parrial Differentiation, Interscience
Publishers, New York.
Jannane, M. et al., 1989. Wavelengths of earth structure that can be
resolved from seismic reflection data, Geophysics, 54, 906-910.
Johnson, S. & Tracy, M.L., 1983. Inverse scattering solutions by a
sinc basis, multiple source, moment method; part 1, Theory,
Ultrasonic Imaging, 5, 361-315.
Langan, R., Paulsson, B., Stefani, J., Finnstom, E. & Fairborn, J.,
1988. Cross-Well Seismology: A New Production Tool,
presented at the Joint SEG/CPS Meeting at the Daqing oil
field, 1988 September 6-10,
Lines, L. (ed.), 1988. Inversion of Geophysical Data, No. 9, SEG
Publications, Tulsa, OK.
Luo, Y. & Schuster, G. T., 1990. Wave equation traveltime and
waveform inversion, Expanded Abstracts of 1990 Technical
Program of SEG Annual Meeting, pp. 1223-1225.
Luo, Y. & Schuster, G. T., 1991. Wave equation traveltime
inversion, Geophysics, in press.
Nabighian, M. (ed.), 1987. Electromagnetic Methods in Applied
Geophysics, Vol. 1, SEG Publications, Tulsa, OK.
Tarantola, A,, 1987. Inverse Problem Theory, Elsevier, New York.