Download as a PDF

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Ear & Hearing:Volume 20(1)February 1999pp 33-44
Cortical, Auditory, Event-Related Potentials in Response to Periodic and
Aperiodic Stimuli with the Same Spectral Envelope
Martin, Brett A.; Boothroyd, Arthur
Program in Speech and Hearing Sciences (B.A.M., A.B.), Graduate Center, City
University of New York, New York, New York; and Department of SpeechLanguage-Hearing Sciences (B.A.M.), Hofstra University, Hempstead, New York.
Address for correspondence: Brett A. Martin, Ph.D., Program in Speech and
Hearing Sciences, Graduate Center, City University of New York, 33 West 42
Street, New York, NY 10036.
Received March 17, 1998; accepted September 28, 1998
Abstract
Objective: 1) To determine whether the N1-P2 acoustic change complex is
elicited by a change of periodicity in the middle of an ongoing stimulus, in the
absence of changes of spectral envelope or rms intensity. 2) To compare the
N1-P2 acoustic change complex with the mismatch negativity elicited by the
same stimuli in terms of amplitude and signal to noise ratio.
Design: The signals used in this study were a tonal complex and a band of
noise having the same spectral envelope and rms intensity. For elicitation of
the acoustic change complex, the signals were concatenated to produce two
stimuli that changed in the middle (noise-tone, tone-noise). Two control stimuli
were created by concatenating two copies of the noise and two copies of the
tone (noise-only, tone-only). The stimuli were presented using an onset-toonset interstimulus interval of 3 sec. For elicitation of the mismatch negativity,
the tonal complex and noise band stimuli were presented using an oddball
paradigm (deviant probability = 0.14) with an onset-to-onset interstimulus
interval of 600 msec. The stimuli were presented via headphones at 80 dB
SPL to 10 adults with normal hearing. Subjects watched a silent video during
testing.
Results: The responses to the noise-only and tone-only stimuli showed a
clear N1-P2 complex to the onset of stimulation followed by a sustained
potential that continued until the offset of stimulation. The noise-tone and tonenoise stimuli elicited an additional N1-P2 acoustic change complex in
response to the change in periodicity occurring in the middle. The acoustic
change complex was larger for the tone-noise stimulus than for the noise-tone
stimulus. A clear mismatch negativity was elicited by both the noise band and
tonal complex stimuli. In contrast to the acoustic change complex, there was
no significant difference in amplitude across the two stimuli. The acoustic
change complex was a more sensitive index of peripheral discrimination
capacity than the mismatch negativity, primarily because its average
amplitude was 2.5 times as large.
Conclusions: These findings indicate that both the acoustic change complex
and the mismatch negativity are sensitive indexes of the neural processing of
changes in periodicity, though the acoustic change complex has an advantage
in terms of amplitude. The results support the possible utility of the acoustic
change complex as a clinical tool in the assessment of peripheral speech
perception capacity.
Introduction
The general purpose of the project, of which the present study forms part, is to
develop electrophysiological tests of auditory speech discrimination capacity.
Such tests would contribute to the objective evaluation of subjects who, for
reasons of age and hearing loss, lack the auditory, linguistic, and/or cognitive
prerequisites for behavioral speech perception tests (Boothroyd, 1991; Tyler,
1993). Considerable success has been reported from the use of cochlear
implants in children who have profound, prelingually acquired, deafness
(Geers & Moog, 1994). As a result, there is increasing pressure to implant
children with lesser degrees of hearing loss. This pressure creates an acute
need for reliable, objective tests of auditory speech perception capacity that
remain valid when used with very young, prephonological, hearing-impaired
children. The availability of such tests would help reduce the possibility that
children with severe hearing losses will be given implants that provide no more
auditory capacity than could have been obtained with properly fitted hearing
aids.
Most of the previous work on this topic has involved mismatch negativity
(Näätänen, Gaillard, & Mäntysalo, 1978). Mismatch negativity is a negativegoing, cortical, event-related potential that occurs in response to an occasional
deviant acoustic stimulus in a train of repeated standard stimuli. It occurs even
when the subject is paying no attention to the stimuli and is believed to reflect
registration of a difference between the deviant stimulus and the memory trace
that has been developed by the standard stimuli-hence its potential use as an
index of peripheral auditory resolution (see Näätänen, 1992; Näätänen &
Kraus, 1995, for a review). Mismatch negativity has been shown to be elicited
by small changes in stimulus frequency, intensity, duration, and spatial
location (Näätänen, Paavilainen, Alho, Reinikainen, & Sams, 1987;
Näätänen, Paavilainen, & Reinikainen, 1989; Paavilainen, Karlsson,
Reinikainen, & Näätänen, 1989; Sams, Paavilainen, Alho, & Näätänen,
1985). It is also sensitive to changes in the spectral and temporal
microstructure of complex stimuli (Näätänen, Schröeger, Karakas,
Tervaniemi, & Paavilainen, 1993; Schröeger, 1994; Schröeger, Näätänen,
& Paavilainen, 1992). Mismatch negativity can, therefore, be used to
demonstrate the capacity for discrimination among speech sounds, both
synthetic (Aaltonen, Paavilainen, Sams, & Näätänen, 1992; Kraus, McGee,
Sharma, Carrell, & Nicol, 1993; Martin, Kurtzberg, & Stapells, Reference
Note 1; Sams, Aulanko, Aaltonen, & Näätänen, 1990). and natural
(Sandridge & Boothroyd, 1996).
There are, however, potential limitations to the use of mismatch negativity as a
clinical tool in a pediatric population. The amplitude of mismatch negativity is
low in relation to the background EEG activity (Picton, 1995), which, in young
children, is likely to be higher than in adults. In addition, there is a practical
limit to the amount of response averaging that can be done to increase signal
to noise ratio. This limit occurs for two reasons. First, there is a limit to the time
that a young child will remain immobile and cooperative. Second, the number
of deviant stimuli can only be a small proportion of the total, which limits the
amount of signal averaging that can be done to improve signal to noise ratio
for the deviant response. Even in cooperative adults (Ritter, personal
communication) and children (Kurtzberg, Vaughan, Kreuzer, & Fliegler,
1995) with normal hearing the mismatch negativity potential cannot always be
reliably demonstrated. For these reasons, mismatch negativity may not be the
most appropriate tool for clinical evaluation of auditory speech discrimination
capacity in young children with hearing loss.
There is, however, another event-related potential that might serve in this
application, namely the N1-P2 complex that occurs in response to acoustic
change during an ongoing stimulus. The N1-P2 complex is best known as an
onset response (Hillyard & Picton, 1978; Näätänen, 1992; Näätänen &
Picton, 1987; Onishi & Davis, 1968; Pantev, Eulitz, Hampton, Ross, &
Roberts, 1996). It is affected by changes of stimulus parameters. For
example, the amplitude is greater, and the response occurs earlier, as
stimulus amplitude increases (Davis & Zerlin, 1996; Keidel & Spreng, 1965),
and low-frequency sounds produce larger responses than do high-frequency
sounds of the same intensity (Antinoro & Skinner, 1968; Picton, Woods, &
Proulx, 1978). Unfortunately, these differences are not reliable or sensitive
enough to serve as an index of peripheral auditory resolution.
It has been shown, however, that the N1-P2 complex can be elicited by
changes in an ongoing acoustic stimulus. We refer to this event-related
potential as the acoustic change complex. It has been shown to occur in
response to intensity and frequency modulations in sustained tones (Clynes,
1969; Jerger & Jerger, 1970; Näätänen & Picton, 1987; Spoor, Timmer, &
Odenthal, 1969; Yingling & Nethercut, 1983). It has also been shown to
occur in response to simple syllables at the transition from consonantal
segment to vocalic segment (Hari, 1991; Imaizumi, Mori, Kiritani, &
Yumoto, 1996; Kaukoranta, Hari, & Lounasmaa, 1987; McGee, Kraus,
King, Nichol, & Carrell, 1996; Ostroff, Martin, & Boothroyd, 1998;
Steinschneider, Arezzo, & Vaughan, 1982). If the auditory cortex produces a
measurable response to an acoustic change during an ongoing stimulus, it
follows that the different acoustic patterns, occurring before and after the
change, are represented by different patterns of excitation that are manifested
at every level of processing from the cochlea through the cortex. Such a
response would index the arrival, at a cortical level, of potentially discriminable
information. The acoustic change complex may provide the desired index of
speech discrimination capacity if it is elicited, assuming that it is elicited by the
types of acoustic change that are involved in phonological contrasts.
We have previously demonstrated that the acoustic change complex is elicited
by the transition from fricative to vowel in the naturally produced syllable "say"
(Ostroff et al., 1998). In this example, however, the acoustic change involved
intensity, spectral envelope, and periodicity. In a subsequent study, we have
found that the acoustic change complex can be produced solely by a change
of spectral envelope. The stimuli consisted of synthetic 3-formant vowels with
a change of either first or second formant frequency at the temporal midpoint
(Ostroff, Martin, & Boothroyd, Reference Note 2).
The primary purpose of the present study was to determine whether the
acoustic change complex is elicited by a change of periodicity only, in the
absence of changes of spectral envelope or intensity. We also wished to
compare the acoustic change complex with mismatch negativity, in terms of
amplitude and sensitivity. It should be noted that "sensitivity" is a general term
that takes into account the signal to noise ratio of the recordings (i.e.,
amplitude of the waveforms, the interstimulus interval, and the number of
sweeps). Additionally, sensitivity is determined by the ability to show
statistically significant potentials that index the encoding of small changes in
periodicity in individual subjects.
Method
Subjects
Ten adults (three men, seven women), aged 23 to 41 yr (mean = 31 yr)
participated. All subjects had normal hearing sensitivity (thresholds ≤25 dB HL
from 250 through 8000 Hz bilaterally) and no history of neurological disorder.
Stimuli
The contrasting signals used in this study were a tonal complex and a band of
noise having the same spectral envelope and rms level. The tonal complex
was created by combining a series of equal-amplitude pure tones whose
frequencies were integral multiples of 100 Hz and passing the result through a
band-pass filter with edge frequencies of 600 and 1400 Hz. The noise band
was created by passing white noise, with a constant level per cycle, through
the same filter. After filtering, the rms intensity levels of the two signals were
equated. Both signals were trimmed to 400 msec. The first and last 10 msec
were then shaped with raised cosine functions to minimize spectral splatter at
onset and offset.
Two stimuli were created for elicitation of the acoustic change complex by
concatenating the tone and noise signals with a 10 msec overlap. Each
stimulus was, therefore 790 msec long. The acoustic change began at 390
msec and lasted for 10 msec. The resulting stimuli will be referred to as tonenoise and noise-tone. In addition, two control stimuli were created by
concatenating two copies of the noise and two copies of the tone. The
resulting stimuli will be referred to as noise-only and tone-only, respectively.*
Figure 1 shows the short-term rms amplitude of the two signals (integrated
over 50 msec), their waveforms, and their spectra.†
Figure 1. The top portion of the figure shows the rms amplitude of the tone
and noise signal integrated over 50 msec. The middle portion shows the
waveform of the tone-noise stimulus. The bottom portion shows the spectra of
the two signals. The spectra are superimposed to show the similarity of their
bandpaths.
The two stimuli were matched for spectral envelope and rms intensity. Their
fine spectra, however, necessarily differ. The aperiodic noise stimulus has a
continuous spectrum, whereas the periodic tonal stimulus has a line spectrum.
Note, also, that although the rms intensity is matched for the two stimuli, there
are small intensity fluctuations for the aperiodic stimulus that are not seen for
the periodic stimulus.
For elicitation of the mismatch negativity, the tone and noise stimuli were
shortened to have durations of 150 msec, with 10 msec raised-cosine shaping
of onsets and offsets.
All stimuli were digitized at 12 bits and 22050 samples per second. After A/D
conversion, they were presented to subjects binaurally, via EAR-3A insert
earphones at levels of 80 dB SPL.
EEG Recordings
Five EEG channels were recorded from surface electrodes placed at Fz, Cz,
Pz, A1 and A2. The EEG channels were referenced to an electrode at the tip
of the nose (Vaughan & Ritter, 1970). A sixth channel to monitor vertical eye
movements and eye blinks was recorded from electrodes placed above and
below the right eye. An electrode at Fpz served as ground. Electrode
impedances were maintained below 5000 Ohms.
The EEG channels were amplified (gain = 20,000 except for the EOG channel
where gain = 5000), and filtered (0.01 to 100 Hz, 6 dB/octave). Single-trial
event-related potential waveforms were baseline corrected (across the entire
sweep duration), an ocular artifact reduction algorithm was applied (Semlitch,
Anderer, Schuster, & Presslich, 1986), and the data were further filtered
(0.1 to 30 Hz, 12 dB/octave). When activity in any channel (except the EOG
channel) exceeded ± 100 µV for a single sweep, data from all channels were
rejected. Additionally, averages were baseline corrected (-100 to 0 msec).
Acoustic Change Protocol
Stimuli were presented with an onset-to-onset interval of 3 sec. They were
presented in 10 blocks, in which each of the four stimuli (tone-noise, noisetone, noise-only, and tone-only) was heard 30 times. Thus, each stimulus was
presented a total of 300 times. Stimulus order in each block was randomized
but with the provision that no single stimulus could occur twice in succession.
Subjects ignored the stimuli and watched a silent, captioned video during
testing.
The amplified EEG signals were digitized at 341 Hz over a 1501 msec (512
point) window, beginning 100 msec before stimulus onset. Responses to the
two experimental and two control stimuli were averaged separately to give four
tracings for each of the 10 subjects. Each individual tracing was based on 300
sweeps (minus the number of artifact rejects). Stimulus presentation time was
15 minutes for each of the four stimuli (excluding breaks).
Mismatch Protocol
Stimuli were presented with an onset-to-onset interval of 600 msec. They were
presented in four blocks, two with tone as standard and noise as deviant and
two with noise as standard and tone as deviant. In each block, the standard
occurred 600 times and the deviant 100 times for a deviant probability of 0.14.
Stimulus presentation order within a run was not randomized, but presentation
order across conditions and subjects were. Subjects were instructed to ignore
the stimuli and watch a silent, captioned video during testing.
The amplified EEG signals were digitized at 890 Hz over a 575 msec (512
point) window beginning 100 msec before stimulus onset. Responses to each
stimulus as both standard and deviant were averaged separately to give four
tracings for each of the 10 subjects. The two standard tracings were each
based on 1200 sweeps (minus the number of artifact rejects). The two deviant
tracings were each based on 200 sweeps (minus the number of artifact
rejects). Total stimulus presentation time was 28 minutes (excluding breaks).
Data Analysis
Response windows were developed using the group mean waveforms to aid
in response identification and measurement in the acoustic change and
mismatch negativity data from individual subjects. The response windows
were ± 50 msec with reference to the latency of the peak (N1, P2, or mismatch
negativity) in the group mean waveforms.
Results
Acoustic Change Complex
The data shown here are for recordings at Cz only-this being the site that
usually gives the largest N1-P2 complex (Vaughan & Ritter, 1970).
Group Means • Figure 2 (top) shows the group mean waveforms obtained in
response to the tone-noise stimulus and to the tone-only control stimulus.
Vertical lines show standard errors of the mean at each sampling point. Note
that these standard errors are derived from the 10 individual average values
and reflect the contributions of both within-and between-subject variance.
Figure 2. The top portion of the figure displays group mean waveforms
obtained in response to the tone-noise (thick line) and to tone-only stimuli
(thin line). The bottom portion of the figure displays group mean
waveforms obtained to the noise-tone (thick line) and tone-only (thin line)
stimuli. Vertical lines indicate the intersubject standard errors of the mean
at each sampling point.
It will be seen that the tone-only stimulus produced a classical pattern with a
clear N1-P2 complex at onset, followed by a sustained negativity, and a return
to baseline after offset (N1-P2 components to the offset of stimulation were
quite variable and will not be addressed in this paper). The tone-noise
stimulus produced the same pattern but with a second N1-P2 complex
occurring after, and presumably in response to, the acoustic change. There
was no evidence of an N1-P2 complex at the midpoint of the response to the
tone-only stimulus.
The onset responses in the group mean waveforms to the tone-noise stimuli
have peak-to-peak amplitudes of 8.0 µV with latencies of 102 msec and 185
msec for N1 and P2, respectively. The peak-to-peak amplitude of the acoustic
change complex is 5.5 µV with latencies (re change onset) of 135 msec and
245 msec, respectively.
Note that the 5.5 µV peak-to-peak amplitude of the acoustic change complex
is some 10 times the standard error of the mean for a single sample. This
response cannot be attributed to random variability either within or between
subjects.
Figure 2 (bottom) shows the group mean waveforms, and standard errors of
the means, obtained in response to the noise-tone stimulus and to the noiseonly control stimulus. The noise-only stimulus produced the N1-P2 complex at
onset, followed by a sustained negativity, and a return to baseline after
stimulus offset. The noise-tone stimulus produced the same pattern but with a
second N1-P2 complex occurring after the acoustic change. This complex was
followed by increased negativity, lasting until stimulus offset. There is no
evidence of an N1-P2 complex at the midpoint of the response to the noiseonly stimulus.
The onset potentials in the responses to the noise-tone stimuli have peak-topeak amplitudes of 7.3 µV with latencies of 99 msec and 190 msec for N1 and
P2, respectively. The peak-to-peak amplitude of the acoustic change complex
is 1.8 µV with latencies (re change onset) of 152 msec and 252 msec,
respectively.
Although the 1.8 µV peak-to-peak amplitude of the acoustic change complex
is smaller than that observed for the tone-noise stimulus, it is still some three
times the standard error of the mean for a single sample. This response
cannot be attributed to random variability.
Scalp Distribution • Figure 3 displays the grand mean waveforms for the
tone-noise and noise tone conditions. The data from Fz, Cz, Pz, A1, and A2
are displayed. Both the N1-P2 complex to stimulus onset and the acoustic
change complex show largest amplitude at Cz, with smaller amplitudes at Fz,
and at Pz. As would be expected, the polarity of the responses inverts at the
earlobe electrode sites (Vaughan & Ritter, 1970).
Figure 3. The grand mean waveforms obtained to the tone-noise stimulus
(top) and noise-tone stimulus (bottom) are shown at electrode sites Fz, Cz,
Pz, A1, and A2.
Individual Data • Figure 4 shows the waveforms obtained from each of the
10 subjects, together with the group mean waveforms from Figures 2 and 3.
The windows show the expected locations of the change responses and are
based on the group data (±50 msec). By visual inspection, all subjects show
an N1-P2 complex at noise and tone onset. Nine of the 10 subjects show a
clear acoustic change complex in response to the tone-noise stimulus, the
exception being subject 8. Six or seven appear to show an identifiable change
complex in response to the noise-tone stimulus.
Figure 4. The waveforms obtained from each of the 10 subjects at Cz to
the acoustic change from tone-noise and noise-tone are displayed
together with the group mean waveforms. The windows show the
expected locations of the acoustic change complex based on the group
mean data (±50 msec).
To obtain a more objective estimate of the presence of a demonstrable
change complex in individuals, the individual peak-to-peak amplitudes, within
the expected response window, were compared with the individual standard
errors of the differences between means at two sampling points. These
standard errors were obtained by first pooling standard error estimates for a
single mean, within the response window, and then multiplying by the square
root of 2. The results are shown in Table 1, together with individual t values,
i.e., the ratios of the N1-P2 difference to the standard error of the difference.
Taking t = 2.0 as a criterion for significance at the 5% level, all subjects show
a change complex for the tone-noise stimulus and seven show one for the
noise-tone stimulus. Moreover, all 10 subjects meet a more stringent criterion
of t = 2.7 (1% significance) for the tone-noise change. Only four subjects meet
this criterion for the noise-tone change.
TABLE 1. Peak-to-peak N1-P2 amplitudes of the acoustic change complex, average
standard errors of the difference between means at two sampling points, number of
accepted sweeps (n), and the ratio between amplitude and standard error (t).
Mismatch Negativity
The data shown here are for recordings at Fz only-this being the site that
usually gives the largest Mismatch (Näätänen, 1992).
Group Means • Figure 5 (top) shows the group mean waveforms obtained in
response to the tone as the standard stimulus and the tone as the deviant
stimulus. Vertical lines show standard errors of the mean at each sampling
point, derived from the 10 individual average values. It will be seen that there
is a clear mismatch negativity with a peak amplitude around 1.4 µV. Peak
latency in relation to stimulus onset is approximately 125 msec. Peak
amplitude of the mismatch negativity is some two times the standard error of
the difference between the two tracings.
Figure 5. The top left panel displays the group mean waveforms obtained
to the tone stimuli presented as standards and as deviants (left). The top
right panel shows difference waveforms obtained by subtracting responses
to stimuli presented as standards from responses to the same stimuli
presented as deviants. Vertical lines show standard errors of the mean at
each sampling point, derived from the 10 individual average values. The
lower panels show the same data for the noise stimuli.
Figure 5 (bottom) shows the group mean waveforms, and standard errors of
the means, obtained in response to the noise as standard stimulus and the
noise as deviant stimulus. As in the top portion of the figure, there is a clear
mismatch negativity. The peak amplitude is around 1.2 µV and the peak
latency is approximately 138 msec. The peak amplitude of the mismatch is 1.5
times the standard error of the difference between the two tracings.
Scalp Distribution • The mismatch negativity is displayed in the grand mean
difference waveforms shown in Figure 6. Data for the noise condition and the
tone condition are displayed at Fz, Cz, Pz, A1 and A2. The mismatch
negativity shows a broad frontocentral scalp distribution, with largest
amplitudes at Fz and Cz. The waveform polarity inverts at the earlobe
electrode sites.
Figure 6. The grand mean difference waveforms from electrode sites Fz, Cz,
Pz, A1, and A2 are shown for the noise stimulus (top) and the tone stimulus
(bottom).
Individual Data • Figure 7 shows the two difference waveforms obtained from
each of the 10 subjects, together with the group mean waveforms from Figure
5. The windows show the expected locations of the mismatch responses and
are based on the group data (±50 msec). By visual inspection, most of the
subjects appear to show an identifiable mismatch for both the noise band and
the tone complex.
Figure 7. Difference waveforms obtained from each of the 10 subjects at Fz in response to
the tone and noise stimuli are displayed together with the group mean waveforms. The
windows show the expected locations of the acoustic change complex based on the group
mean data (±50 msec).
Table 2 shows individual mismatch amplitudes together with standard errors
of the difference between two tracings and the ratio of amplitude to standard
error (t). The standard errors are derived as follows. For each trace, the data
within the expected time window of the mismatch are pooled to give an
average standard error. The standard errors for the standard and deviant
traces then are combined to derive the standard error of a difference- taking
into account the different number of accepted samples for the two. All 10
subjects meet the t = 2.0 criterion (5% significance) for the noise band, and
eight meet it for the tone complex. Eight subjects meet the more stringent t =
2.7 criterion (1% significance) for the noise band, and five subjects meet this
criterion for the tone complex.
TABLE 2. Amplitudes of the mismatch negativity (MMN), average standard
errors of the difference between means at a single sampling point, number of
accepted sweeps (n), and the ratio between the two (t) for two stimuli and 10
subjects.
Comparison of Response Amplitudes.
Individual amplitudes from Tables 1 and 2 were examined using a repeatedmeasures analysis of variance. The two repeated measures were Potential
(Acoustic change or Mismatch), and First stimulus or, in the case of Mismatch,
the only stimulus (Tone or Noise). The results are shown in Table 3. The main
effects of Potential and First stimulus are highly significant, as is the
interaction between the two. Post hoc analysis, using the least-significantdifference test, suggests that the large acoustic change response to the tonenoise stimulus is responsible for all three effects. Its amplitude is significantly
greater than the other three amplitudes (p < 0.0003), which do not differ
significantly from one another (p > 0.05). These findings are illustrated in
Figure 8, which shows the four mean amplitudes, together with their standard
errors. (Note that these standard errors are derived from four separate withincell estimates of variance rather than from the pooled final error term of the
analysis of variance).
TABLE 3. Repeated-measures analysis of variance in the individual amplitude
data of Tables 1 and 2. Independent variables are potential (acoustic change
complex versus mismatch negativity) and first stimulus (tone or noise).
Figure 8. Mean amplitudes for the acoustic change complex and mismatch
negativity are displayed along with standard errors derived from four
separate within-cell estimates of variance. In a analysis of variance, the
amplitude of the acoustic change complex to the tone-noise stimulus was
found to be significantly greater than in the other conditions.
Discussion
These data show that the acoustic change complex can be elicited by a
change from tonal complex to narrow-band noise, or the reverse, in the
absence of a change of rms amplitude or spectral envelope. The tone-only
and noise-only stimuli produced no evidence of an acoustic change complex
at the midpoint. This last finding supports the conclusion that the effects
observed with the tone-noise and noise-tone stimuli were not artifacts of the
concatenate-overlap procedure used to produce the stimuli.
The N1-P2 complex to changes from noiseband to tonal stimuli (and vice
versa) have been documented previously (Kaukoranta et al., 1987; Mäkelä,
Hari, & Leinonen, 1988). However, the previous studies did not control for
spectral envelope and rms amplitude. Additionally, the transition between the
noise and tonal stimuli was not carefully controlled. As a result, it cannot be
determined whether the N1-P2 was elicited by the change in periodicity, or by
the relatively abrupt intensity rise of the second stimulus.
The N1-P2 amplitudes in the change complex were less than those elicited at
stimulus onset. Moreover, the latencies of the N1 and P2 peaks in the group
mean data (around 175 and 250 msec re change onset) were higher than
those observed in the onset complex (around 100 and 190 msec re stimulus
onset). Nevertheless, the patterns are similar, suggesting that the underlying
processes are similar. This last is not a particularly radical suggestion. The
onset response is, itself, a change response, except that the change is from
silence to sound. The potential recorded at the surface of the head reflect a
change in the amount and/or synchrony of neural excitation as the effects of
the change are registered at the level of the auditory cortex (Coles & Rugg,
1996). The presence of similar potentials at the change from tone to noise, or
noise to tone, suggest similar changes in the amount and/or synchrony of
neural excitation.
The acoustic change complex observed in this study probably reflects the
spatio-temporal organization of cortical excitation along a periodic-aperiodic
continuum. This organization may involve the amount of phase-locking
occurring in the auditory pathways (Phillips, 1993, 1995). Other possibilities
should, however, be considered. Although the two stimuli were matched in
terms of gross spectral envelope, the fine structures of their spectra inevitably
differed. Similarly, although the stimuli were matched for overall rms level, the
narrow-band noise inevitably contained short-term amplitude fluctuations that
were absent in the tonal complex. Note, also, that the noise waveform was
"frozen," i.e., the identical waveform was reproduced in every presentation. A
brief amplitude increase soon after the onset of the noise could have
contributed to repeatable changes in the amount of neural excitation. Before
definitive statements can be made about the acoustic and neural origins of the
observed change response, further studies are needed, preferably involving
multiple recording sites to permit the location of primary regions of
synchronous excitation (Scherg & Von Cramon, 1986; Vaughan, Weinberg,
Lehman, & Okada, 1986).
Despite the cautions just raised, there is some support for the notion of
periodic-aperiodic coding at the cortical level in the data reported here. This
support comes from the observation that the group mean response to the
tone-only stimulus produced substantially more sustained negativity (around
2.0 µ V) than did the response to the noise-only stimulus (around 0.3 µV).
Moreover, after the tone-noise transition, the sustained negativity fell.
Similarly, after the noise-tone transition, the sustained negativity rose. These
observations are difficult to reconcile with the notion that the change
complexes are simply the result of amplitude fluctuation in the frozen noise.
A striking feature of the acoustic change findings is their sensitivity to the
direction of the acoustic change. The tone-to-noise transition produced an N1P2 amplitude of around 5 µV whereas the noise-to-tone transition produced
only 2 µV. The reality of this difference is supported by the analysis of
variance in the individual data. One possible explanation is that the noise
stimulus excites a larger cortical volume than does the tone stimulus. That is,
that there are neuronal cells that respond to the noise that were not activated
by the specific frequencies present in the periodic stimulus. If the cells
responding to the tone were completely contained within the volume of cells
responding to the noise, one would expect an onset response from newly
excited cells when the stimulus changes from tone to noise, but no effect
when the change is in the opposite direction. If the two unequal volumes
overlap, but not completely, one might expect the kind of directional sensitivity
observed here. To interpret amplitude differences solely in terms of volume
excitation, however, is an over simplification. Synchrony of firing also
influences amplitude. It is also possible that there is some type of interaction
between the response to the first portion of the stimulus and the response to
the second, such as nonsimultaneous masking or inhibition. Once again, more
detailed mapping of the sources of the change potential will be needed to
determine the exact source of the sensitivity to direction on the acoustic
change.
At first sight, the directional sensitivity observed in the change response does
not appear to occur in the mismatch data. Note, however, that the mismatch
reported here was for tone as standard versus tone as deviant and noise as
standard versus noise as deviant. During data collection, the subject actually
heard either tone as standard and noise as deviant or the reverse. From the
group mean data of Figure 5, it is apparent that the peak amplitude of the
mismatch between tone as standard and noises as deviant was close to zero.
In contrast, the peak amplitude of the mismatch between noise as standard
and tone as deviant was around 2.5 µV. There is, therefore, a directional
sensitivity in the mismatch data. The fact that the pattern is in the opposite
direction to that which would be predicted from the change data presumably
reflects the different processes involved in eliciting the two potentials.
The findings of this study support the conclusion that the acoustic change
complex offers a more sensitive index of peripheral discrimination capacity
than does mismatch negativity in terms of the signal to noise ratio of the
recordings. In the individual data, the average of the two peak-to-peak change
amplitudes was some 2.5 times the average of the two mismatch amplitudes.
A further advantage of the change protocol is that every stimulus contributes
to improving signal to noise ratio.§ In contrast, in the mismatch protocol, only
1/7 of the stimuli contribute to improving signal to noise ratio for the deviant
stimulus and it is this signal to noise ratio that dominates the standard error of
the difference between the standard and deviant traces. This disadvantage for
the mismatch protocol is partially offset by the higher total number of stimulus
presentations made possible by the lower interstimulus interval. It is possible,
however, to use a shorter interstimulus interval in the change protocol. As
shown in the Appendix, these opposing effects tend to cancel when using the
protocols employed in the present study. The difference in amplitude is,
therefore, the primary determinant of the difference in sensitivity, which is
2.5:1 if using the average of the two change complexes and 3.5:1 if using only
the larger of the two.
The amplitude advantage seen for the change protocol did not result in an
advantage in sensitivity over the mismatch protocol in terms of the number of
subjects with statistically significant responses. In fact, the two protocols were
equally sensitive in indexing the ability of individual subjects to encode a
change in periodicity. Specifically, there were 10 subjects with a statistically
significant (p ≤ 0.05) acoustic change component in the tone-noise condition,
and there were seven in the noise-tone condition. For the mismatch negativity,
eight subjects had a statistically significant response for the tone condition and
10 for the noise condition. This finding may be due, in part, to the fact that the
periodicity change was large and easily apparent. Perhaps waveforms elicited
by finer acoustic changes will show more of a difference between the acoustic
change complex and the mismatch negativity in terms of statistical
significance.
It is important to note, however, that any conclusions about relative sensitivity
of the two protocols, based on the present study, apply only to the extreme
periodicity change used here. It remains to be seen whether similar
advantages for the acoustic change complex are present for other acoustic
contrasts. Of particular interest will be the responses to changes of spectral
envelope. Sensitivity to such changes has a major influence on speech
perception capacity.
The present findings support the possible utility of the acoustic change
complex as a clinical tool in the assessment of peripheral speech perception
capacity in young, prephonological, hearing-impaired children. Before further
progress can be made, however, it must be established that the complex can
be elicited in young children despite the known immaturity of their cortical
obligatory potentials. And, if the technique is to be used with pediatric
implantees, the issue of stimulus artifact from the ongoing electrical stimulus
will need to be addressed. Even if the acoustic change complex proves
impractical as a clinical tool in this application, its continued investigation may
contribute to increased understanding of the spatio-temporal organization of
cortical excitation in response to changing sound patterns.
Acknowledgments:
This work was supported by NIH-NIDCD grant #5P50DC00178. Portions of
this paper were presented at the 21st Midwinter Meeting of the Association for
Research in Otolaryngology, St. Petersburg, Florida, February 15, 1998.
References
Aaltonen, O., Paavilainen, P., Sams, M., & Näätänen, R. (1992). Event-related
brain potential and discrimination of steady-state vowels within and between
phoneme categories: A preliminary study. Scandinavian Journal of Logopedics
and Phoniatrics, 17, 102-112.
Antinoro, F., & Skinner, P. H. (1968). The effects of frequency on the auditory
evoked response. Journal of Auditory Research, 8, 119-123.
Boothroyd, A. (1991). Assessment of speech perception capacity in profoundly
deaf children. American Journal of Otology, 12(Suppl.), 67-72.
Clynes, M. (1969). Dynamics of vertex evoked potentials: The R-M brain
function. In E. Donchin & D. B. Lindsley (Eds.), Average evoked potentials:
Methods, results, and evaluations (NASA SP-191) (pp. 363-374). Washington,
DC: U.S. Government Printing Office.
Coles, M. G. H., & Rugg, M. D. (1996). Event-related brain potentials: An
introduction. In M. D. Rugg & M. G. H. Coles (Eds.), Electrophysiology of mind.
Event-related brain potentials and cognition (pp. 1-23). New York: Oxford
University Press.
Davis, H., & Zerlin, S. (1996). Acoustic relations of the human vertex potential.
Journal of the Acoustical Society of America, 39, 109-116.
Geers, A. E., & Moog, J. S. (1994). ). Effectiveness of cochlear implants and
tactile aids for deaf children: The sensory aids study at Central Institute for the
Deaf. Volta Review, 96, 1-232.
Hillyard, S. A., & Picton, T. W. (1978). ON and OFF components in the auditory
evoked potential. Perception and Psychophysics, 24, 391-398.
Hari, R. (1991). Activation of the human auditory cortex by speech sounds. Acta
Otolaryngology. Supplement, 491, 132-138.
Imaizumi, S., Mori, K., Kiritani, S., & Yumoto, M. (1996). Neural representation of
concurrent sounds: A magnetoencephalographic study. In I. Hashimoto, Y. C.
Okada, & S. Ogawa (Eds.), Visualization of information processing in the human
brain: Recent advances in MEG and functional MRI (EEG Suppl. 47) (pp. 191197). Amsterdam: Elsevier Science.
Jerger, J., & Jerger, S. (1970). Evoked responses to intensity and frequency
change. Archives of Otolaryngology, 91, 433-436.
Kaukoranta, E., Hari, R., & Lounasmaa, O. V. (1987). Responses of the human
auditory cortex to vowel onset after fricative consonants. Experimental Brain
Research, 69, 19-23.
Kraus, N., McGee, T., Sharma, A., Carrell, T., & Nicol, T (1993). Mismatch
negativity event-related potential to speech stimuli. Ear and Hearing, 14, 158164.
Keidel, W. D., & Spreng, M. (1965). Neurophysiological evidence for the Stevens
power function in man. Journal of the Acoustical Society of America, 38, 191195.
Kurtzberg, D., Vaughan, H. G., Kreuzer, J., & Fliegler, K. Z. (1995).
Developmental studies and clinical application of mismatch negativity: Problems
and prospects. Ear and Hearing, 16, 104-116.
Mäkelä, J. P., Hari, R., & Leinonen, L. (1988). Magnetic responses of the human
auditory cortex to noise/square wave transitions. Electroencephalography and
Clinical Neurophysiology, 69, 423-430.
McGee, T., Kraus, N., King, C., Nicol, T., & Carrell, T. D. (1996). Acoustic
elements of speechlike stimuli are reflected in surface recorded responses over
the guinea pig temporal lobe. Journal of the Acoustical Society of America, 99,
3606-3614.
Näätänen, R. (1992). Attention and brain function. Hillsdale, New Jersey:
LawrenceErlbaum Associates.
Näätänen, R., Gaillard, A. W. K., & Mäntysalo, S. (1978). Early selective attention
effect on evoked potential reinterpreted. Acta Psychologica, 42, 313-329.
Näätänen. R., & Kraus, N. (1995). Special issue: Mismatch negativity as an index
of central auditory function. Ear and Hearing, 16, 1-146.
Näätänen, R., Paavilainen, P., Alho, K., Reinikainen, K., & Sams, M. (1987). The
mismatch negativity to intensity changes in an auditory stimulus sequence. In R.
Johnson, J. W. Rohrbaugh, & R. Parasuraman (Eds.), Current trends in eventrelated brain potential research (EEG Suppl. 40) (pp. 125-131). Amsterdam:
Elsevier.
Näätänen, R., Paavilainen, P., & Reinikainen, K, (1989). Do event-related
potentials to infrequent decrements in duration of auditory stimuli demonstrate a
memory trace in man? Neuroscience Letters, 107, 347-352.
Näätänen, R., & Picton, T. W. (1987). The N1 wave of the human electric and
magnetic response to sound: A review and an analysis of the component
structure. Psychophysiology, 24, 375-425.
Näätänen, R., Schröeger, E., Karakas, S., Tervaniemi, M., & Paavilainen, P.
(1993). Development of a memory trace for a complex sound in the human brain.
Neuroreport, 4, 503-506.
Onishi, S., & Davis, H. (1968). Effects of duration and rise time of tone bursts on
evoked V potentials. Journal of the Acoustical Society of America, 44, 582-591.
Ostroff, J., Martin, B. A., & Boothroyd, A. (1998). Cortical evoked responses to
spectral change with a syllable. Ear and Hearing, 19, 290-297.
Pantev, C., Eulitz, C., Hampton, S., Ross, B., & Roberts, L. E. (1996). The
auditory evoked "off" response: Sources and comparison with the "on" and the
"sustained" responses. Ear and Hearing, 17, 255-265.
Paavilainen, P., Karlsson, M. L., Reinikainen, K., & Näätänen, R. (1989).
Mismatch negativity to change in spatial location of an auditory stimulus.
Electroencephalography and Clinical Neurophysiology, 73, 129-141.
Phillips, D. P. (1993). Representation of acoustic events in primary auditory
cortex. Journal of Experimental Psychology: Human Perception and
Performance, 19, 203-216.
Phillips, D. P. (1995). Central auditory processing: A view fro auditory
neuroscience. American Journal of Otology, 16, 338-352.
Picton, T. W. (1995). The neurophysiological evaluation of auditory
discrimination. Ear and Hearing, 16, 1-5.
Picton, T. W., Woods, D. L., & Proulx, G. B. (1978). Human auditory sustained
potentials. II. Stimulus relationships. Electroencephalography and Clinical
Neurophysiology, 45, 198-210.
Sams, M., Aulanko, R., Aaltonen, O., & Näätänen, R. (1990). Event-related
potentials to infrequent changes in phonetic features of synthetic CV stimuli.
Journal of Cognitive Neuroscience, 2, 344-357.
Sams, M., Paavilainen, P., Alho, K., & Näätänen, R. (1985). Auditory frequency
discrimination and event-related potentials. Electroencephalography and Clinical
Neurophysiology, 62, 437-448.
Sandridge, S., & Boothroyd, A. (1996). Using naturally produced speech to elicit
the mismatch negativity. Journal of the American Academy of Audiology, 7, 105112.
]
Scherg, M., & Von Cramon, D. (1986). Evoked dipole source potentials of the
human auditory cortex. Electroencephalography and Clinical Neurophysiology,
65, 344-360.
Schröeger, E. (1994). An event-related potential study of sensory representations
of unfamiliar tonal patterns. Psychophysiology, 31, 175-181.
Schröeger, E., Näätänen, R., & Paavilainen, P. (1992). Event-related potentials
reveal how non-attended complex sound patterns are represented by the human
brain. Neuroscience Letters, 146, 183-186.
Semlitch, H. V., Anderer, P., Schuster, P., & Presslich, O. (1986). A solution for
reliable and valid reduction of ocular artifacts applied to the P300 ERP.
Psychophysiology, 23, 695-703.
Spoor, A., Timmer, F., & Odenthal, D. W. (1969). The evoked auditory response
(EAR) to intensity modulated and frequency modulated tones and tone bursts.
International Audiology, 8, 410-415.
Steinschneider, M., Arezzo, J., & Vaughan, H. G. (1982). Speech evoked activity
in the auditory radiations and cortex of the awake monkey. Brain Research, 252,
353-365.
Tyler, R. S. (1993). Speech perception by children. In R. S. Tyler (Ed.), Cochlear
implants: audiological foundations (pp. 191-256). San Diego: Singular Publishing
Group.
Vaughan, H. G., & Ritter, W. (1970). The sources of auditory evoked responses
recorded from the human scalp. Electroencephalography and Clinical
Neurophysiology, 28, 360-367.
Vaughan, H. G., Weinberg, H., Lehman, D., & Okada, Y. (1986). Approaches to
defining the intracranial generators of event-related electrical and magnetic
fields. In W. C. McCallum, R. Zapolli, & F. Denoth (Eds.), Cerebral
psychophysiology studies in event-related potentials (EEG Suppl. 38) (pp. 505544). Amsterdam: Elsevier.
Yingling, C. D., & Nethercut, G. E. (1983). Evoked responses to frequency
shifted tones: Tonotopic and contextual determinants. International Journal of
Neuroscience, 22, 107-118.
]
Reference Notes
1 Martin, B. A., Kurtzberg, D., & Stapells, D. R. (in press). The effects of
decreased audibility produced by high-pass noise masking on N1 and the
mismatch negativity to speech sounds /ba /and /da/. Journal of Speech and
Hearing Research.
2 Ostroff, J., Martin, B. A., & Boothroyd, A. (in preparation). Cortical evoked
responses to spectral change within ongoing vowels.
Appendix
Relative standard errors of the amplitudes of the acoustic change complex
and mismatch negativity.
We begin with four premises:
The underlying noise in the EEG signal is determined by the standard
deviation (d) of repeated voltage samples at a given temporal location in the
sampling window.
The noise in an averaged waveform is equal to d divided by the square root of
the number of sweeps (n) contributing to the average. That is:
Equation A1
The value of d is independent of the test protocol. i.e
Equation A2
The standard error of a difference between two averages is the square root of
the sum of the squares of the two standard errors. That is:
Equation A3
From (1), (2), (3), and (4): where na and nb are the number of sweeps
contributing to the averages a and b, respectively.
Equation A4
Consider, first, the acoustic change complex. Let: nac = the number of sweeps
contributing to the averaged waveform and eac = the standard error of the
difference between two means.
From equation (4):
Equation A5
Consider, next, the mismatch negativity. Let: nmm = the total number of
sweeps contributing to two averaged waveforms, p = the proportion of n
contributing to the deviant waveform, (1-p) = the proportion of n contributing to
the standard waveform, and emm = the standard error of the difference
between two means.
From equation (4):
Equation A6
From (5) and (6), the ratio between the two standard errors is or
Equation A7
In the present study, the inter-onset intervals for the mismatch and acoustic
change protocols were 600 and 3000 msec, respectively. In a given testing
time, therefore, five times as many stimuli could be presented in the mismatch
as in the change protocol. Within the mismatch protocol, the proportion of
deviants was 1/7, or 0.14. Substituting these values in equation (7):
Equation A8
It will be seen from Equation (7) that the ratio is determined by two opposing
factors. The first is the relative number of stimuli presented in the two
protocols. This factor favors the mismatch protocol. The second factor is the
division of the stimuli in the mismatch protocol into standards and deviants
and the small proportion of the latter. This factor favors the change protocol.
Under the conditions of the present study, these factors effectively cancel
each other.
*
An alternative approach would have been to create 790 msec of the tonal
complex and noise band and to shape the onsets and offsets. The
concatenation approach was chosen for two reasons: first, to ensure that any
change complex was not an artifact of the concatenation-overlap procedure;
second, to ensure that the second half of the noise-only stimulus had the
same waveform as the second half of the tone-noise stimulus.
† The spectra are superimposed to show the similarity of their band paths.
The spectral peaks of the tonal stimulus were corrected by - 13 dB. The peaks
of the spectra of the tonal stimuli are higher than those of the noise stimulus
because the energy is integrated over a smaller time frame.
‡If the standard error of the mean at a single sampling pint is represented by
se, the standard error of a difference between two independent means is se ×
√2. Peak-to-peak differences of 1.98 and 2.67 times this value may be taken
to be significantly different from zero at the 5% and 1% levels, respectively. It
could be argued that these criteria are too lax because the N1 and P2 peaks
are not selected at random. On the other hand, one could also argue for the
use of a less conservative 1-tail test, based on the known morphology of the
complex. Additionally, the means (N1 and P2) are not really independent.
Therefore, the standard error of the difference may be less than is estimated
here. In fact, application of the criteria developed here produces conclusions
that are in close agreement with judgments based on visual inspection of the
averaged waveforms. Moreover, these criteria can be applied uniformly across
subjects.
§In Clinical application it would be unnecessary to include the tone-only and
noise-only stimuli.
© 1999 Lippincott Williams & Wilkins, Inc.