Download Developing an ellipsoidal reflector for measuring otoacoustic

Document related concepts

Audiology and hearing health professionals in developed and developing countries wikipedia , lookup

Sound localization wikipedia , lookup

Soundscape ecology wikipedia , lookup

Sensorineural hearing loss wikipedia , lookup

Olivocochlear system wikipedia , lookup

Earplug wikipedia , lookup

Auditory system wikipedia , lookup

Sound from ultrasound wikipedia , lookup

Transcript
Vesa Heiskanen
Developing an ellipsoidal reflector for
measuring otoacoustic emissions
School of Electrical Engineering
Thesis submitted for examination for the degree of Master of
Science in Technology.
Espoo 15.5.2015
Thesis supervisor:
Prof. Ville Pulkki
Thesis instructor:
Prof. Bastian Epp
A!
Aalto University
School of Electrical
Engineering
aalto university
school of electrical engineering
abstract of the
master’s thesis
Author: Vesa Heiskanen
Title: Developing an ellipsoidal reflector for measuring otoacoustic emissions
Date: 15.5.2015
Language: English
Number of pages:10+44
Department of Signal Processing and Acoustics
Professorship: Acoustics and Audio Signal Processing
Code: S-89
Supervisor: Prof. Ville Pulkki
Instructor: Prof. Bastian Epp
Otoacoustic emissions (OAEs) are faint sounds emanated by the eardrum. They
can be recorded and are used as an objective measure to monitor the health of the
inner ear. Conventional measurement methods use a miniature microphone housed
in a probe that is fitted to the ear canal.
In this thesis a novel method for measuring OAEs was experimented by
using a large one-inch-diameter microphone which has the benefit of very low
inherent noise. A modular ellipsoidal reflector was developed and used to collect
OAEs and focus them to the microphone. The applicability of a such reflector
for recording OAEs was studied. Response measurements were made to examine
the acoustical properties of the different reflector setups comprised of different
part-combinations. The results show that the reflector setups providing the most
gain could be employed for OAE measurements where the temporal response of
the measurement system is not overridingly critical such as distortion product
otoacoustic emissions (DPOAEs), stimulus-frequency otoacoustic emissions
(SFOAEs) or spontaneous otoacoustic emissions (SOAEs). This limitation is due
to reflections in the response that take over 10 ms to decay.
DPOAE measurements were made to two subjects to test the reflector
measurement system in practice. Succesful measurements were accomplished at
the two used stimulus frequencies which is a proof of concept that this kind of
measurement system can be used to record OAEs. Further investigations are
needed to be able to compare the properties of the reflector system to conventional
measurement probes.
Keywords: acoustics, ellipsoidal reflector, otoacoustic emission (OAE), DPOAE,
SFOAE, SOAE
aalto-yliopisto
sähkötekniikan korkeakoulu
diplomityön
tiivistelmä
Tekijä: Vesa Heiskanen
Työn nimi: Ellipsoidin heijastimen kehittäminen otoakustisten emissioiden
mittaamiseen
Päivämäärä: 15.5.2015
Kieli: Englanti
Sivumäärä:10+44
Signaalinkäsittelyn ja akustiikan laitos
Professuuri: Akustiikka ja äänenkäsittelytekniikka
Koodi: S-89
Valvoja: Prof. Ville Pulkki
Ohjaaja: Prof. Bastian Epp
Otoakustiset emissiot (OAE) ovat kuulojärjestelmän tuottamia heikkoja ääniä,
jotka voidaan mitata. Mittausta voidaan käyttää sisäkorvan terveydentilan
tarkasteluun. OAE:t mitataan tavallisesti käyttämällä korvakäytävään sovitettavaa
anturia, joka sisältää pienikokoisen mikrofonin.
Tässä diplomityössä kokeiltiin uutta tapaa otoakustisten emissioiden mittaamiseen käyttämällä suurta yhden tuuman halkaisijan mikrofonia, jonka
etuna on todella pieni sisäinen kohina. Osista koostuva ellipsoidin muotoinen
heijastin kehitettiin kokoamaan OAE:t ja kohdistamaan ne mikrofoniin. Työn
tarkoitus oli tutkia tällaisen heijastinjärjestelmän soveltuvuutta OAE:iden
mittaamiseen. Vastemittauksilla tarkasteltiin eri heijastinkokoonpanojen akustisia
ominaisuuksia. Eniten vahvistusta tarjoavia kokoonpanoja voisi käyttää sellaisten
OAE-tyyppien mittaamiseen, joissa mittausjärjestelmän aikavasteen lyhyys ei ole
ensisijaisen ratkaiseva ja jotka voidaan tehdä esimerkiksi käyttämällä jatkuvaa
ja pitkäkestoista herätettä. Tämä rajoitus johtuu kokoonpanojen aikavasteissa
olevista heijastuksista, joiden vaimeneminen kestää yli 10 ms.
Järjestelmän testaamiseksi käytännössä, DPOAE-mittaus tehtiin kahdelle
koehenkilölle. Mittaukset onnistuivat kahdella käytetyllä herätetaajuudella, joka
osoittaa, että kehitettyä heijastinjärjestelmää voi käyttää OAE:iden mittaamiseen.
Heijastinjärjestelmän vertaaminen normaalissa käytössä oleviin OAE-antureihin
edellyttää lisätutkimuksia.
Avainsanat: akustiikka, ellipsoidi heijastin, otoakustinen emissio (OAE)
Acknowledgements
This work is carried out in Department of Signal Processing and Acoustics, Aalto
University School of Electrical Engineering, during the year 2014.
I would like to thank my instructor professor Bastian Epp from the Technical
University of Denmark for providing knowledge, guidance and practical advice along
the course of this work. I express appreciation towards my supervisor Ville Pulkki
for the opportunity of doing this work as well as his ideas, guidance and comments.
I am thankful to Ilkka Huhtakallio for teaching me how to use laboratory equipment
and using his time and vision on helping me to build measurement setups. From
the process of building the reflector I am grateful to Hannu Paajanen from the
Department of Design for lending his know-how of manufacturing techniques. Also
thanks to Kari Kääriäinen from Design Factory and Tapani Honkavaara. Thanks to
Juha Backman and Benedict Slotte from Nokia for providing their knowledge and a
miniature speaker for use in the measurements. And thanks to Javier Gómez and
the rest of the staff in the Department of Signal Processing and Acoustics for the
assistance.
Finally, I would like to thank my family and friends for all the love and support.
Otaniemi, 24.2.2015
Vesa Heiskanen
iv
Contents
Abstract . . . . . . .
Abstract (in Finnish)
Acknowledgements .
Contents . . . . . . .
Abbreviations . . . .
List of Figures . . . .
List of Tables . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
. ii
. iii
. iv
. v
. vii
. viii
. x
1 Introduction
1
2 Otoacoustic emissions
2.1 Cochlear amplifier and otoacoustic emissions . .
2.2 Measurement of otoacoustic emissions . . . . . .
2.2.1 Transient-evoked otoacoustic emissions .
2.2.2 Distortion product otoacoustic emissions
2.2.3 Stimulus-frequency otoacoustic emissions
2.2.4 Spontaneous otoacoustic emissions . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
3
3
6
7
8
10
10
3 Microphones and reflectors
12
3.1 Inherent noise in condenser microphone systems . . . . . . . . . . . . 12
3.2 Ellipsoidal reflectors . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
4 Methodology and experiments
4.1 Design of a reflector used to measure
otoacoustic emissions . . . . . . . . . .
4.1.1 Design of the reflector . . . . .
4.1.2 Reflector gain measurements . .
4.2 Otoacoustic emission measurements . .
4.2.1 Organization of the experiments
4.2.2 Stimulus parameters . . . . . .
4.2.3 Data collection and analysis . .
4.2.4 System distortion measurement
18
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
18
18
20
22
22
23
24
25
5 Results
26
5.1 Reflector gain measurement results . . . . . . . . . . . . . . . . . . . 26
5.1.1 Focus to focus gain . . . . . . . . . . . . . . . . . . . . . . . . 26
v
vi
5.2
5.1.2 Effect of offset from the focus . . . . . . . . . . . . . . . . . .
5.1.3 Discussion of the acoustical characteristics of the reflector . . .
Otoacoustic emission measurement results . . . . . . . . . . . . . . .
5.2.1 Comparison of open and sealed ear in DPOAE measurements
5.2.2 DPOAE growth . . . . . . . . . . . . . . . . . . . . . . . . . .
5.2.3 System distortion . . . . . . . . . . . . . . . . . . . . . . . . .
5.2.4 Discussion of the DPOAE measurement results . . . . . . . .
27
30
31
31
35
36
38
6 Conclusions
40
Bibliography
44
Abbreviations
CF
DPOAE
EOAE
IHC
OHC
SFOAE
SNR
SOAE
SPL
SSOAE
TEOAE
Charasteristic frequency
Distortion product otoacoustic emission
Evoked otoacoustic emission
Inner hair cell
Outer hair cell
Stimulus-frequency otoacoustic emission
Signal-to-noise ratio
Spontaneous otoacoustic emission
Sound pressure level
Synchronized spontaneous otoacoustic emission
Transient-evoked otoacoustic emission
vii
List of Figures
2.1
2.2
2.3
2.4
2.5
Structure of the ear. The outer ear is the part external to the tympanic
membrane featuring the pinna and the external auditory canal. The
middle ear is the portion internal to the tympanic membrane and
external to the cochlea. The inner ear is comprised of the cochlea and
the semicircular canals of the vestibular system. Picture adapted from
(46). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Illustration of the traveling wave motion along the basilar membrane.
The spiral appearance of the cochlea has been stretched out and the
displacement of the basilar membrane is greatly exaggerated. Figure
adapted from (49). . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Cross section of the cochlea featuring a schematic view of the organ
of Corti. Figure adopted from (42). . . . . . . . . . . . . . . . . . .
Example response of a clinical TEOAE measurement. On the left
waveform of two time-locked averages are shown. There is a few
millisecond delay preceding the response. The right side of the figure
shows the levels of the emission divided into frequency bands (blue)
and noise levels in these given bands (red). . . . . . . . . . . . . . . .
A spectrum of an DPOAE measurement from the ear of a 16-monthold child. Stimulus tones (f1 = 1.807 kHz; f2 = 2.002 kHz; L1 = 65 dB
SPL and L2 = 60 dB SPL) and resulting DPOAEs are labeled to the
figure. Several DPOAEs occur at frequencies below the primaries (4f1
– 3f2 = 1.222 kHz; 3f1 – 2f2 = 1.417 kHz and 2f1 – f2 = 1.612 kHz) and
one above the primaries ( 2f2 – f1 = 2.197 kHz). Figure adapted from
(35) [frequency component labels are redrawn due to low-quality copy
of the figure]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.1
3
4
5
7
9
Equivalent levels of inherent noise (1/3 octave bandwidth) produced
by different elements of a microphone system equipped with a 1”
free-field microphone (sensitivity 50mV/Pa). The microphone, the
preamplifier and the measurement amplifier all contribute significantly
to the noise floor of the system in certain parts of the frequency range.
Figure adopted from (6). . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.2 Comparison of 1/3 octave noise spectra of microphone systems equipped
with different size microphones. Figure adopted from (6). . . . . . . . 14
viii
ix
Inherent noise comparison between a conventional OAE probe microphone (ER-10B+) and a 1” low-noise instrumentation microphone
(G.R.A.S. 40HF) measured in an anechoic chamber. SPL is a 10s
average. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.4 Principle of the focusing effect of an elliptical shape. A ray originating from one focal point (f1) reflects to the other focal point (f2)
independent of the part on the elliptical shape where it reflects. . . .
3.5 Principle of the focusing effect of a truncated elliptical shape. If a ray
originating from the other focus (f1) collides surface of this truncated
elliptical shape it reflects to the other focus (f2). . . . . . . . . . . . .
3.6 Ellipse dimensions and ray travel distance. . . . . . . . . . . . . . . .
3.3
4.1
4.2
4.3
4.4
4.5
5.1
5.2
5.3
5.4
General idea of using a truncated ellipsoidal reflector in recording of
otoacoustic emissions. The ear canal and the microphone capsule are
at the focal points of the reflector. The distance between the foci is
404 mm. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Different reflector assemblies with test subject and microphone positioning: (a) half reflector, (b) two-sided reflector, (c) two-sided
reflector with detachable end. . . . . . . . . . . . . . . . . . . . . . .
Manufactured reflector parts: (a) all parts separately, (b) two halves
together and the detachable end alongside, (c) detachable end with
the microphone mounted in place . . . . . . . . . . . . . . . . . . . .
Picture of the setup used to measure the responses of the reflector.
The mobile device speaker element mounted in an enclosure made
out of a plastic tube, play dough and foam is on the left side of the
reflector. The measurement microphone is on the right side. . . . . .
The setup from preliminary measurements shows the equipment used
in the final measurements with the difference of speaker model and
placement. An artificial head is placed under the reflector to demonstrate the position of the test subject. The measurement microphone
is inserted inside the reflector from the other end. The reflector is
mounted to a three-legged stand featuring vibration isolating foam. .
Magnitude responses of the gain measurements between the two foci.
Gain responses of the different reflector setups plotted as a difference
to the free field response at the same distance. . . . . . . . . . . . . .
Energy decay curves for different reflector setups. . . . . . . . . . . .
The effect of offsetting the sound source away from the axis going
through the focal points of the reflector. 1/12 octave smoothing was
employed in the curves. . . . . . . . . . . . . . . . . . . . . . . . . . .
The effect of offsetting the sound source along the axis going through
the focal points. 1/12 octave smoothing was employed in the curves. .
15
16
17
17
19
20
21
22
23
27
28
29
29
x
5.5
5.6
5.7
5.8
5.9
5.10
Sound spectra measured with the reflector with two loudspeakers
producing sinusoids at 3000 Hz and 3660 Hz and the subject lying
under the reflector with his ear canal approximately at the focal point.
Left: response with sealed ear canal. Right: response with open ear
canal, where most prominent otoacoustic emissions are marked with
red circles. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Sound spectra measured with reflector from all the subjects at f2 =
2745 and f2 /f1 = 1.22. The stimulus level was approximately L1 =
L2 = 65 dB. On the left the measurement with a sealed ear and on
the right the measurement with an open ear. (a) test subject S, (b)
test subject A, (c) test subject A with a different positioning of the
stimulus producing speakers. . . . . . . . . . . . . . . . . . . . . . . .
Sound spectra measured with reflector from all the subjects at f2 =
3660 and f2 /f1 = 1.22. The stimulus level was approximately L1 =
L2 = 65 dB. On the left the measurement with sealed ear and on
the right the measurement with open ear. (a) test subject S, (b)
test subject A, (c) test subject A with a different positioning of the
stimulus producing speakers. . . . . . . . . . . . . . . . . . . . . . . .
Measured DPOAE levels from all the test subjects. Level of 2f1 – f2
distortion component, f2 /f1 = 1.22. Orange line: DPOAE levels, dark
line: linear growth reference, grey area height: noise levels. Red circles
denote that the emission level satisfies the 12 dB SNR criterion (sect.
4.2.3). (a) test subject S, (b) test subject A, (c) test subject A with a
different positioning of the stimulus producing speakers. . . . . . . . .
Comparison of speaker intermodulation distortion with two different
speaker positionings without a test subject. Left figure: speakers
positioned next to each other on the same stand. Right figure: speakers
positioned on a 2 m distance from each other. Red circles denote
second order intermodulation distortion componets, blue circles third
order components, a yellow circle a one fourth order component and
pink circles fifth order components. . . . . . . . . . . . . . . . . . . .
Comparison of speaker intermodulation distortion with two different
speaker positionings without a test subject. Red circles denote second
order intermodulation distortion components. . . . . . . . . . . . . .
31
32
34
36
37
38
Chapter 1
Introduction
Otoacoustic emissions (OAEs) are faint sounds emanating from the ear. The level
of these sounds is usually too low to be heard, but they can be measured with a
sensitive microphone and techniques designed for the purpose. OAEs can be evoked
with a sound stimulus but they can be present also without a stimulus. They are
an objective measure to assess the function of the inner ear (23). The most familiar
test of hearing is pure-tone audiometry, where different-level pure-tone sounds are
reproduced to the test subject’s ears and the subject responds by pressing a button
when he hears the sound. Hearing thresholds are determined at different frequencies
according to the minimum levels of the pure-tones that the test subject is able to
detect. In a way this test offers the most accurate information about a persons’
ability to hear since it involves the whole process of hearing.
In addition to behavioural audiometric tests, objective physiological examinations
such as measurement of OAEs have been developed. OAEs have various applications.
OAE tests can be used to evaluate the functionality of the test subject’s inner ear
when he cannot be expected to respond in an anticipated manner. As an example of
this, OAE testing is widely used in newborn hearing screenings. OAEs are useful
in audiological diagnostics along with other physiological measures on monitoring
different stages of the auditory pathway (2). As information of the function of the
inner ear is retrieved with this measure, it is also of interest in science.
OAEs are commonly measured with a probe fitted to the ear canal. This probe
features a microphone and possibly loudspeakers for reproducing a stimulus sound.
Clearly the microphone has to be in miniature-size to fit in a such probe. Small
microphones have higher inherent noise compared to microphones larger in diameter.
This inherent noise limits the lower end of the microphone’s dynamic range. When
recording low level sounds such as OAEs, low inherent noise would be a truly
desireable quality for a microphone to achieve a good signal-to-noise ratio (SNR).
The motivation of this study is to investigate if a higher SNR compared to the common
method can be obtained employing a sensitive low-noise one-inch microphone at a
considerable distance from the ear canal opening and an acoustic reflector to collect
and focus the sound emerging from the ear canal.
1
An ellipsoidal shape for the acoustic reflector has the correct properties for this study.
To our knowledge a such reflector has not been used for recording OAEs. If the
intented technique is succesful it could offer better quality recordings compared to
the common method, e.g., higher SNR, and it could provide new information about
OAEs since the ear canal is not sealed with the microphone probe. The objectives of
this thesis is to answer the following questions:
• Is recording of OAEs possible by using an acoustic ellipsoidal reflector?
• What is the gain achieved with the reflector to be designed i.e. what is the
frequency response? What about temporal characteristics?
• What is the overall SNR achievable with this system including the lower noise
floor of the measurement microphone compared to a common OAE probe?
Comparing the levels of OAEs measured with the reflector setup and conventional
methods requires quantifying the relation between sound pressure level at the entrance
of the ear canal to the level which is measured with the reflector setup farther away.
Conventional OAE measurement probes are fitted to the ear canal and thus also
pick up body noises such as breathing or swallowing, which increase the background
noise of the recording. The reflector measurement system can be mounted without
touching the test subject’s body at all, and therefore benefit from lower body noise.
Due to the noise originating from the human body (accompanied by the inherent
noise from the miniature microphone) the low frequency limit of OAE recordings
is around 500 Hz with the conventional methods. An interesting question is if the
frequency range of OAE recordings could be extended towards low frequencies with
the reflector measurement setup.
OAEs have already been recorded without sealing the ear canal with a miniature
microphone that is not housed in a probe that seals the ear canal (4). Sealing the
ear canal alters it’s acoustic load and thus OAEs recorded from an open ear reflect
more natural characteristics of the emission. The reflector technique would also have
this advantage of an open ear.
The structure of this work is described as follows. Chapter 2 focuses on OAEs: the
mechanisms behind them is discussed briefly as well as methods how to measure them.
In chapter 3 the relevant attributes of key measurement tools in this study, condenser
microphones and ellipsoidal reflectors are explained in general terms. Chapter 4
presents the methodology used to answer the research questions and the organization
of the experiments. Details of the reflector designed for the study can be found in
this chapter. Results of the study are presented and discussed in chapter 5 and
finally the thesis is concluded in chapter 6.
Chapter 2
Otoacoustic emissions
2.1
Cochlear amplifier and otoacoustic emissions
Ossicles
Figure 2.1: Structure of the ear. The outer ear is the part external to the tympanic
membrane featuring the pinna and the external auditory canal. The middle ear is
the portion internal to the tympanic membrane and external to the cochlea. The
inner ear is comprised of the cochlea and the semicircular canals of the vestibular
system. Picture adapted from (46).
The structure of the ear is shown in figure 2.1. The sound entering the ear canal
causes the tympanic membrane (eardrum) to vibrate. Three small bones known
as the ossicles transfer the energy from the eardrum to the cochlea in the inner
ear. This results in pressure variations in the fluids of the cochlea that displace
the flexible cochlear partition composed of basilar membrane, organ of Corti and
3
CHAPTER 2. OTOACOUSTIC EMISSIONS
4
tectorial membrane. The displacement takes the form of a wave pattern, that moves
from the base of the cochlea to the apex (7). A.k.a. the traveling wave. This means
that the flexible cochlear partition carries a transversal wave that is similar to the
waves formed in water. Figure 2.2 illustrates the traveling wave motion along the
basilar membrane. The amplitude increases as the traveling wave moves from the
base of the cochlea towards the apex and at a certain point the amplitude reaches a
maximum followed by a rapid decrease. The location of the maximum amplitude
peak along the cochlear partition is frequency dependent. High frequencies peak
close to the base of the cochlea and as the frequency is decreased the peaking occurs
closer to the apex in systematic order depending on the frequency. A frequency
corresponding to one certain place along the cochlear partition is referred to as the
charasteristic frequency (CF). (34)
Base
Apex
Figure 2.2: Illustration of the traveling wave motion along the basilar membrane.
The spiral appearance of the cochlea has been stretched out and the displacement of
the basilar membrane is greatly exaggerated. Figure adapted from (49).
A schematic slice through the cochlea is shown in figure 2.3 displaying the flexible
cochlear partition in the center. The organ of Corti runs along the entire length
of the basilar membrane and contains inner hair cells (IHCs) and outer hair cells
(OHCs) (42). In the human cochlea IHCs form a single row of cells while OHCs are
arranged in three to five rows (31). The IHCs convert the vibration at a certain
place along the basilar membrane into a neural code in auditory nerve (31). The
OHCs seem to involve in an active process in hearing. The basic features of this
CHAPTER 2. OTOACOUSTIC EMISSIONS
5
Figure 2.3: Cross section of the cochlea featuring a schematic view of the organ of
Corti. Figure adopted from (42).
phenomenon are provided next in this chapter.
Active biologic mechanisms have been measured in mammalian ears. Basilar membrane responses measured at the charasteristic place (peak of the traveling wave)
grow linearly at low stimulus levels but demonstrate compressive growth at high
stimulus levels in a healthy living cochlea. Death of the cochlea abolishes this
nonlinear behaviour (39). A healthy living cochlea also exhibits nonlinear behaviour
in the form of refined tuning at the CF (36). The influence of active mechanisms to
cochlear function is shown also in other measurements (33, 37).
These active biological mechanisms are often referred to as the ”active process” or
more specifically as the ”cochlear amplifier”. The cochlear amplifier is believed to be
responsible of the nonlinear behaviour and the exceptional sensitivity and frequency
selectivity in a healthy cochlea in comparison to a damaged or a dead cochlea
(17, 8, 9). Current evidence suggests that OHCs contribute to this amplification
process. The consequence to the loss of OHCs appers to be loss of the cochlear
amplifier (16, 35).
Otoacoustic emissions (OAEs) are intimately associated to the cochlear amplifier
(23). Somehow related to this amplification process hearing produces these faint but
measurable sounds which are utilized in clinical applications and research. OAEs
are vulnerable to acoustic trauma, hypoxia and ototoxic medications, all of which
cause hearing loss and damage to OHCs (35). There is a link between OHCs and the
production of OAEs similarly as OHCs are linked to the cochlear amplifier. However,
OAEs have been measured also from species without OHCs (29). Therefore OHCs
are unlikely to be the sole producer of OAEs.
For a long time all types of OAEs were thought to be caused by the same mechanism.
More recent knowledge suggests that mammalian OAEs arise from two fundamentally
CHAPTER 2. OTOACOUSTIC EMISSIONS
6
different mechanisms: nonlinear distortion and linear coherent reflection (43). Emissions from nonlinear distortion are attributed directly to the action of OHCs, and the
source of this type of emissions is believed to follow the traveling wave envelope of
the stimulus that is used to evoke the emission. That means that the actual physical
place where the emission originates in the cochlea is dependent on the frequency
of the stimulus. The linear coherent reflection emissions or ”place-fixed” emissions
are proposed to originate from impedance perturbations in the mechanics of the
cochlea or impedance mismatches present at or near the largest displacement of the
traveling wave. These impedance perturbations could be for example variations in
hair cell number and geometry (20). Slightly changing the frequency of the stimulus
does not change the physical source of the emission – hence the name ”place-fixed”.
Changing the frequency of the stimulus alters the peaking location of the traveling
wave, but as the source point of the emission stays fixed this results in phase changes
due to the altered traveling time along the basilar membrane. OAEs measured in
the ear canal are likely a combination of energy from the two mechanisms. The
association between different types of OAEs and the two generation mechanisms will
be explained later in this chapter in the sections dedicated for different emission
types. (43, 35)
2.2
Measurement of otoacoustic emissions
A usual measurement setup for detecting otoacoustic emissios (OAEs) consists of a
minituare microphone housed in a small probe that fits the ear canal. The probe
is coupled to the ear by a foam or rubber tip. The probe can also contain one or
two speakers for providing sound stimuli. The microphone signal is amplified and
sampled via an AD-converter. The output is then appropriately analyzed depending
on the type of OAE that is being measured. (35)
OAEs have been traditionally divided to different types based on the type of stimulus
used or lack of it (35). Spontaneous otoacoustic emissions (SOAEs) are recorded
without any external stimulus. Evoked otoacoustic emissions (EOAEs) are measured
during or after presenting an acoustic stimulus to the ear. EOAEs are further
categorized by the type of stimulus used. Transient evoked otoacoustic emissions
(TEOAEs) are elicited by using transient stimuli such as clicks. Distortion product
otoacoustic emissions (DPOAEs) are present when a stimulus containing two pure
tones is reproduced to the ear. Stimulus-frequency otoacoustic emissions (SFOAEs)
are elicited by one pure tone. Synchronized spontaneous otoacoustic emissions
(SSOAEs) use a transient stimulus as an averaging point, but the emission itself is
assumed to be spontaneous (38).
CHAPTER 2. OTOACOUSTIC EMISSIONS
7
Figure 2.4: Example response of a clinical TEOAE measurement. On the left
waveform of two time-locked averages are shown. There is a few millisecond delay
preceding the response. The right side of the figure shows the levels of the emission
divided into frequency bands (blue) and noise levels in these given bands (red).
2.2.1
Transient-evoked otoacoustic emissions
Transient-evoked otoacoustic emissions (TEOAEs) are evoked by transient or brief
stimulus such as a click or a tone burst (35). A emission response is present after a
short time delay after stimulus onset. A single response is very low in level and hence
multiple measurements are required to accomplish a sufficient signal-to-noise ratio
(SNR) by using time-synchronous averaging. Commonly responses resulting from
about 200 stimulus repetitions need to be averaged to get a valid TEOAE response
(19).
A TEOAE from a healthy ear has a broad spectrum resulting from the transient
stimulus used to evoke the emission. Thus a TEOAE has a broad spectrum similarly
as transient signals do. The level of the stimulus in most clinical applications is
around 80 dB peak equivalent SPL or about 45 dB greater than perceptual threshold.
(38)
Figure 2.4 shows an example of the outcome from a clinical TEOAE measurement.
The left side of the figure shows two overlapping waveforms (green and blue) of the
averaged response. The two waveforms are from two distinct averaging buffers to
which multiple click responses are divided for extracting a single TEOAE response.
Similarity of the two averaged buffers (as in the figure) communicates reliability
of the measurement. It can be seen that the response starts around 3 ms after
the stimulus onset. The first few milliseconds are usually eliminated from the final
averaged waveform because energy from the stimulus may persist in the ear canal
long enoungh to obscure the onset of the TEOAE response (35). On the right side of
the figure 2.4 are shown the TEOAE response levels divided to half octave frequency
bands.
CHAPTER 2. OTOACOUSTIC EMISSIONS
8
In general, TEOAE levels for humans are largest around 1–1.5 kHz, and decrease at
lower and higher frequencies. TEOAE levels are typically very small or immeasurable
below 0.5 kHz and above approximately 4 kHz particularly in adult ears. Middle-ear
effects and contamination by subject-generated noises such as breathing and body
movements are a likely cause for the decline below 1 kHz. The decline at high
frequencies likely reflects many factors: middle ear, frequency dependend latency
of the response and restrictions in the measurement equipment (e.g. the frequency
response of the loudspeakers built into measurement probes). (28)
Different frequency components of a TEOAE response have a different latency between
∼3 and 20 ms and on average the latency decreases with increasing frequency (21).
As usually the first few milliseconds are eliminated from a TEOAE response in order
to remove the stimulus artifact, some information at high frequencies is lost.
TEOAEs are present basically in every normally hearing subject (19). TEOAEs are
used in newborn hearing screening, which is a procedure for deciding whether further
examinations are advised. When it comes to the two different OAE generation
mechanisms in the cochlea, low-level TEOAEs arise predominantly through the linear
coherent reflection mechanism (43).
2.2.2
Distortion product otoacoustic emissions
Distortion product otoacoustic emissions (DPOAEs) arise by stimulation with two
continuous pure tones simultaneously. DPOAEs are attractive as clinical tools
because the emission frequencies can be predicted exactly by the stimulus frequencies
(38). The two pure tones of the stimulus are designated by f1 (lower frequency) and f2
(higher frequency) and the emissions are elicited at frequencies which correspond to
two-tone intermodulation distortion components (35). This means that the emissions
are at different frequencies compared to the stimulus which allows to reproduce the
stimulus and to record the emission at the same time. On average the distortion
component 2f1 – f2 yields the highest amplitude in humans and other mammalian
ears (13). Therefore 2f1 – f2 is the component that is most extensively investigated
and well-suited for clinical purposes (15).
The two tones f1 and f2 also known as primaries, interact in the cochlea close to
the characteristic place of f2 (19). Thus, it has been suggested that DPOAEs could
be used as a frequency-specific assessment of cochlear dysfunction at the f2 place.
DPOAE responses are influenced by stimulus amplitude, relative amplitude between
the two primaries and frequency ratio between them (38). That is, the primaries
need to be reasonably close in frequency and at a reasonable level to elicit a notable
response – in addition these variables can be optimized to yield high response levels
(13). When considering the two emission generation mechanisms in the cochlea (sec
2.1) DPOAEs are a mixture of both, which have also succesfully been unmixed to
separate components (20).
The two stimulus frequencies are usually reproduced by separate transducers to avoid
CHAPTER 2. OTOACOUSTIC EMISSIONS
f1
3f1–2f2
9
f2
2f1–f2
2f2–f1
4f1–3f2
Figure 2.5: A spectrum of an DPOAE measurement from the ear of a 16-month-old
child. Stimulus tones (f1 = 1.807 kHz; f2 = 2.002 kHz; L1 = 65 dB SPL and L2 = 60
dB SPL) and resulting DPOAEs are labeled to the figure. Several DPOAEs occur at
frequencies below the primaries (4f1 – 3f2 = 1.222 kHz; 3f1 – 2f2 = 1.417 kHz and
2f1 – f2 = 1.612 kHz) and one above the primaries ( 2f2 – f1 = 2.197 kHz). Figure
adapted from (35) [frequency component labels are redrawn due to low-quality copy
of the figure].
generating distortion and especially intermodulation distortion in the measurement
system itself. Intermodulation components would be at the same frequencies with
the DPOAE components. Time-synchronous averaging is used to reduce noise in the
measurement result.
Figure 2.5 shows an FFT analysis of a DPOAE response where both, stimulus and
emission frequency components can be seen. The two stimulus components clearly
have the highest amplitude while there are four DPOAE components at lower levels.
Detecting OHC impairment requires DPOAEs to be measured with stimulus levels
as close as possible to the hearing threshold where OHC function is maximal (25).
In practice this means DPOAE measurements to be done at various stimulus levels
from low to moderate or high, the range being about 20–65 dB. DPOAE level
input/output functions plot the levels of a DPOAE component measured at a fixed
CHAPTER 2. OTOACOUSTIC EMISSIONS
10
stimulus frequency f2 as a function of stimulus level L2 . In an ear without hearing
impairment the slopes of DPOAE I/O functions are steep at low stimulus levels while
at higher stimulus levels the slopes decrease (19). This mirrors the high amplification
of the cochlear amplifier at low stimulus levels and decreasing amplification at
higher stimulus levels. This kind of a compressive behaviour is observed when using
an appropriate stimulus setting where the level difference L1 – L2 increases with
decreasing stimulus level (25). A similar stimulus setting has been suggested later
(26).
2.2.3
Stimulus-frequency otoacoustic emissions
A stimulus-frequency otoacoustic emission (SFOAE) is a constant low-level pure-tone
that occurs at the same frequency and at the same time as the applied constant
pure-tone stimulus. Because the stimulus and the emission superimpose in time and
frequency, specialized measurement techniques must be used to extract the SFOAE
from the evoking stimulus. For example using a second stimulus slightly higher or
lower in frequency can be used to suppress the SFOAE (35). Then a measurement
can be made with and without the presence of the suppressor tone. The difference
of the two cases can be attributed to the SFOAE, or depending on the specifics of
the second tone, the portion that remains unsuppressed by the second tone. Other
measurement methods have been used that produce similar results (35, 5, 40, 41).
Out of the two OAE generation mechanisms in the cochlea, like TEOAEs low-level
SFOAEs arise predominantly through linear coherent reflection (43). The properties
of SFOAEs are also similar to those of TEOAEs. SFOAEs are present in nearly all
normally hearing human subjects, are strong at same frequencies where TEOAEs are
and demonstrate latencies and compressive nonlinearity similar to those of TEOAEs
(24). Charasteristics of SFOAEs in ears with normal hearing and ears with hearing
loss have not been extensively studied and therefore SFOAEs are not used clinically
(35). A laboratory experiment in 85 ears showed that SFOAEs identified hearing
loss at 0.5 kHz better than TEOAEs and DPOAEs while performing similarly to
other evoked emissions at higher frequencies except a slight performance reduction
at 4 kHz (11).
2.2.4
Spontaneous otoacoustic emissions
Spontaneus otoacoustic emissions (SOAEs) are narrow-band, often almost tonal,
acoustic signals present in the ear canal withouth the presence of an external stimulus
(28). They are not present in all the human ears with normal hearing. Estimates
of the prevalence of SOAEs go from 40% (3) to about 70% (45, 3). They are more
commonly found in females compared to males and more common in the right ear
compared to the left ear (3).
SOAEs are presumed to be caused by the linear reflection mechanism (sec 2.1) via
CHAPTER 2. OTOACOUSTIC EMISSIONS
11
multiple reflections within the cochlea (43). SOAEs have been hypotesized to result
from feedback to the generator of linear reflection emissions: the output of this
generator is reflected from stapes (a bone at the ’input’ of the cochlea) back to
the input of the generator (22). In (28) this process is described followingly: ”At
frequencies where this feedback is positive, self-sustaining oscillation results if the
loop gain is sufficient. The resulting self-sustaining oscillation is observed in the ear
canal as an SOAE. Thus, SOAEs can be thought of as continuously self-eliciting
evoked OAEs.”.
While no stimulus is used to record SOAEs there exists a variant, synchronized
spontaneous otoacoustic emission (SSOAE). They are recorded using a transient
stimulus as an averaging point with the goal to reduce background noise. Timelocked averaging is done over a ∼80 ms window after a click stimulus. As TEOAEs
persist roughly 20 ms after the click stimulus and are thus present at the start of the
averaging window, spectral analysis is done on the latter part of the ∼80 ms window
to reveal possible spontaneous emission(s). (14)
Chapter 3
Microphones and reflectors
3.1
Inherent noise in condenser microphone systems
The lower end of the dynamic range of a microphone system is limited by it’s noise
floor. In an electrical system such as a microphone system, noise is generated as a
result of physics of the devices and materials that make up the system (30). Thermal
noise and low-frequency 1/f noise are examples of noise types present in electronic
devices. In microphone systems parameters such as the size of the microphone’s
diaphragm have an effect on the noise floor.
Condenser microphones are used in acoustical instrumentation because of their
stability, flat frequency response, wide dynamic range, reasonable sensitivity and
remarkably low self-noise (27). These factors make condenser microphones a suitable
tool also for measuring OAEs. A condenser which is more commonly in electronics
know as a capacitor, is an electronic component which is used to store energy
electrostatically in an electric field. This electric field is usually created between two
electrodes such as metal plates insulated from each other so that electrons are not
able to move directly between them. The ability to store electrical energy, a charge,
is called capacitance. A condenser microphone is a capacitor where the microphone
diaphragm is one of the electrodes and a backplate is provided to function as the
other. These electrodes are insulated from each other by air. The movements of the
diaphragm cause changes in the capacitance which converts to voltage fluctuations
(27). This allows transducing acoustic signals to electric ones.
The capacitance of a capacitor is proportional to the surface area of the electrodes.
Therefore among condenser microphones a microphone with large diaphragm has
larger capacitance compared to a microphone with a small diaphragm. Capacitance
of condenser measurement microphones varies from about 3 pF for 1/8” microphones
to 70 pF for 1” microphones (6).
A condenser microphone is used together at least with a preamplifier and the
12
CHAPTER 3. MICROPHONES AND REFLECTORS
13
5
dB
re. 20 µPa
0
a) Complete system
–5
– 10
b) Preamplifier +Measurement Amplifier
– 15
– 20
c) Measurement Amplifier
– 25
– 30
8
16
31.5
63
125
250
500
1k
2k
4k
31.5 k
8k
16 k
Frequency (Hz)
Figure 3.1: Equivalent levels of inherent noise (1/3 octave bandwidth) produced by
different elements of a microphone system equipped with a 1” free-field microphone
(sensitivity 50mV/Pa). The microphone, the preamplifier and the measurement
amplifier all contribute significantly to the noise floor of the system in certain parts
of the frequency range. Figure adopted from (6).
noise produced by the amplifier adds it’s part to the noise floor. Commonly also
a measurement amplifier is used which also increases the noise floor. Figure 3.1
illustrates the noise produced by different components in a microphone system. The
noise voltages of the preamplifier and the measurement amplifier are converted to
equivalent SPL by dividing the noise voltage by the sensitivity of the microphone.
Preamlifier produces most noise at low frequecies while the measurement amplifier
noise is strongest at high frequencies. Both are the same order of magnitude. The
noise produced by the microphone itself is dominating from about 200 Hz to 10 kHz.
(6)
The noise generated by the preamplifier is a significant noise source. It may be
regarded that the preamplifier noise is composed of two main parts. One source is a
CHAPTER 3. MICROPHONES AND REFLECTORS
14
40
dB
re. 20 µPa
35
30
1/4" (4 mV/Pa)
25
20
15
1/2" (12.5 mV/Pa)
10
5
1/2" (50 mV/Pa)
0
1/1" (50 mV/Pa)
–5
– 10
8
16
31.5
63
125
250
500
1k
2k
4k
8k
16 k
31.5 k 63 k
Frequency (Hz)
Figure 3.2: Comparison of 1/3 octave noise spectra of microphone systems equipped
with different size microphones. Figure adopted from (6).
low frequency noise originating from the input circuit of the preamplifier. The voltage
of this noise is inversely proportional to both the frequency and the capacitance of
the microphone. The other noise source is related to the amplifier and has a flat
voltage spectrum in the entire operational range. (6)
Because the low frequency noise generated in the preamplifier is a function of
microphone capacitance, the bigger the microphone diaphragm the lower the noise
that is generated in the preamplifier. It can be noted that increasing capacitance
leads to higher sensitivity, and thus, the aforementioned relation can also be thought
in terms of sensitivity. For small microphones in sizes such as 1/4” and 1/8”, only
the preamplifier noise needs to be taken into account because it dominates over the
microphone noise. In large and most sensitive microphones the microphone noise is
most significant. A comparison of noise spectra for microphone systems equipped
with different size microphones is shown in figure 3.2. (6)
CHAPTER 3. MICROPHONES AND REFLECTORS
15
The miniature microphones used in common OAE measurement probes can be
electret consenser microphones, which unlike normal air condenser microphones, do
not require a polarizing voltage supplied from the measurement amplifier. Instead,
a permanently charged material provides the electric field between the diaphragm
and the backplate. The material can be used in the backplate or in the diaphragm
(10). In a study comparing different microphone types it was concluded that the
electret microphone type is slightly quieter at low frequencies than a comparable air
condenser microphone due to absence of a polarization voltage resistor (48).
In this work it was assumed that a remarkably bigger sized air condenser microphone
outperforms a miniature electret condenser microphone in terms of lower inherent
noise. Our experimentation with the noise levels between a 1” low-noise air condenser
microphone and one conventional OAE measurement probe microphone support this
claim. A plot of the noise levels from this experimentation made in an anechoic
chamber is shown in figure 3.3. The 1” air condenser has clearly lower noise at
most frequencies. The background noise of the anechoic chamber is included in the
plots and the inherent noise of the microphones can be lower at some frequecies.
Nevertheless the measurement conditions were similar for the both microphones.
25
ER-10B+
G.R.A.S. 40HF
20
15
SPL, avg (dB)
10
5
0
-5
-10
-15
-20
10 2
10 3
10 4
Frequency (Hz)
Figure 3.3: Inherent noise comparison between a conventional OAE probe microphone (ER-10B+) and a 1” low-noise instrumentation microphone (G.R.A.S. 40HF)
measured in an anechoic chamber. SPL is a 10s average.
CHAPTER 3. MICROPHONES AND REFLECTORS
3.2
16
Ellipsoidal reflectors
Ellipsoidal reflectors can be used to collect and focus acoustical waves in a similar
manner as a reflector of a spotlight focuses light. Ellipsoidal reflectors have been
used in acoustics for example in medical applications to focus pressure pulses to
break up, e.g., kidney stones (1, 47) and underwater acoustics (32).
f1
f2
Figure 3.4: Principle of the focusing effect of an elliptical shape. A ray originating
from one focal point (f1) reflects to the other focal point (f2) independent of the part
on the elliptical shape where it reflects.
The principle of the reflecting properties of an elliptical shape can be seen in figure
3.4. A ray originating from the other focus (f1 in picture) refclects to the other focus
(f2) independent of the part on the elliptical shape it collides and reflects from. An
elliptical object can naturally be truncated. Then all the rays originating from one
focus reflect to the other focus if they collide to the objects surface as seen in figure
3.5. Basically it is possible to truncate the elliptical shape where desired and the
remaining surface has the reflection properties of an elliptical shape. In practical
applications reflectors are naturally three-dimensional and most commonly axially
symmetric. Rotating the truncated ellipse in figure 3.5 around the axis going through
the two focal points would result in an axially symmetric ellipsoidal reflector.
In figure 3.6 an ellipse and it’s two axes are presented. The distance f from the center
of the ellipse to one of the focal points can be calculated by
√
f = a2 − b 2 ,
where a is the length of the semi-major axis and b the length of the semi-minor
axis. The distance a reflected ray originating from focus travels to the other focus is
constant. That is, in an ellipsoidal reflector every reflected ray from one focal point
to the other travel the same distance between the foci. Travel distance of a ray is
equal to the length of the major axis, i.e., semi-major axis times two
n + v = 2a.
This also means that rays with equal propagation velocity arrive to the other focus
at the same time. This is a good quality for a reflector because there will be no
CHAPTER 3. MICROPHONES AND REFLECTORS
17
f1
f2
Figure 3.5: Principle of the focusing effect of a truncated elliptical shape. If a ray
originating from the other focus (f1) collides surface of this truncated elliptical shape
it reflects to the other focus (f2).
b
f1
a
f2
f
n
v
Figure 3.6: Ellipse dimensions and ray travel distance.
cancellation of frequencies due to phase differences of the arriving waves. Therefore
there is no coloration in this sense. Only the small part of the waves that go directly
from one focus to another without reflecting come there at a different time. How
much sooner the direct part of waves arrive depends on the reflector’s dimensions.
Chapter 4
Methodology and experiments
4.1
4.1.1
Design of a reflector used to measure
otoacoustic emissions
Design of the reflector
A reflector was to be designed to collect and focus sound coming from the ear for
measuring otoacoustic emissions. Sound emanating from the ear canal will naturally
spread to all directions available limited by the head and the pinna. A reflector
should gather as much of this sound as possible. Frequency response of the reflector
should be flat and the impulse response as short as is conceivable. In an ellipsoidal
reflector all reflecting waves originating from one focal point arrive to the other focal
point at the same time if they have the same propagation velocity (see sect. 3.2).
This is a good feature for a reflector since there are no phase differences among the
reflected waves to cause coloration. Only the small amount of direct sound arriving
before the reflected sound can cause coloration. An ellipsoidal reflector was decided
to be made for measuring otoacoustic emissions. The ear canal opening would be
placed to the other focus and microphone to the other. The reflector would have the
shape of a prolate ellipsoid. A prolate ellipsoid is an ellipsoid of revolution respect
to the axis running trough the focal points, i.e. an axially symmetric ellipsoid.
An ellipsoidal reflector can be truncated to fit specific needs. Naturally this results
in less reflecting surface. If a minimally truncated ellipsoidal reflector is used the
sound will at least partly be reflected back to the ear canal and then back to the
microphone due to reflections from the subject and diffraction from the edge of the
reflector, as shown with dashed line in figure 4.1. This would result in longer impulse
response and create modes to the frequency response. If the reflector is truncated
more, less reflections going back and forth will result and therefore shorter impulse
response and milder modes in the frequency response can be obtained.
To test the effect of truncating the reflector in practice a modular structure was
18
CHAPTER 4. METHODOLOGY AND EXPERIMENTS
19
ellipsoidal reflector
220 mm
ear canal
microphone
460 mm
Figure 4.1: General idea of using a truncated ellipsoidal reflector in recording of
otoacoustic emissions. The ear canal and the microphone capsule are at the focal
points of the reflector. The distance between the foci is 404 mm.
designed. The reflector was designed to break into three parts: two halves and a
detachable end. The two halves are boat-like shapes that would result from cutting
a prolate reflector in half lengthwise along the axis of symmetry. The detachable
end was a end cap to the microphone-side of the reflector to test how much the area
behind the microphone causes reflections. This kind of reflector functions also as
a resonator due to the sound reflecting back and forth as was shown in figure 4.1.
The amount of these reflections can be decreased by using the reflector without the
detachable end thus reducing the resonance.
Three reasonable reflector setups could be put together from the three parts. The
three setups are shown with microphone and test subject placement in figure 4.2.
The setup (a) was used to determine the gain achieved with the most truncated half
reflector, which would not reflect the sound back to the ear canal. The two other
setups were used to measure the gain with less truncation and to test the amount of
reflections that result from the detachable end of setup (c).
Many manufacturing techniques and materials were considered when designing a
reflector. These included vacuum forming of plastic, using a computer-controlled
cutting machine (CNC router) and 3D printing. Vacuum forming would have been
most likely done over a CNC router cut mold. At first a fairly small reflector was
decided to be made and possibly try with a different size later. Also the reflector was
decided to be small enough to fit above a test subjects shoulder while one ear would
be at the focal point of the reflector. Reflector’s inner surface length was decided to
be 460 mm and width 220 mm as seen in figure
√ 4.1. Distance between the two foci
now resulted to be approximately 404 mm (= 4602 − 2202 ).
The most reasonable manufacturing technique for this size model was to directly cut
the ellipsoid form to a material by a CNC router. 400 kg/m3 density polyurethane
foam model board was decided as the material which was believed to be hard and
dense enough for the purpose. It is also sufficiently stable material to avoid any
deformations. The surface of the milled parts were smoothed by sanding them. The
reflection coefficient of the surface was further increased by painting it with a glossy
black paint containing a hardening agent (the result is similar to a car paint surface).
The paint also gave a smooth and uniform coating to the porous material. The
CHAPTER 4. METHODOLOGY AND EXPERIMENTS
(a)
20
(b)
(c)
Figure 4.2: Different reflector assemblies with test subject and microphone positioning:
(a) half reflector, (b) two-sided reflector, (c) two-sided reflector with detachable end.
manufactured reflector parts are shown in figure 4.3.
4.1.2
Reflector gain measurements
Measurements were made to study how much gain the manufactured reflector gives at
different frequencies. A swept sine method (12) was used to measure the impulse and
frequency response of the reflector. A small mobile device loudspeaker (diaphragm 9
x 14 mm) was placed to the other focus of the reflector and a 1/2” measurement
microphone (G.R.A.S 40 HT system) was place to the other focus. A photograph of
the measurement setup is seen in figure 4.4.
Measurements were made in the big anechoic chamber of the Department of Signal
Processing and Acoustics in Aalto University. Sweep measurements were done with
Fuzzmeasure software.
Impulse responses of three different reflector assemblies were measured. The three
assemblies were as in figure 4.2: a half reflector, a two-sided reflector and the twosided reflector with a detachable end. The measurement was done with both the
sweep reproducing speaker and the microphone at focal points for all the three
reflector assemblies. In addition the effect of displacing the speaker from the focus
was measured for all the reflector assemblies. The displacement was in the range
CHAPTER 4. METHODOLOGY AND EXPERIMENTS
21
(a)
(b)
(c)
Figure 4.3: Manufactured reflector parts: (a) all parts separately, (b) two halves
together and the detachable end alongside, (c) detachable end with the microphone
mounted in place
between 1 to 3 cm and was done in two directions: along the axis going through the
focal points and perpendicularly to that axis. As a comparison a free field response
was measured at the distance of the focal points (404 mm) and at 3 cm distance.
A first order RC high-pass filter was employed in the speaker wiring to protect the
speaker due to limitations of specifying the frequency range of the measurement
sweep in the software. The cutoff frequency of the high-pass filter was approximately
at 200 Hz and the roll-off was 6 dB per octave.
CHAPTER 4. METHODOLOGY AND EXPERIMENTS
22
Figure 4.4: Picture of the setup used to measure the responses of the reflector. The
mobile device speaker element mounted in an enclosure made out of a plastic tube,
play dough and foam is on the left side of the reflector. The measurement microphone
is on the right side.
4.2
4.2.1
Otoacoustic emission measurements
Organization of the experiments
The experiments were carried out by the author at the Department of Signal Processing and Acoustics in Aalto University School of Electrical Engineering. A
measurement setup was built to an anechoic chamber to achieve a minimal background noise. The measurement was done in a such way that a test subject could
lay down his other ear pointing upwards. The developed reflector was mounted
above the ear and the ear canal opening was positioned to one of the reflector’s
focal points. A low-noise large diaphragm microphone (G.R.A.S. 40HF system) was
positioned to the reflector’s other focal point. Stimuli were reproduced using two
active Genelec 8020B active speakers. The measurement equipment is shown in
picture 4.5. The stand holding the reflector featured vibration isolating foam to
mitigate noise originating from test subject’s body and movements from entering
the microphone.
A comparison measurement was made by sealing the ear canal of the test subject
with an earplug and repeating the measurement. This was done to examine if the
two tone distortion products originate from the ear or possibly from somewhere
else in the measurement setup. Also an additional measurement was done with
CHAPTER 4. METHODOLOGY AND EXPERIMENTS
23
Figure 4.5: The setup from preliminary measurements shows the equipment used
in the final measurements with the difference of speaker model and placement. An
artificial head is placed under the reflector to demonstrate the position of the test
subject. The measurement microphone is inserted inside the reflector from the other
end. The reflector is mounted to a three-legged stand featuring vibration isolating
foam.
an alternate stimulus producing speaker positioning related to measurement of
distortion originating from the measurement system itself (see sec. 4.2.4). This
was done with the author as a test subject to observe the possible changes in the
results. The measurement procedure was identical to the principal measurement.
The given measurement was done without confirming the stimuli levels at the
destination position and thus the results are not directly comparable to the principal
measurement.
Two male test subjects aged 28–30 participated the measurement which includes the
author. Letters A and S are used as identifiers for the author and the other test
subject respectively.
4.2.2
Stimulus parameters
Distortion product otoacoustic emissions (DPOAEs) were evoked by two continuous
pure tones. f1 was the lower tone and f2 the higher. Selected frequencies were [f1 =
2250 Hz, f2 = 2745 Hz] and [f1 = 3000 Hz, f2 = 3660 Hz]. In this work a stimulus,
CHAPTER 4. METHODOLOGY AND EXPERIMENTS
1.
2.
3.
4.
24
L1 [dB] L2 [dB]
65
65
60
52.5
55
40
50
27.5
Table 4.1: Levels of the primary tones f1 and f2 .
a tone pair, is repeatedly referred to by naming only the frequency of the higher
frequency f2 . The ratio of the frequencies f1 and f2 was 1.22 for the both stimuli.
Both primary tones were reproduced at four different levels. Levels of the tones f1
and f2 are referred to as L1 and L2 respectively. Levels were calibrated by placing a
G.R.A.S 46AF measurement microphone directly under the reflectors focal point and
measuring the sound pressure level of the stimulus tone. Speaker gains were adjusted
to meet the desired levels. The levels of the primary tones are shown in table 4.1.
These levels follow a paradigm L1 = 0.4 L2 + 39 dB used in other experiments in the
field (25, 18). Stimulus tones were generated using a LabView software controlled
National Instruments hardware. Length of the stimulus was 65 seconds of which
approximately a 60 second long window starting 2.5 s after stimulus onset was
analyzed.
4.2.3
Data collection and analysis
The microphone signal was fed from the microphone’s preamplifier to a National
Instruments PXI-4461 card controlled by LabView software and recorded with a
script programmed for the purpose. Further data analysis was done in Matlab
software environment.
Recordings were averaged in the time domain and then analyzed with an FFT
algorithm. The same size window was used for both the averaging and the FFT
analysis. The window size was selected so that an integer number of periods of
the both tones in the stimulus fit exactly to the window. This allowed the use of
rectangular window which enabled studying accurate levels and frequencies in the
FFT analysis. At stimulus frequency [f1 = 2250 Hz, f2 = 2745 Hz] the length of the
window was 6860 samples which is about 156 ms at the 44.1 kHz sample rate that
was used. At the other stimulus frequency [f1 = 3000 Hz, f2 = 3660 Hz] the length
of the window was 7350 samples which is about 166 ms.
Approximately a 60 second part of the 65 second long stimulus was analyzed starting
2.5 s after stimulus onset. 386 windows were averaged in the time domain at the
stimulus frequency f2 = 2745 Hz and 360 windows at the stimulus frequency f2 =
3660 Hz.
A signal-to-noise ratio (SNR) criterion was decided for accepting only otoacoustic
emissions at sufficiently high level above the background noise as valid emissions.
CHAPTER 4. METHODOLOGY AND EXPERIMENTS
25
Noise level was defined as the mean value of the FFT bins surrounding the emission
bin at a 1/12–octave band. That is, the frequency of the emission was the center
frequency of the 1/12–octave band. Emission was accepted to be valid if the level of
the frequency bin containing the emission was 12 dB above the noise level.
4.2.4
System distortion measurement
Measurements conducted with exactly the same stimulus parameters as in the
DPOAE measurements, but without a test subject, were done to study the amount
of distortion in the measurement setup itself. Also analysis was done with the
same parameters as in the DPOAE measurements and are described in section
4.2.3. In addition, the effect of alternate loudspeaker positioning to the amount of
distortion was experimented. In the principal test setup the two stimulus producing
loudspeakers were placed side by side on the same stand. In the alternate test setup
one of the two loudspeakers was taken from the stand and placed 2 meters away from
the other speaker on a block of foam, while keeping the distance from the reflector’s
focus (’stimulus destination’) the same 2.5 m. Also the input gains of the stimuli
were kept the same.
Chapter 5
Results
5.1
5.1.1
Reflector gain measurement results
Focus to focus gain
Measured and unsmoothed magnitude responses of different reflector setups are
presented in figure 5.1. All of the responses are divided piecewise with the free field
response measured at the distance of the focal points of the reflector, i.e. the difference
has been plotted to the graph. This subtracts the response of the loudspeaker (and
the microphone) from the plot. Hence the free field response is a straight line at 0
dB. Also a free field response at 3 cm distance is added as a comparison.
Full reflector with all parts has strong modes in the response at frequencies below 2
kHz, a strong notch with a depth of about 10 dB between 3.5 – 4.5 kHz and a rippled
response at frequencies above 7 kHz. The amplification between 100 – 2500 Hz and
above 4.5 kHz is 20 dB at minimun compared to free field at the same distance. The
strong modes below 2 kHz peak at frequencies of 172 Hz, 612 Hz, 1016 Hz, 1397 Hz
and 1778 Hz.
The two-sided reflector comprised of two halves has slightly less pronounced modes
than the full reflector and less amplification at frequencies below 2 kHz. The response
shows less ripples in general. There are some notches around 2.5 kHz and 6.5 kHz.
There is a double peak at [280 Hz, 360 Hz] and other mode peaks below 2 kHz are
at 744 Hz and 1188 Hz. The single-sided (i.e. half) reflector has overall lower gain
and less ripples in the magnitude response compared to the other reflector setups.
The full reflector gives higher gain compared to the 3 cm distance free field response
at nearly all frequencies above 5 kHz. The response of the two-sided reflector has
also higher gain than the 3 cm free field response at 5 kHz, but then has a notch
between about 6–7 kHz. From 7.3 kHz and above the two-sided reflector has higher
gain compared to the 3 cm free field response. The single-sided reflector response
reaches the amplification of the 3 cm free field response from about 11 kHz on.
26
CHAPTER 5. RESULTS
27
70
Full
Two-sided
Single-sided
Free field 3 cm
Free field focal distance
60
Relative level [dB]
50
40
30
20
10
0
-10
128
256
512
1024
2048
4096
8192
16384
Frequency [Hz]
Figure 5.1: Magnitude responses of the gain measurements between the two foci.
Gain responses of the different reflector setups plotted as a difference to the free field
response at the same distance.
The energy decay curves measured for different reflector setups can be seen in figure
5.2. In the case of the full reflector the curve shows strong reflections spaced about
2.7 ms in time decaying only 6 dB between the first reflections. It takes 28.6 ms for
a reflection decayed by 40 dB. The reflections decay faster for the two-sided reflector
the first reflection being over 10 dB lower in level and decaying at slightly lower pace
from there on. In the case of the single-sided reflector the decay of the reflections
happen very fast within few milliseconds. The first reflection is already about 27 dB
lower in level and the second 50 dB lower compared to the direct sound.
5.1.2
Effect of offset from the focus
The effect of displacing the sound source perpendicularly away from the reflector’s axis
of symmetry (line going through the focal points) was measured. The measurement
microphone was always at the other focal point of the reflector. Figure 5.3 shows
gain measurements responses from the full reflector with the loudspeaker at focus
and with an sideways offset of 1, 2 and 3 cm (the diaphragm size of the loudspeaker
CHAPTER 5. RESULTS
28
0
Full
Two-sided
Single-sided
Energy [dB]
-10
-20
-30
-40
-50
0
5
10
15
Time [ms]
Figure 5.2: Energy decay curves for different reflector setups.
is 9x14 mm). The curves are smoothed with 1/12 octave smoothing. The four
different responses are identical from 128 Hz to 2 kHz. The curve of 3 cm offset
starts to deviate from the others after 2 kHz, showing lower or similar gain compared
to the others from there on. The given response has a remarkably stronger notch
than other responses at 4 kHz and the gain decreases clearly from 10 kHz on with
an eminent difference compared to the other responses. Response from the 2 cm
offset shows similar behaviour to the 3 cm offset response, but with larger gain on
most frequencies. The 1 cm offset response starts to deviate slightly from the focal
response around 4 kHz and deviates more when reaching 10 kHz and frequencies
above.
An offset of the sound source along the line going through the reflector’s focal points
was also measured. Responses of these measurements are shown in figure 5.4 with
focal response, an offset of 2.5 cm from the focus towards the microphone and an
offset of 3 cm to the other direction away from the microphone. At frequencies below
6.5 kHz the smallest distance between the sound source and the microphone results
in greatest gain although being offset from the focus. From 6.5 kHz on the in focus
response has the highest gain at most frequencies. The response measured with offset
away from the microphone features a larger gain between 7–9 kHz than the other
offset response, but has lower gain at all other frequencies compared to the other
two responses. This response also has a deep notch at about 4 kHz.
CHAPTER 5. RESULTS
29
70
Focal
1 cm off sideways
2 cm off sideways
3 cm off sideways
Free field focal distance
60
Relative level [dB]
50
40
30
20
10
0
-10
128
256
512
1024
2048
4096
8192
16384
Frequency [Hz]
Figure 5.3: The effect of offsetting the sound source away from the axis going through
the focal points of the reflector. 1/12 octave smoothing was employed in the curves.
70
2,5 cm offset towards
Focal
2 cm offset away
Free field focal distance
60
Relative level [dB]
50
40
30
20
10
0
-10
128
256
512
1024
2048
4096
8192
16384
Frequency [Hz]
Figure 5.4: The effect of offsetting the sound source along the axis going through
the focal points. 1/12 octave smoothing was employed in the curves.
CHAPTER 5. RESULTS
5.1.3
30
Discussion of the acoustical characteristics of the reflector
The results show that the full reflector expectedly yields the highest amplification.
The promiment reflections at about 2.7 ms intervals clearly limit the use of a such
reflector in time-domain measurements such as measuring TEOAEs. The travel time
of a sound wave from one focus to the other focus and back is 2.68 ms which matches
the reflection interval. Therefore the reflecting likely occurs between the sound source
and the microphone. To increase the interval between reflections the reflector’s
dimensions should be made larger. If 10 ms time window between reflections would
be desired, the reflector’s dimensions should be made four times larger. A 10 ms
time window already could be applicable for TEOAE measurements.
The two-sided reflector, which is the more truncated version, has faster decaying
temporal response, but still strong enough first reflections to make it unlikely
applicable in time-domain OAE measurements in this size. The response of the
single-sided reflector decays fast and would be in this respect perfect for time-domain
measurements, but unfortunately it has low gain at low frequecies, which limits it’s
applicability. 1–4 kHz being the common band in TEOAE measurements, at 1 kHz
the difference to the 3 cm free field response is about 12 dB and at 4 kHz 6 dB.
The full and two-sided reflector setups could be used in measurements where the
temporal response is not that critical such as DPOAE measurements, where two
continuous sinusoids are used as a stimulus and the stimulus and OAE are present
at the same time. The response of the full reflector has higher gain compared to the
3 cm distance free field response also at low frequencies due to resonance peaks, but
above frequencies about 5 kHz the gain is nearly always higher. DPOAEs seem to be
generally measured at frequencies up to little over 6 kHz, which means the frequency
of the higher stimulus tone f2 . DPOAE data exists also from measurements at f2 =
8 kHz in a research application (44). The optimum frequency ratio f2 /f1 in DPOAE
generation from humans being about 1.2 the 2f1 – f2 frequency component would
be at frequency 3.84 kHz with f2 = 6 kHz and at 5.33 kHz with f2 = 8 kHz. At
frequency 5.3 kHz the full reflector’s response has already about 3 dB higher gain
compared to the free field response at 3 cm distance. Other DPOAE components at
frequencies higher than the stimulus are evidently at the optimum frequencies to be
recorded with this kind of reflector. For example the third order component 2f2 – f1
is above 5 kHz for stimulus frequecies f2 = 4.3 kHz and above.
One could also utilize the gain provided by the strong resonance peaks at frequencies
below 2 kHz when measuring OAEs of which frequencies are accurately predicted
such as DPOAEs or SFOAEs. It has to be noted that in the given reflector design
placing the test subjects ear canal to the reflector’s focus brings the test subject’s
head close to the reflector’s opening and thus can alter the acoustic load and the
resonant frequecies. This may bring unwanted variability to the test conditions if
employing such a technique.
The offset of sound source away from the reflector’s axis of symmetry starts to affect
CHAPTER 5. RESULTS
31
the frequency response above 2 kHz frequencies as seen in the results. Offset along
the reflector’s axis of symmetry affects the shape of the response above about 2 kHz
frequecies, while below this frequency the offset only causes a shift in gain for this
part of the response related to the distance from the microphone. These factors
have to be taken into account in the margin of error in measurements when absolute
values of emission level are of interest. However these measurements do not have
the information of the transfer function from the ear to the microphone to deduce
absolute SPLs emanating from the ear canal.
5.2
Otoacoustic emission measurement results
5.2.1
Comparison of open and sealed ear in DPOAE measurements
SPL [dB]
Sealed ear
Open ear
60
60
40
40
20
20
0
0
-20
-20
-40
-40
-60
-60
-80
1
2
4
6
Frequency [kHz]
8 10
-80
1
2
4
6
Frequency [kHz]
8 10
Figure 5.5: Sound spectra measured with the reflector with two loudspeakers producing sinusoids at 3000 Hz and 3660 Hz and the subject lying under the reflector with
his ear canal approximately at the focal point. Left: response with sealed ear canal.
Right: response with open ear canal, where most prominent otoacoustic emissions
are marked with red circles.
An FFT analysis from the DPOAE measurements for subject S is shown in figure
5.5. The case of an ear sealed with an earplug is shown on the left side and the case
of an open ear is shown on the right side. The frequency components of the stimulus
and distortion components from the speakers are shown in both cases. In the case of
an open ear, some otoacoustic emissions are clearly shown. Most prominent ones
of them corresponding to frequencies 2f1 – f2 and 2f2 – f1 . These are third order
two-tone intermodulation distortion components.
CHAPTER 5. RESULTS
32
In figures 5.6 and 5.7 similar measured data as in the previous figure (5.5) is shown
from the two test test subjects at two different stimulus frequencies. Subfigures (b)
and (c) both show the data from subject A with different speaker positioning which
affected the distortion originating from the measurement system (see sec 5.2.3). This
was an additional experiment and the cases (b) and (c) are not strictly comparable
because it was not verified that in case (c) the speakers produce the same level
stimuli as in case (b). Yet, the speakers gains were kept intact and the distance of
the speaker is approximately the same.
DPOAE, f2 = 2745 Hz
(c)
SPL [dB]
(b)
SPL [dB]
(a)
SPL [dB]
Sealed ear
Open ear
50
50
0
0
-50
-50
50
50
0
0
-50
-50
50
50
0
0
-50
-50
1
2
4
6 8 10
Frequency [kHz]
1
2
4
6 8 10
Frequency [kHz]
Figure 5.6: Sound spectra measured with reflector from all the subjects at f2 = 2745
and f2 /f1 = 1.22. The stimulus level was approximately L1 = L2 = 65 dB. On the
left the measurement with a sealed ear and on the right the measurement with an
open ear. (a) test subject S, (b) test subject A, (c) test subject A with a different
positioning of the stimulus producing speakers.
The FFT analysis of the measurements at a stimulus frequency of f2 = 2745 Hz in
figure 5.6 show frequency components of the stimulus and distortion components
CHAPTER 5. RESULTS
33
from the speakers (or elsewhere in the measurement system) in the case of a sealed
ear on the left side. In addition to these frequency components DPOAEs are shown
in the case of the open ear on the right side. The stimulus level was the highest that
was used L1 = L2 = 65 dB. 2f1 – f2 frequency components are marked with a red
circle. The SNR of this component for subject S was 9.4 dB and did not satisfy the
set 12 dB SNR criterion. For subject A the SNR criterion for component 2f1 – f2
was met in both cases shown in the figure. Interestingly the frequency component
2f2 – f1 from subject S is shown also in the case of sealed ear. This indicates that
the component arose from the measurement system itself and is shown in section
5.2.3. The system distortion measurements showed also that the 2f1 – f2 distortion
can be produced by the measurement system itself when there is no test subject.
In figure 5.7 the FFT analysis from measurements at the highest stimulus level
at frequency f2 = 3660 Hz is shown. Frequency components on the right side
marked with red circles are third order intermodulation components corresponding
to frequencies 2f1 – f2 and 2f2 – f1 . They have emerged when there is no earplug
and are thus clearly DPOAEs. SNR values for these DPOAEs are 29 dB or greater
fullfilling the 12 dB criterion. For subject A the levels of the 2f2 – f1 components in
open ear case are similar to the levels of the noise around the given component in
the sealed ear case, and are thus neglected. Also other third order components and
one fifth order intermodulation component marked with blue circles were present
when the ear canal was open. The SNRs of these components were 17 dB or higher.
CHAPTER 5. RESULTS
34
DPOAE, f2 = 3660 Hz
(c)
SPL [dB]
(b)
SPL [dB]
(a)
SPL [dB]
Sealed ear
Open ear
50
50
0
0
-50
-50
50
50
0
0
-50
-50
50
50
0
0
-50
-50
1
2
4
6 8 10
Frequency [kHz]
1
2
4
6 8 10
Frequency [kHz]
Figure 5.7: Sound spectra measured with reflector from all the subjects at f2 = 3660
and f2 /f1 = 1.22. The stimulus level was approximately L1 = L2 = 65 dB. On the left
the measurement with sealed ear and on the right the measurement with open ear.
(a) test subject S, (b) test subject A, (c) test subject A with a different positioning
of the stimulus producing speakers.
CHAPTER 5. RESULTS
5.2.2
35
DPOAE growth
DPOAE measurements were done at two different stimulus frequencies and both
frequencies were measured at four different stimulus levels which can be found in
table 5.1. Figure 5.8 shows the levels of 2f1 – f2 DPOAE component resulting
from the four different stimulus levels used at two different stimulus frequencies.
These input/output functions present the level of the given DPOAE component as a
function of growing stimulus level L2 . DPOAE levels are depicted with an orange
line while the height of the grey area presents the noise level. Red circles denote
that the DPOAE level satisfies the 12 dB SNR criterion. A dark line with a slope of
0.39 has been added as a reference for linear growth in every plot. The slope of the
given line is determined by calculating the slopes between the first two points of the
DPOAE growth functions obtained with the f2 = 3660 Hz stimulus and taking the
mean of these three slopes.
1.
2.
3.
4.
L1 [dB] L2 [dB]
65
65
60
52.5
55
40
50
27.5
Table 5.1: Levels of the primary tones f1 and f2 .
In the DPOAE measurements at frequency f2 = 2745 Hz only 1/3 of the measurement
points fullfill the 12 dB SNR criterion and are considered valid. The level functions
obtained are nonmonotonic for subject S and A in the normal test setup (figures a
and b), but both only contain one valid data point.The I/O function at frequency f2
= 2745 Hz increases monotonically for subject S with the alternate measurement
setup.
Out of the DPOAE measurements done at a frequency f2 = 3660 Hz there is one
nonvalid measurement at the lowest stimuli level for subject A with the alternate
measurement setup. All the other points fullfilled the 12 dB SNR criterion. At the
given stimulus frequency all the obtained I/O functions increase monotonically. For
subject S the slope of the I/O function decreases slightly after the second highest
stimulus level L2 = 52.5 dB, thus exhibiting compressive behaviour. For subject A
the slope decreases after the third highest stimulus level, but increases again on from
the second highest stimulus level. For subject A with the alternative test setup the
I/O function is roughly linear.
CHAPTER 5. RESULTS
36
(a)
SPL [dB]
f 2 = 2745 Hz
0
-10
-10
-20
-20
-30
-30
-40
-40
30
SPL [dB]
0
(b)
SPL [dB]
40
50
60
L
0
-10
-10
-20
-20
-30
-30
-40
-40
30
0
(c)
f 2 = 3660 Hz
0
40
50
60
0
-10
-10
-20
-20
-30
-30
-40
-40
30 40 50 60
Stimulus level L2 [dB]
dp
ref
30
40
50
60
30
40
50
60
30 40 50 60
Stimulus level L2 [dB]
Figure 5.8: Measured DPOAE levels from all the test subjects. Level of 2f1 – f2
distortion component, f2 /f1 = 1.22. Orange line: DPOAE levels, dark line: linear
growth reference, grey area height: noise levels. Red circles denote that the emission
level satisfies the 12 dB SNR criterion (sect. 4.2.3). (a) test subject S, (b) test
subject A, (c) test subject A with a different positioning of the stimulus producing
speakers.
5.2.3
System distortion
System distortion measurements were done with the exactly same measurement
setup and stimulus levels as in the DPOAE measurements but without a test subject.
Figure 5.9 shows an FFT analysis of the measurement at stimulus frequency f2 =
2745 Hz at the highest level L1 = L2 = 65 dB. The plot on the left shows the result
from the measurement with the loudspekers next to each other and the plot on the
right side shows the result after moving one of the speakers 2 meters away from the
other. Frequency components that arise from intermodulation distortion are marked
to the plot. Two second order intermodulation components at frequencies f2 – f1 and
CHAPTER 5. RESULTS
37
Intermodulation distortion, f2 = 2745 Hz
SPL [dB]
Speakers next to each other
Speakers 2 m apart
60
60
40
40
20
20
0
0
-20
-20
-40
-40
-60
-60
-80
1
2
4 6 8 10
Frequency [kHz]
-80
1
2
4 6 8 10
Frequency [kHz]
Figure 5.9: Comparison of speaker intermodulation distortion with two different
speaker positionings without a test subject. Left figure: speakers positioned next
to each other on the same stand. Right figure: speakers positioned on a 2 m
distance from each other. Red circles denote second order intermodulation distortion
componets, blue circles third order components, a yellow circle a one fourth order
component and pink circles fifth order components.
f2 + f1 are marked with red circles; third order components at 2f1 – f2 , 2f2 – f1 , 2f1
+ f2 and f1 + 2f2 with blue circles; one fourth order component at 3f2 – f1 with a
yellow circle and fifth order components at 3f2 – 2f1 and 4f2 – f1 by pink circles.
The results show that the level of most of the given intermodulation components
decreased visibly when the speaker was moved further away, some of them disappearing completely to the noise floor. From the third order components 2f1 – f2 and
2f2 – f1 that are observed in our DPOAE measurements, the level of the component
2f1 – f2 reduced by over 17 dB being visibly indistinguishable from the noise floor
(SNR 3 dB) after placing the speakers further away from each other. The level of
the component 2f2 – f1 decreased over 15 dB but still has a SNR of 15 dB.
Figure 5.10 shows results of the system distortion measurement performed at stimulus
frequency f2 = 3660 Hz at the highest level L2 = 65 dB. The left side plot is the
result when stimulus producing loudspeakers are placed side by side and the right
side plot is the result when one of the speakers is moved away from the other to a 2
m distance. The third order distortion components 2f1 – f2 and 2f2 – f1 observed in
the DPOAE measerements do not emerge visibly from the noise floor in either case
with maximum SNRs of 2 dB.
Two second order intermodulation distortion components at frequencies f2 – f1 and
f2 + f1 are marked with red circles to both plots. The other frequency components
CHAPTER 5. RESULTS
38
Intermodulation distortion, f2 = 3660 Hz
SPL [dB]
Speakers next to each other
Speakers 2 m apart
60
60
40
40
20
20
0
0
-20
-20
-40
-40
-60
-60
-80
1
2
4 6 8 10
Frequency [kHz]
-80
1
2
4 6 8 10
Frequency [kHz]
Figure 5.10: Comparison of speaker intermodulation distortion with two different speaker positionings without a test subject. Red circles denote second order
intermodulation distortion components.
that have high levels above the noise floor are harmonic distortion components and
of course, the frequency components of the stimulus itself. The level of the second
order component f2 – f1 decreased almost 24 dB and disappeared to the noise floor
when one loudspeaker was moved further away from the other. The component f2
+ f1 decreased by 32 dB still having a SNR of over 8 dB after relocating the other
loudspeaker. It has to be noted that in the case of loudspeakers positioned further
apart, there is a remarkable 10 dB decrease in the level of the lower tone of the
stimulus entering the microphone inside the reflector. This likely has influence on
the levels of the intermodulation distortion components. It is also likely that there
is a similar difference between stimulus levels at the point where the ear canal of
the test subject is positioned, which should be kept in mind when interpreting the
results from the alternate test setup.
5.2.4
Discussion of the DPOAE measurement results
Taking into account the system distortion information, frequency components of
biological origin could be measured in these measurements, that is DPOAEs. While
the system distortion results at frequency f2 = 2745 Hz show that the 2f1 – f2
distortion component is present even without a test subject, it is present with subject
A also in the alternative speaker positioning where the given distortion component
does not arise from the measurement system itself. Therefore DPOAEs could be
measured at least with the alternative test setup at this stimulus frequency.
CHAPTER 5. RESULTS
39
In the system distortion measurements at stimulus frequency f2 = 3660 Hz no notable
third order intermodulation components emerged. The third order component 2f1 –
f2 was measured in the DPOAE measurements from the both subjects and is clearly
a DPOAE. Also the component 2f2 – f1 was measured from subject S.
Although done with a small sample of subjects and some inherent distortion in the
system, these measurements serve as a proof of concept that recording of OAEs is
possible from a considerable distance by using an ellipsoidal reflector.
Chapter 6
Conclusions
A modular ellipsoidal reflector was developed and it’s applicability to measurements
of otoacoustic emissions (OAEs) was examined. The response measurement data
from the reflector show that clearly the most gain is achieved with the least truncated
reflector setups that have the most closed structure and feature the most reflecting
surface. These setups are suitable for measurements where the temporal response is
not overridingly critical, because there are reflections in the response that take over
10 milliseconds to decay. The least truncated reflector setups could be usable for
the measurements of distortion product otoacoustic emissions (DPOAEs), stimulusfrequency otoacoustic emissions (SFOAEs) and spontaneous otoacoustic emissions
(SOAEs). The frequency responses of these reflector setups have high resonance
peaks which may be inconvenient. The gain with the most closed reflector setup
compared to a 3 cm distance free-field response is nearly always higher above about
5 kHz, which makes measurements at this frequency region desireable. Yet, this
frequency region is at the very upper end of the frequency range where common
OAE measurements are made. With the more truncated setups a similar frequency
limit is at even higher frequencies.
To test the developed setup to its purpose, OAE measurements were performed
to two test subjects of which the author was the other one. DPOAEs could be
measured at two stimulus frequencies. These measurements serve as a proof of
concept that recording of OAEs from a considerable distance is possible with this
kind of a reflector.
The comparison of achievable SNR between the developed setup and conventional
OAE measurement equipment was not done due to limited time available. Further
studies are needed to answer this question.
40
Bibliography
[1] M. R. Bailey. Acoustic and cavitation fields of a pressure release ellipsoidal
reflector. The Journal of the Acoustical Society of America, 101(5):3138–3138,
1997.
[2] C. I. Berlin and L. J. Hood. Current physiologic bases of audiologic interpretation
and management. In J. Katz, L. Medwetsky, R. Burkard, and L. Hood, editors,
Handbook of clinical audiology, chapter 22. Lippincott Williams & Wilkins, 2009.
[3] R. C. Bilger, M. L. Matthies, D. R. Hammel, and M. E. Demorest. Genetic
implications of gender differences in the prevalence of spontaneous otoacoustic
emissions. Journal of Speech, Language, and Hearing Research, 33(3):418–432,
1990.
[4] A. Boul and B. Lineton. Spontaneous otoacoustic emissions measured using an
open ear-canal recording technique. Hearing Research, 269(1–2):112 – 121, 2010.
[5] D. Brass and D. T. Kemp. Time-domain observation of otoacoustic emissions
during constant tone stimulation. The Journal of the Acoustical Society of
America, 90(5):2415–2427, 1991.
[6] Brüel & Kjær. Technical documantation - microphone handbook vol 1: Theory,
1996.
[7] G. v. Békésy. Experiments in hearing. McGraw-Hill, New York, 1960.
[8] P. Dallos. The active cochlea. The Journal of neuroscience : the official journal
of the Society for Neuroscience, 12(12):4575–85, 1992.
[9] H. Davis. An active process in cochlear mechanics. Hearing Research, 9(1):79–90,
1983.
[10] J. Eargle. The microphone book. Focal Press, Woburn (MA), 2001.
[11] J. C. Ellison and D. H. Keefe. Audiometric predictions using stimulus-frequency
otoacoustic emissions and middle ear measurements. Ear & Hearing, 26(5):487–
503, 2005.
[12] A. Farina. Simultaneous measurement of impulse response and distortion with a
swept-sine technique. In Audio Engineering Society Convention 108, Feb 2000.
41
BIBLIOGRAPHY
42
[13] S. A. Gaskill and A. M. Brown. The behavior of the acoustic distortion product,
2f1–f2, from the human ear and its relation to auditory sensitivity. The Journal
of the Acoustical Society of America, 88(2):821–839, 1990.
[14] T. J. Glattke and M. S. Robinette. Otoacoustic emissions. In R. J. Roeser,
M. Valente, and H. Hosford-Dunn, editors, Audiology: diagnosis, chapter 22.
Thieme New York, 2008.
[15] M. P. Gorga, K. Nelson, T. Davis, P. A. Dorn, and S. T. Neely. Distortion
product otoacoustic emission test performance when both 2f1–f2 and 2f2–f1
are used to predict auditory status. The Journal of the Acoustical Society of
America, 107(4):2128–2135, 2000.
[16] R. Hallworth and H. Jensen-Smith. The morphological specializations and
electromotility of the mammalian outer hair cell. In G. A. Manley, R. R. Fay,
and A. N. Popper, editors, Active processes and otoacoustic emissions, chapter 5.
New York : Springer, 2010.
[17] A. Hudspeth. Mechanical amplification of stimuli by hair cells. Current Opinion
in Neurobiology, 7(4):480–486, 1997.
[18] T. Janssen, P. Kummer, and W. Arnold. Growth behavior of the 2 f1-f2 distortion
product otoacoustic emission in tinnitus. The Journal of the Acoustical Society
of America, 103(6):3418–3430, 1998.
[19] T. Janssen and J. Müller. Otoacoustic emissions as a diagnostic tool in a clinical
context. In G. A. Manley, R. R. Fay, and A. N. Popper, editors, Active processes
and otoacoustic emissions, chapter 13. New York : Springer, 2010.
[20] R. Kalluri and C. A. Shera. Distortion-product source unmixing: A test of
the two-mechanism model for dpoae generation. The Journal of the Acoustical
Society of America, 109(2):622–637, 2001.
[21] D. T. Kemp. Stimulated acoustic emissions from within the human auditory
system. The Journal of the Acoustical Society of America, 64(5):1386–1391,
1978.
[22] D. T. Kemp. Physiologically active cochlear micromechanics – one source of
tinnitus. In G. L. David Evered, editor, Tinnitus – Ciba foundation symposium
85, pages 54–81. London: Pitman Books Ltd., 1981.
[23] D. T. Kemp. Otoacoustic emissions: Concepts and origins. In G. A. Manley,
R. R. Fay, and A. N. Popper, editors, Active processes and otoacoustic emissions,
chapter 1. New York : Springer, 2010.
[24] D. T. Kemp and R. Chum. Properties of the generator of stimulated acoustic
emissions. Hearing research, 2(3-4):213–232, 06 1980.
[25] P. Kummer, T. Janssen, and W. Arnold. The level and growth behavior of the
2 f1–f2 distortion product otoacoustic emission and its relationship to auditory
BIBLIOGRAPHY
43
sensitivity in normal hearing and cochlear hearing loss. The Journal of the
Acoustical Society of America, 103(6):3431–3444, 1998.
[26] P. Kummer, T. Janssen, P. Hulin, and W. Arnold. Optimal l1–l2 primary tone
level separation remains independent of test frequency in humans. Hearing
Research, 146(1–2):47 – 56, 2000.
[27] T. Lahti. Akustinen mittaustekniikka. Raportti / Teknillinen korkeakoulu,
sähkötekniikan osasto, akustiikan ja äänenkäsittelytekniikan laboratorio, ISSN
0356-083X ; 38. Teknillinen korkeakoulu, Helsinki, 2. korj. p. edition, 1997.
[28] B. L. Lonsbury-Martin and G. K. Martin. Otoacoustic emissions: Basic studies
in mammalian models. In G. A. Manley, R. R. Fay, and A. N. Popper, editors,
Active processes and otoacoustic emissions, chapter 8. New York : Springer,
2010.
[29] G. A. Manley. Evidence for an active process and a cochlear amplifier in
nonmammals. Journal of Neurophysiology, 86(2):541–549, 2001.
[30] C. D. Motchenbacher and F. C. Fitchen. Low-noise electronic design. Wiley,
New York, 1973.
[31] A. R. Møller. Hearing : anatomy, physiology, and disorders of the auditory
system. Plural Publishing, San Diego (CA), 3rd ed. edition, 2013.
[32] M. Müller. Focusing of shock waves in water by different ellipsoidal reflectors.
AIP Conference Proceedings, 208(1):143–148, 1990.
[33] A. L. Nuttall and D. F. Dolan. Steady-state sinusoidal velocity responses of
the basilar membrane in guinea pig. The Journal of the Acoustical Society of
America, 99(3):1556–1565, 1996.
[34] J. O. Pickles. An introduction to the physiology of hearing. Emerald, Bingley,
4th ed. edition, 2012.
[35] B. Prieve and T. Fitzgerald. Otoacoustic emissions. In J. Katz, L. Medwetsky,
R. Burkard, and L. Hood, editors, Handbook of clinical audiology, chapter 21.
Lippincott Williams & Wilkins, 2009.
[36] W. S. Rhode. Observations of the vibration of the basilar membrane in squirrel
monkeys using the mössbauer technique. The Journal of the Acoustical Society
of America, 49(4B):1218–1231, 1971.
[37] W. S. Rhode and A. Recio. Study of mechanical motions in the basal region
of the chinchilla cochlea. The Journal of the Acoustical Society of America,
107(6):3317–3332, 2000.
[38] M. S. Robinette and T. J. Glattke. Otoacoustic emissions. In R. J. Roeser,
M. Valente, and H. Hosford-Dunn, editors, Audiology: diagnosis, chapter 21.
Thieme New York, 2000.
BIBLIOGRAPHY
44
[39] M. A. Ruggero, N. C. Rich, A. Recio, S. S. Narayan, and L. Robles. Basilarmembrane responses to tones at the base of the chinchilla cochlea. The Journal
of the Acoustical Society of America, 101(4):2151–2163, 1997.
[40] K. S. Schairer, D. Fitzpatrick, and D. H. Keefe. Input–output functions for
stimulus-frequency otoacoustic emissions in normal-hearing adult ears. The
Journal of the Acoustical Society of America, 114(2):944–966, 2003.
[41] K. S. Schairer and D. H. Keefe. Simultaneous recording of stimulus-frequency
and distortion-product otoacoustic emission input–output functions in human
ears. The Journal of the Acoustical Society of America, 117(2):818–832, 2005.
[42] J. Schnupp, I. Nelken, and A. King. Auditory neuroscience : making sense of
sound. MIT Press, Cambridge, Mass, 2011.
[43] C. A. Shera and J. J. Guinan. Evoked otoacoustic emissions arise by two
fundamentally different mechanisms: A taxonomy for mammalian oaes. The
Journal of the Acoustical Society of America, 105(2):782–798, 1999.
[44] J. Syka. Acoustical signal processing in the Central Auditory System. Plenum,
New York, 1997.
[45] C. Talmadge, G. Long, W. Murphy, and A. Tubis. New off-line method for detecting spontaneous otoacoustic emissions in human subjects. Hearing Research,
71(1–2):170 – 182, 1993.
[46] WebMD LLC. WebMD Image collection: Human anatomy, picture of the ear,
2014. http://www.webmd.com/brain/picture-of-the-ear (last checked 12.1.2015).
[47] W. M. Wright and D. T. Blackstock. Focusing of n waves in air by an ellipsoidal
reflector. The Journal of the Acoustical Society of America, 102(2):741–746,
1997.
[48] A. J. Zuckerwar, T. R. Kuhn, and R. M. Serbyn. Background noise in piezoresistive, electret condenser, and ceramic microphones. The Journal of the Acoustical
Society of America, 113(6):3179–3187, 2003.
[49] G. Zweig, R. Lipes, and J. R. Pierce. The cochlear compromise. The Journal of
the Acoustical Society of America, 59(4):975–982, 1976.