Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
4.9, 4.11 Received22 September1971;revised8 November1971 The Central Origin of the Pitch of Complex Tones: Evidence ' from Musical Interval Recognition * A. J. M. HOUTSMA ANDJ. L. GOLI)STEINi Centerfor Communications Sciences, Research Laboratoryof Electronics, Massachusetts Institute o.fTechnology, Cambridge,Massachusetts 02139 The human auditory system'sability to recognizesimplemelodiesthat correspond to fundamentalperiods in sequences ofperiodicsounds devoidoffundamental energywasstudiedthroughmusical intervalidentification experiments.Stimuli comprisingtwo randomlychosensuccessive upper harmonicswere presented both monotically(two harmonicsto oneear) and dichotically(oneharmonicto eachear). Subjectscould recognize melodies equallywellwith bothmodesof stimuluspresentation. The resultsimply that thepitchof thesecomplextonesis mediatedby a centralprocessor operatingon neuralsignalsderivedfrom those effectivestimulusharmonicsthat are tonotopicallyresolved. introducedto auditionby Ohm in 1843 (Plomp and Smoorenburg,1970). This earlier work is discussed An essentialbasis for practicing the art of music is morefully in Secs.V and VI. We reportin this papera the human auditory system'sability to perceivemelody number of new psychophysical experimentsthat were in a sequenceof periodic or near-periodicsounds.In designedwithin a musicalframeworkto explorehow Western culture, a musical note scale has been develthe auditorysystemretrievesand encodes the informaoped which characterizesmusical sounds solely by tion of the fundamentalfrequencyfrom a sequence of their fundamentalfrequencies,and a melodycomprises periodicsounds. a sequenceof noteson sucha scale.Musical instruments are designedin sucha way that when a note is played I. METHODS a periodic or quasiperiodicsoundis generatedwhose fundamental frequencyis designatedby the note. On A. General Paradigm listening to a sequenceof suchmusicalsoundsone can In order to simulate musical behavior as closely as easily retrieve the original series of notes from the the basicprocedurewasto presenta subject auditory sensation,regardlessof the particular Fourier possible, consisting of a numberof spectrum generatedby the instrument. Some instru- with a simplemusicalmessage andto havehim respondby identifying ments (e.g., strings) produce sounds with a rich musicalsounds, or melodies spectrum comprising the fundamental and many it. In caseswherethe setof musicalmessages partials; others (e.g., flute) have little more than the was known to the subjectbeforehand,a responsebox bare fundamental; and still others (French horn) was used; otherwisethe subjectwas askedto write melodyusinga relativenote scale. generatelower noteswith relatively little energy at the downthe perceived The musicalsoundswere complextonescomprising lower partials, including the fundamental (Saunders, upperharmonicsof equalintensity;no 1937, 1946). Alsomelodiesplayedon any giveninstru- two successive ment can easily be recognizedafter the soundhas been energyat the fundamentalfrequencyitselfwaspresent. passed through a bandpassfilter with a bandwidth Despitethefact that morecomplex stimulimighthave much narrowerthan the spectraof the originalsounds; provided better simulation of musical-instrument one can readily observethis when listening to music sounds,theserelativelysimplesyntheticstimuliwere they minimizethe numberof free from an inexpensivetransistorradio.This phenomenon preferredbecause of melody invariance over a large class of spectral stimulusparameters,and offerbetter controlover the transformationshas had profoundimpact on auditory locationof spectralenergyin the effectivestimulus. because a largebodyof scienceever sincethe conceptof Fourier spectrumwas In addition,theywereselected INTRODUCTION 520 Volume 51 Number 2 (Part 2) 1972 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 PITCH OSCILLATOR OF COMPLEX TONES DUAL DUAL CHANNEL CHANNEL ELECTRONIC AUDIO I PDP-4 COMPUTER OSCILLATOR 2 SWITCH AMPLIFIER 'I• ATTENUATOR I • ATTENUATOR[ ANSWER BOX FIG. 1. Equipmentdiagramfor all two-toneexperiments. participatedin somemore descriptiveand qualitative experiments. All had at least somedegreeof musical training and were familiar with musical notation and dictation. A few could not perform the required If the same harmonic numbers were chosen for each identification or recognition tests initially and were note, one couldeasilyrecoverthe melody describedby given sometraining (lessthan 1 h) starting with tone harmonics,which the (missing)fundamentalsmerely by tracking one of complexesof up to six successive the two partials, since the frequenciesof harmonic were then graduallyreducedto the two-tonestimuli partialsare alwaysintegralmultiplesof the fundamental usedin the test experiments. frequency. In this situation, both partials would IDENTIFICATION OF describethe same melody as the fundamental, merely II. BASIC EXPERIMENTS: EIGHT KNOWN MUSICAL INTERVALS transposedupward in key. In order to prevent the subject from using this trivial cue, the harmonic A. General Procedures numberswere randomizedfrom note to note, and hence the subject was required to retrieve the notes from Five basicexperiments werecarriedout havingmany possiblyrelevantpsychophysical and physiologicaldata alreadyexistedfor thesestimuli (Goldstein,1967, 1970; Goldstein and Kiang, 1968; Kiang, 1965; Sachs and Kiang, 1968; Sachs,1969). hearing a sequenceof pairs of randomly chosensucces- features in common. The task of the subject was to siveupper harmonics. identify on eachtrial whichout of eightknowntwo-note melodies(or musicalintervals)was presented[-Fig. 2(a)-]. These simple melodieshad identical envelope B. Apparatus time structure[-Fig. 2 (b)-], and all beganwith the same The apparatusconsistedof two programmableoscilnote.The noteswereapproximatelytunedto the natural lators (General Radio 1161-A and KrohnHite 4031 R), scale,that is, frequencyratiosof 16/15, 9/8, 6/5, and an electronic switch (Grason-Stadler), a two-channel 5/4 for the minor and major second,and minor and audio amplifier, and a set of TDH-39 headphones. major third, respectively.The stimuli representing the notes comprisedtwo successive harmonics,the the headphoneswas better than 50 dB below stimulus number of the lower harmonicbeing chosenrandomly level. (In one experiment, where four successive for eachnote over a rangeof three successive integers harmonicswere needed,we useda third programmable [-Fig.2(c)-]. The middleof the lowerharmonicnumber oscillator, KrohnHite 4031 R, and a dual multiplier, range,•, and the fundamentalfrequencyof the first CBL 47.) Subjectswere seatedin an IAC model 1200 note, f0, werechosenasindependent parametersin measound-insulated chamber. Switches and oscillators were suringidentificationperformanceexpressed as the percontrolledby a DEC PDP-4 computerwhich generated centageof correctresponses. We chosethis particular all random events, performedstimuluscomputations, set of musical intervals, not becausewe assumethey stored responses,and controlled feedback. An equiphaveany inherentsignificance, but merelybecause they ment diagram is shownin Fig. 1. provide, at least in the Western culture, a convenient language which musically trained people can C. Subjects Harmonic and intermodulation distortion measured at understand. Three subjectsparticipatedin the basicexperiments. All weremale, between18 and 31 yearsof age, and had quite extensivemusicaltraining and experience(majors in organ,viola, and singing).All subjectscouldperform the required tests with little special training; test sessionswere limited to a maximum of 2 h daily. An additional sevensubjects,four male and three female, The subjectswerefirst testedin preliminaryexperimentsfor their ability to identify musicalintervalsby using periodic sounds with fundamental energy; a control run of 50 trials was carried out, the stimuli beingsquarewaveswith fundamentalfrequencies equal to the notesin Fig. 2(a). Subjectsweregivena "key" ascribinga numberto eachintervalandwereinstructed The Journalof the AcousticalSocietyof America 521 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 HOUTSMA AND i 1•4• ,4 [x]J '" "' GOLDSTEIN eachtrial,whileNH t•referred nofeedback. A typical I..,1"'-' I..I •'.' 01...11•01,..I i'-' ]'.' 11.4 ]",o II •" o]'.' •1 I I,.l II cl •"',,•] •f(•/fo 5/4 6/5 9/8 16/15 15/168/9 5/6 4/5 run consistedof 50 trials for subjectAH and 25 trials for the other subjects.Fundamentalfrequencieswere incremented in stepsof 100Hz (or sometimes 200 Hz), and for eachfundamentalfrequencya number of runs wastaken,eachtime incrementingthe averageharmonic number ff by one. As many runs were taken as were 500 250 500 TIME (ms) (b) necessaryto make performancedrop from perfect (100%correct)to chance(12.5ø/o correct,corresponding to one out of the eight intervals guessedcorrectly); typicallyaboutsixrunsat eachfundamentalfrequency were required. The resulting psychometricfunctions werethenremappedinto"equal performance contours," relating performance level to both fundamental frequency,f0, and averageharmonicnumber,if. B. Monotic Experiments n-! n-I n n Stimuli were presentedmonotically at a sensation level of 20 dB. Equal performancecontoursare shown n n+l n n+l n+l n+2 n+l f/fo n+2 f/fo concentrated (c) [R•GHT "ILEFT EAR ß -J 50 tn 35 - I, AR [ I • , n 20 • I I ! n-2 n-I n in Fig. 3(a). On onesubject(AH) a controlexperiment wasperformedwith the partials of eachnote presented in time sequence,rather than simultaneously.We chosevaluesfor f0 and ff which had previouslyyielded perfectperformance.Despite the fact that f0 can easily be computedfrom this stimulus,the subjectwas not able to perform better than chance,even after many n+l f/fo (d) Fro. 2. Experimental paradigm for musicalinterval identifica- tion experiments:(a) musical intervals to be identified, (b) time-envelope of the total stimulus,(c) the threepossibletwo-tone stimuli for each of the two notes; for each note a random choice wasmadeamongthe threepossiblestimuli, (d) stimulusconfigura- tion for one note of a dichoticallypresentedmelody,with the efforts. We concluded that randomization of harmonic number forces the subject to use both partials simultaneously. The experimentalresults show clearly that the best performance is achieved with the lowest harmonic numbers, that is, the larger the spacing between harmonics on a log-frequency scale, the better the performance. This finding, together with what is known about the physiologyof cochlearfrequency analysis (Kiang, 1965; Goldstein, Baer, and Kiang, 1971)and the psychophysics of resolutionof partialsin complextones (Plomp, 1964) suggestedthe hypothesis that both harmonicsof the soundsemployed in this experimentare processedthrough separatechannelsof the cochlearoutput to obtain successfulidentification. C. Dichotic Experiments and Relation to Monotic Results addition of simulated combination tones. The separate-channelhypothesiswas tested employing probably the most extreme channelseparationone to push a correspondingbutton on an answer box can think of, namely separateears. We thought that, within 4 sec after each stimuluspresentation,after while negative resultswould not necessarilyinvalidate whichfeedbackwasprovided.All threesubjectsscored the hypothesis,positive results would prove it to be perfectly, which indicatesthat they did possess the correct. The experimentalparadigm was the same as requiredmusicalskills,andthat performance limitations in the monatic experiment,except that the stimulus so often encounteredin identificationexperimentsof this kind and which are commonlyattributed to imperfectmemory (Miller, 1956) were apparently of little significancehere. Next, in the basicexperiments, subjectsweretested individually.AH and SW were given feedbackafter 522 Volume51 Number2 (Part 2) partials were presenteddichotically,one to each ear. The resultsare shownin Fig. 3(b). Comparisonwith the monatic resultsshowsthat eachsubject'sperformance is essentially the same under both stimulus conditions, suggestingthat a central mechanism integratesandprocesses informationfrombothcochleas, 1972 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 PITCH OF COMPLEX SUBJECT SW 14- TONES NH AH _ 12 - MONOTI C IO-Pc;o= _ 20dBSL - 2-IOO• ,•- ' ' ' ' ' 0 ,,;,o •;o,•oo ,;oo •ooo .o-"••o,,,, •o•.s• 4- 20dBSL •, •o•.• 80 2- I00• o, , , , ' " o ,;oo ,;oo ;oo f L• I I I I IF DICHOTIC I2• 50dBSPL IOI- Pc= • (d) •/ •o•-,,•_• 0L • I I I I 14FP c= DICHOTIC '•1- •o,,,,. 50 •a 0t • i i i i 200 400 600 800 I000 fo(Hz) b I I i i • 200 400 600 800 I000 fo(Hz) I J I I i i I 1 I 200 400 600 800 I000 fo(Hz) Fro. 3.Performance contours for musical interval identification experiments with pairs ofrandomly chosen successive upper harmonics. Stimulus conditions: (a)monotic, 20 dBSL, (b) dichotic, 20 dBSL, (c)monoti'c, 50(40) dBSPL, (d)dichotic, 50(40) dBSPL, (e) dichotic, 50 (40)dBSPL, withtwo simulated combination tones. andthattheinputsto thismechanism aresimilarunder andfourthexperiment undertaken at higherstimulus both monoticand dichoticstimulation. intensity levels, 50dBSPLforsubjects AH andSW, Theeffectof stimulus intensity uponbothmonotic and40 dB SPLforsubject NH. Thesesound-pressure anddichotic performance wasinvestigated in a third levels(SPLs)werechosento avoidthe auraldifference TheJournal of theAcoustical Society of America 523 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 HOUTSMA AND GOLDSTEIN tone, f•.--f•, in the monoticcase(Plomp,1965; Goldstein, 1967), and interaural crosstalkin the dichotic case (Zwislocki, 1953). Equal performancecontours are shownin Figs.3(c) and 3(d). human listenexto retrieve the notes representedby a sequenceof periodicsounds.While these experiments provide informationabout the conditionsunder which subjectscan perform a certain interval identification task, they do not show that the subject "hears" D. Influence of Aural Combination Tones melody.The subjectcouldhave learnedto discriminate and identify eight stimuli merely on the basisof some Comparing results from all four experiments,it unspecified, subjective criterion, rather than by appearsthat each subject'sperformanceis essentially recognizing each stimulus as a particular musical the sameunder all four stimulusconditions,exceptthat interval, although each subjectassertedthat the latter for higher-intensitymonotic stimulation[-Fig. 3(c)• was true. performancecontoursare shiftedupwardsby approxTo obtainmoreobjectiveevidencethat the measured imately 2 or 3 harmonic numbers.Such an upward performance truly reflects the musical behavior of shift might be expected becauseof the presenceof aural combinationtones of the type fl--k (f•.-- fl) melody perception,a control experimentwas carried generatedin the peripheral ear for a monotic stimulus out in which subjectswere requiredto name previously unknown four-note melodies using standard musical comprisingthe frequenciesfl and f•. (Zwicker, 1955; notation. On each trial four periodic soundsat 50 dB Plomp, 1965; Goldstein, 1967, 1970; Goldsteinand Kiang, 1968). These combinationtones provide the SPL were presentedin regular time sequence,each sound being a two-tone complexwith frequenciesat ear with two or three harmonics below those contained successiveharmonicsof some (missing) fundamental in the stimulus, which are probably very useful since frequency in the range of 200-400 Hz. The lower all the results thus far indicate that performance harmonic number was chosenrandomly from '3, 4, improves with decreasingharmonic number. These and 5 for each sound. Since each note could be repcombination tones could make the e•ective average resentedby three different sounds,a four-notemelody harmonic number about 2 or 3 lower than the actual could be presentedin 34 (i.e., 81) different ways. value of • in Fig. 3. In the dichotic experiments, Four different four-note melodies were used, which combination tones are not present; in the monotic were unknownto the subjectsprior to the experiment. experimentat 20 dB SL they would be near or below In the first part of the experiment, subjectscould threshold(Goldstein,1967). There are two obviousways to test this combination- determine which of the four melodieswas presented, and their task was to recognize all four by writing tone hypothesis.One approachwould be to repeat the them down using a relative note scale. Since each third experiment(mediumintensity,monotic)and add a number of pure tones to the stimulus having the melody could be representedin 81 different ways, exact frequency, amplitude, and phase to cancel all subjectswere allowedto listen to severalpresentations aurally generatedcombinationtones(Goldstein,1967); of each melody before answering.In the secondpart, all four of the four-note melodiesused in the first part identificationperformanceshouldthen approachthat were presentedat random, and subjectswere askedto for dichotic experiments.The technical and experimental difficultiesof providingthe correctcancellation identify eachmelody presentedby pushingthe approamplitude and phasefor eachnote causedus to eschew priate button on a responsebox. This experimentwas this approach.Instead, an experimentwas carried out performed both with monotically and dichotically presentedtone complexes,using 10 subjectsall of usingdichoticstimuliat 50 dB SPL (40 dB for subject NH) with the additionof two moretonesthat approx- whom were familar with musical dictation. Since our imately simulated the aural combinationtones which principal interest was to investigate whether the reflecteda naturalmusicalbehavior the ear generates under monotic conditions. The subject'sresponse rather than someingeniouslyacquiredskill for performstimulus paradigm is shown in Fig. 2(d) and the experimentalresultsin Fig. 3 (e). Performancecontours ing a complicatedtask, very little time was spent on showedthe same upward shift as is seenin the third training. The results of the monotic experiment were that experiment,and the similaritiesof Figs. 3(c) and 3(e) nine out of ten subjectscharacterized eachsequence of are considered strongevidencethat suchperformance four soundsby the missing fundamentalsin the first differences as occur between monotic and dichotic part and scoredperfectlyor nearperfectlyin the second stimulus conditions can be attributed to combination part. The samewastrue for six subjectsin the dichotic tonesgeneratedin the peripheralauditorysystem. experiment;three had somedifficultiesin both parts, III. EXPERIMENTAL TESTS FOR and the one subjectwho couldnot perceiveconsistent MUSICAL BEHAVIOR melodiesin the first part undermonoticconditionswas The purpose of the experimentsreported in the also unable to do so for dichotic stimuli. The three previoussectionwasto investigatemelodyperception, subjectswho participatedin our previousbasicexperiwhich was taken operationallyas the ability of the ments had no difficulties with either the monotic or 524 Volume 51 Number 2 (Part 2) 1972 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 PITCH OF COMPLEX dichotic tests. These results provide direct evidence that the phenomenonunder study doesindeed reflect normal musical behavior rather than a sophisticated learned skill that is not directly relevant to melody perception. IV. EVIDENCE FOR PERCEPTION Since random TWO OF variations MODES COMPLEX in harmonic OF MUSICAL TONES number from one note to the next are highly unusual in normal musicalpractice,we might wonderwhat would happen in melody recognitionexperimentslike thosediscussed in Sec.III if the harmonicsof eachnotevaried systematically rather than randomly. As was noted earlier, a choice of the same harmonic number for every note would lead to a trivial situation where retrieving the frequencyof either the missingfundamentalor a single upper harmonic would lead to a report of the same melody.A more interestingstimuluswould be onewhere TONES soundshas beenstudiedactively and reportedin terms of musicalpitch perceptionevoked by single sounds with fixedharmonics(Seebeck,1841;Helmholtz, 1863; Hermann, 1912; Fletcher, 1924; Schouten, 1940b; Thurlow and Small, 1955; deBoer,1956; Flanaganand Guttman, 1960; Schouten,Ritsma, and Cardozo,1962; Ritsma, 1962; Smoorenburg,1970). Reviews of this literature have recently been given by Plomp, 1967; Small, 1970; Schouten,1970; and Ritsma, 1970. We have chosento performexperimentsusingsequences of sounds with random harmonic number because we found that successive notesevokedin subjectsa sense of musical interval and provided a context for that feature of each sound that was being contrasted. In addition, this choiceminimized the opportunitiesfor behavioral responsesthat are directly correlated with tracking of partials. Finally, it enabledus to demonstrate directly the role of fundamental retrieval in musical behavior. Despite our departuresfrom earlier procedures,we the fundamental and harmonics describe different believe that our investigations involve the same melodies.Such stimuli are given by any of the sound phenomenon as has been described in earlier work. sequencesused earlier for which the harmonic numbers were not all equal. Of the 81 possible four-sound De Boer (1956) reported pitch matches in which sequences usedin Sec. III to representeach four-note inharmonic complex tones comprisingfive or seven melody, one sequencewas chosen for which no two partials with uniform frequencyspacingwere aurally successive noteswere representedby soundswith the matched to periodiccomplextoneswith a fundamental same harmonic numbers. A recognition experiment that differedsystematicallyfrom the spectralspacing of the inharmonic sound. Schouten, Ritsma, and similar to the first part of the experimentin Sec. III was performed. Four different four-note melodieswere Cardozo (1962) producedsimilar data for AM comused,whichwereunknownto the subjectsprior to the plexes(three partials) which are reproducedin Fig. 4, experiment. Each melody was representedby one and Smoorenburg(1970) found essentiallythe same four-soundsequence.Thesestimuli provideconsistent results using complexesconsistingof only two tones. but different information dependingon whether the All these tone complexeswere presentedmonotically subjectretrievesthe missingfundamentalor a partial. or diotically (i.e., one ear stimulated, or both ears A number of subjectswere asked to listen to such identicallystimulated).Usinga musical-intervalmatchstimuliand to reportwhichnote sequence they would ing paradigm, we obtained very similar results with associatewith each of them. Some reported a note two-tone stimuli presented both monotically and sequence which was consistent with the missing dichotically (Houtsma and Goldstein, 1971). Two fundamentals; othersreporteda melodycorrespondingadditionalexperimentswereperformedto examinethe to one of the partials; one subjectwas able to report relation betweenthe findingsof pitch-matchingexperimelodies described by boththe upperandlowerpartial. ments and our experimentson interval identification. Helmholtz(1863) mentionedthat complexperiodic The first was an interval identification experiment soundscan be perceived "synthetically," i.e., the with the same paradigm as was used in the basic complexis perceivedas one sound,havingone pitch; experiments,except that for each note the two freor "analytically,"i.e., partialsare heard individually, quencies (with successiverandom harmonic numbers) eachonehavingits own pitch. Crossand Lane (1963) were uniformly shifted up or down in frequency by showed that these two modes of musical behavior can be controlledby previoustraining.Our experiments do indeedsupporttheseobservations andshow,moreover, that listeners can switch from one mode to the other when this providesthem with relevant informationfor a particular task. V. RELATION TO EARLIER WORK FUNDAMENTAL RETRIEVAL ON For well over a century,the phenomenon of fundamental retrievalin the perceptionof periodicmusical some random increment less than one-fourth of the fundamental frequency. This limit was imposed to avoid pronouncedpitch ambiguities that are to be expectedfor larger inharmonic frequency shifts from the data shown in Fig. 4. Stimuli were presented monoticallyat 50 dB SPL to one subject{AH) and at 40 dB to another (NH). We chosevaluesfor i and f0 that had yielded perfect identification performance without the random frequencyshift. The results are shown in Fig. 5. It is obvious that performance is considerablyless than perfect, although the relevant The Journal of the AcousticalSocietyof America 525 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 HOUTSMA AND n=7 ,/ 240 GOLDSTEIN n=8 / • // n =9 / o // =. ,•/ Fod' / // n/=I0 ,,/•/ o/ FIG. 4. Pitch as a function of ,..•/c), / • n=II 200 n= 12 18o /•o • /• • 160 / u ,/ 1200 / / //- 1400 / / // •'• /Z / 1600 CENTER / • /•B / '• a ./ / 1800 FREQUENCY /' /• o / /o . _..... z o SUBJECTH ß f• 2000 2200 RR the center frequency for a three-componentcomplextone. Stimulus intensity is 35 dB SL. The fundamental frequency, p, of an amplitude-modulated soundcomprisingthe harmonic frequencies (n- 1)p, np, and (n+l)p is adjusted to match aurally the musical pitch of another amplitude-modulated sound comprisingthe inharmanic frequenciesf-200 Hz, f, and f4-200 Hz. The first effect is defined by the dashed lines of slope 1/n. (From Schouten,Ritsma,andCardozo, 1962, with permission from The Journal of the Acoustical Societyof A •nerica). 2400 (Hz) messagewas preservedin the differencefrequenciesof the complex tones, indicating that inharmonic frequency shift of a complex tone does indeed alter its pitch. Performancewas better than chance (12.5% correct) and improved with increasing harmonic number •. This was predictable from the particular choiceof musicalintervals to be identifiedand the pitch uncertainty introduced by the uniformly distributed random frequency shift by using the simple decision model outlined in the legendof Fig. 5. In the secondexperiment we examinedthe musical relevanceof the empirical relation between pitch of a complex tone and inharmonic frequency shift of its partials by applying this relation to our musicalinterval identification paradigm. The experiment differed from the previous one in that the inharmonic soundswere chosento preserve the original musical intervals. For the first note a random frequencyshift, positive or negative, was chosenwith a maximum of one-fourth of the fundamental frequency; then the empirical"first effect" relation (Schouten,Ritsma, and Cardozo,1962) shownin Fig. 4 was usedto computethe inharmonicfrequencyshift for the secondnote required to give the originalmusicalinterval. Figure 5 showsthe results; identification performanceis almost perfect. Occasionalmistakes are easily accountedfor by the additionaldifficultyfor the subjectof havingto identify eight intervals in a roving key, sinceintervalsno longer (1) Despite differencesin experimental procedure, there is clear evidence that our studies on fundamental retrieval behavior using interval identification reflect the samebasicphenomenon as the work of many other investigatorson "periodicitypitch" or "residuepitch." (2) Inharmonic tone complexesdo have a definite musicalvalue, providedthat the deviation from the harmonic situation is not so large that it will cause pitch ambiguities.This value is, at leastfor the pitch resolutionrequiredin our experiments,well described by what is knownin the literatureas the "first effect" of inharmonicfrequencyshift. (3) The largedeparturesfrom the first effectwhich havebeenreportedto occurundercertainmonaticand diotic stimulusconditions(de Boer, 1956; Schouten, Ritsma, and Cardozo,1962; Ritsma, 1970; Smoorenburg,1970;van denBrink, 1970)reflectonlya stimulus modification attributable to aural combination tones (Goldsteinand Kiang, 1968; Ritsma, 1970; Smoorenburg, 1970), rather than essentialfeatures of the central processorof musicalpitch. VI. DISCUSSION The two currentlypopulartheoriesof musicalpitch perception are the frequencydetectiontheory(Helm- holtz, 1863; Fletcher,1924) and the periodicity-detection theory(Schouten, 1940a;Licklider,1962).According to the formertheory,the cochlearspectrumanalyzer started with the same note because of the random maps the energyof the fundamentalfrequencytonoinharmonicfrequencyshift introducedin the first note. topically,and pitch is associated with spatialposition Having establishedthe relation betweenearlier work of the fundamental; if the stimulushas no energy at on musicalpitch and our basicexperiments,we make the fundamental frequency, nonlinearitiesin the the followingconclusions. peripheralear will introduceit. Accordingto the 526 Volume 51 Number 2 (Part 2) 1972 . Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 PITCH OF COMPLEX TONES ioo 80-- Fro. 5. Effects of inharmonic fre- quency shift on musical interval identification performanceof two subjects. The solid line indicates performancefor the random frequency shift paradigm predicted by a simplifieddecisionmodel which assumesthat (1) the musicalvalue for each note is randomly distributed with a rectangularpdf of width fo/2•, (2) the means of the nine notes in Fig. 2(a) are uniformly separatedby f0/16, and (3) the first note of each interval is ignored,while the secondnote is detected for maximum expected correct answers. 60-- 4O fo = 300 Hz 2O OPEN SYMBOLS' CLOSED 0 2 latter theory, somemechanismis presumedto measure temporal periodicitiesat the output of the cochlear spectrum analyzer and then associate pitch with temporal interval. All the dichotic experimentsdiscussedin this paper prove directly that neither energy at the fundamental frequency nor fundamental periods in the cochlear output are necessaryconditionsfor a subjectto retrieve fundamental frequencies in a sequenceof periodic sounds. This definite inadequacy of the periodicitydetection theory is a new finding. Earlier attempts to reconcile the establishedsensationof a missing fundamental (Seebeck,1841)with Helmholtz'sfrequency-detection theory by meansof aurally generateddistortion tones (Fletcher, 1924) had already proved inadequate. Musical pitch perception could be established for which difference SPL A H(•) 40dB SPL N H (o) CONTROLLED SYMBOLS' RANDOM SHIFT SHIFT • 0 with stimuli 50dB tone distortion had to be ruled out becauseof low stimulusintensity (Schouten, 1938; Plomp, 1967; Goldstein, 1967) or external maskingat the differencefrequency (Licklider, 1954). Furthermore, difference frequency did not designate the musical pitch of inharmonic tone complexes (Hermann, 1912; Schouten, 1940b; de Boer, 1956; Schouten,Ritsma, and Cardozo,1962). Considering these theories from a converse viewpoint, the results of the experimentsreported in this paper show, moreover, that a sufficient condition for fundamental tracking is indeed given by energy at the fundamental, which can very easily be demonstrated, but probably not by fundamental periodsin the cochlear output. The data suggest that such fundamental periods are probably irrelevant for the followingreasons: 4 6 8 (2) With monotic stimuli the possibilitiesfor fundamental periodsin the cochlearoutput are enhancedas the harmonic number n is increased. Nevertheless, interval identification performancealways deteriorates with increasingharmonic number. (3) All monotic stimuli for which identification performancewasbetter than chanceeither consistedof behaviorally resolvabletones, or generatedsuch tones as combinationtones(Plomp, 1964; Goldstein,1967). The similarity of the monoticand dichoticbehavior and the close correlation between the limits on funda- mental retrieval and behavioral frequency resolution strongly suggest that fundamentals of complex-tone stimuli are retrieved by meansof a central mechanism which operateson thosestimulustonesor combination tones that can be resolved in the cochlea. This conclu- sion is a radical departure from the theory of the "residue,"which is definedas' "the joint perceptionof thosehigher Fourier componentswhich the ear fails to resolve" (Schouten,Ritsma, and Cardozo,1962). The experimentsreported here imply that the unresolvedremainderof the harmonicseries,or residue,is not responsible for fundamentalretrieval. The finding that performancefor dichotic stimuli is boundedby harmonicnumbersimilarlyas undermonoticstimulus conditionscannot be accountedfor by cochlearfre- quencyresolution alone;thecause mustbemorecentral. The finding that the only effectiveconveyorsof musicalpitch are thosepartialswhichcanbe resolved behaviorallyis actuallynot new. Ritsma (1962, 1963, 1967a,1967b)found that the existenceregionof the "tonal residue" is bounded by an upper harmonic number whosemagnitudedependson the complexity (1) Monotic stimuli, which can provide cochlear of the stimulusand the experimentalparadigm; in fundamental periods,give no better performancethan addition,he foundthat the mosteffectiveor dominant dichotic stimuli: performance differences between pitch conveyorsare the harmonics3, 4, and 5. Our monotic and dichotic conditions at higher intensities findingthat there is no noticeabledrop in performance are well accounted for bv aural combination tones. for harmonic numbers below 4 is not inconsistent with The Journalof the AcousticalSocietyof America 527 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 HOUTSMA AND GOLDSTEIN Ritsma's findings,sincethe effect is probably too small fibersand, at leastfor lowerfrequencies, in the temporal to make performancein our experimentsdeteriorate firing patterns of individual fibers (Kiang, 1965; below perfect (Bilsen, 1971). Smoorenburg(1970) Whitfield, 1970). From our work it appearsthat all concluded from his pitch matching data for two-tone stimuli that the effective harmonic numbers for fundamentalsof 200 Hz have an upper boundof about 9; this conclusionis supported and extended to other fundamentalfrequenciesby our results. The effectivenessof the lower partials alone in conveyingmusical pitch is also consistentwith some readily observedmusical phenomena.First, there are somemusicalinstruments,e.g., piano strings (Young, 1952), which produce soundswhose higher partials are inharmonic. From our results we conclude that this is not necessarilydisturbing, since only the lower partials are effective in communicating the note. Second,Mach's (1881) observation--thatdissonance is equally pronouncedwhen the soundsfrom two mistuned simple tone sourcesare presented to one ear or to separate ears, and can therefore not be attributed to beating effectsin the cochlea--is also quite consistent with our experimental findings. Simple tones whose frequenciesare related by ratios of small integersare effective conveyersof musical pitch independentlyof whether they are presentedmonotically or dichotically. Presumably, stimulus conditionsthat evoke no definitive musicalpitch contributeto a musician'sjudgment of dissonance (cf. Plomp and Levelt, 1965). Little specificcan be said at this point about the neural mechanism that mediates fundamental retrieval. Its closeconnectionwith behavioralfrequencyresolution suggeststhat neural signals that allow one to recognizesimple tones and retrieve the missingfundamental of harmonic complex tones are common in early stages of processing.Whatever kind of neural mechanismis postulated, for complex-tonestimuli it will operateon neural signalsderivedfrom peripherally resolved tones. It is known that information about the frequency of such tones is-preservedafter the neural transformationin the form of the placeof activenerve the constraintscouldconceivablybe met by mechanisms basedon either time or placeinformation.An important simplification for further work on musical pitch is offered by the empirical finding that fundamental retrievals for dichotically and monotically presented two-tone stimuli are essentially identical. Thus, the basicpropertiesof the centralprocessor of musicalpitch can be investigatedwith the two-tone dichoticstimulus that avoids confounding and irrelevant phenomena caused By peripheral interactions among stimulus partials. Earlier efforts to find cochlea-generatedplace (yon B•k•sy, 1961) and time (Schouten,1940a)cuesin responseto the periodicity or missingfundamental of complex input stimuli now appear to be irrelevant; mechanisms more central than the cochlea must be investigated. ACKNOWLEDGMENTS The authors are deeply indebted to W. F. Kelley, D. J. Callahan, and L. D. Braida for their valuable assistance in developing the instrumentation and computer software that made these experimentspossible. Informative discussions werecontributedby H. S. Colburn, N. !. Durlach, D. M. Epstein, and W. M. Siebert. This work was supported by the National Institutes of Health (Grant 5 P01 GM14940-05 and Grater5 T01 GM01555-05). * This report is basedon a PhD thesissubmittedto the Dept. of Electrical Engineering, Mass. Inst. of Technol., June 1971, by A. J. M. Houtsma, and supervisedby J. L. Goldstein.The basicresultswerepresentedat the 79th Meeting of the Acoustical Society of America in Atlantic City, April 1970, [-A. J. M. Houtsmaand T. L. Goldstein,J. Acoust.Soc.Amer. 48, 88(A) (1970)]. t Also at the Eaton PeabodyLaboratory of Auditory Physiology, MassachusettsEye and Ear Infirmary, Boston, Massachusetts 02114. REFERENCES FLANAC, AN, J. L., and GUTTMAN,N. (1960). "Pitch of Periodic Pulseswithout Fundamental Components,"J. Acoust.Soc.Amer. Observedon the CochlearModel with Nerve Supply," J. Acoust. 32, 1319-1328. Soc. Amer. 33, 888-896. F•.v.TCaV. R, H. (1924). "The PhysicalCriterion for Determining B•.smq, F. A. (1971). "Pitch Perceptibilityfor Multi-Component the Pitch of MusicalTone," Phys. Rev. 23, 427-437. Signals,"J. Acoust.Soc.Amer. 51, 113(A). Go•.•)sTv. xN, J. L. (1967). "Auditory Nonlinearity," J. Acoust. •)v. BOy. R, E. (1956). "On the Residuein Hearing," PhD thesis, Soc.Amer. 41, 676-689. Univ. of Amsterdam. Go•.•)sxmN,J. L. (1970). "Aural CombinationTones," in FreDE BOER, E. (1956). "Pitch of Inharmonic Signals," Nature quencyAnalysisand PeriodicityDetectionin Hearing, R. Plomp andG. F. Smoorenburg, Eds. (Sythoff,Leiden,The Netherlands). 178, 535-536. vo•qB•c•s•r, G. (1961). "Concerningthe FundamentalComponent of Periodic Pulse Patterns and Modulated Vibrations Goxa)sxv. xN, J. L., BAY. R, T., and KXANG,N.Y. S. (1971). "A Theoretical Treatment of Latency, Group-delay and Tuning Characteristicsfor Auditory Nerve Responsesto Clicks and Tones," in Physiologyof theAuditorySystem,M. B. Sachs,Ed. CROSS,D., and LANE, H. (1963). "Attention to SingleStimulus (National EducationalConsultants,Baltimore,Md.) J. L., and KIANG,N.Y. S. (1968)."Neural Correlates Properties in the Identification of Complex Tones," in Experi- GOLDSTEIN, mentalAnalysisof theControlofSpeechProductionandPerception," of the Aural CombinationTone 2f,-f2," Proc. IEEE 56, 981-992. Univ. of Michigan ORA Rep. No. 05613-1-P. VAN•)v.NBRXN•, G. (1970). "Two Experimentson Pitch Perception: Diplacusisof Harmonic AM Signalsand Pitch of Inharmonic AM Signals," J. Acoust. Soc. Amer. 48, 1355-1365. 528 Volume 51 Number 2 (Part 2) 1972 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40 PITCH OF COMPLEX vo• H•.•mto•z, H. L. F. (1863).Die LehreyondenTonempfindungertals physiologische Grundlage fiir die Theorieder Musik (F. Vieweg& Sohn,Braunschweig). H•.a•t^•, L. (1912)."NeueVersuchezur Frageder Unterbrechungstone,"Arch. ges.Physiol.146, 249-294. Hou•s•t^,A. J. M., and Go•t>s•.x•,J. L. (1971)."Perception of TONES R•xs•^, R. J. (1970). "Periodicity Detection," in Frequency Analysisand PeriodicityDetectionin Hearing,R. Plomp and G. F. Smoorenburg,Eds. (A. W. Sijthoff, Leiden, The Netherlands). Sacas, M. B. (1969). "StimulusResponseRelation for Auditory Nerve Fibres: Two-Tone Stimuli," J. Acoust. Soc. Amer. 45, 1025-1036. SACaS,M. B., and Kx^•o, N.Y. S. (1968). "Two-Tone Inhibition of Complex Tones,"Res.Lab. of Electron.,MIT Tech.Rep.484. in Auditory-Nerve Fibres," J. Acoust.Soc.Amer. 43, 1120-1128. S^um>•.as,F. A. (1937). "The Mechanical Action of Violins," KI•O, N.Y. S. (1965).Disc_harge Patternsof SingleFibersin the J. Acoust. Soc. Amer. 9, 81-98. Cat'sAuditoryNerve(MIT Press,Cambridge,Mass.). SAum)•.as,F. A. (1946). "Analysisof the Tones of a Few Wind Lm•c•I•)•.a,J. C. R. (1954). "PeriodicityPitch and 'Place' Instruments," J. Acoust. Soc. Amer. 18, 395-401. Pitch," J. Acoust.Soc.Amer.26, 137(A). Sc}touxE•, J. F. (1938). "The Perceptionof SubjectiveTones," Lic•c•n)•.a, J. C. R. (1962). "PeriodicityPitch and Related Proc. Kon. Nederl. Akad. Wetenschap41, 1086-1093. Auditory ProcessModels," Int. Audiol. 1, 11-35. S½}toux•.•,J. F. (1940a). "The Residue and the Mechanism of M•c}t, E. (1881). "Zur Analyseder Tonempfindungen," Sit- Hearing," Proc. Kon. Nederl. Akad. Wetenschap43, 991-999. zungsber. Kaisersl. Akad. Wissensch.,Mathemat.-Naturwis- Sc}toux•.•, J. F. (1940b). "The Perception of Pitch," Philips sensch.,Klasse 92, II Abt. Tech. Rev. 5, 286-294. M•.a, G. A. (1956). "The Magical Number Seven Plus or SCHOUTEN, J. F., RITSMA,R. J., and CARDOZO, B. L. (1962). Minus Two," Psych. Rev. 63, 81-97. "Pitch of the Residue," J. Acoust. Soc. Amer. 34, 1418-1424. O}m, G. S. (1843). "Ueber die DefinitiondesTones,nebstdaran SCHOU•.N, J. F. (1970). "The ResidueRevisited," in Frequency gekniipfterTheorie der Sirene und iihnlicher ton bildender Analysis and Periodicity Detectionin Hearing, R. Plomp, and Vorrichtungen,"Ann. Phys. Chem.59, 513-565. G. F. Smoorenburg,Eds., (A. W. Sijthoff, Leiden, The NetherP•o•t•, R. (1964). "The Ear as a FrequencyAnalyzer,"J. Acoust. lands). SE•.•.CK, A. (1841). "Beobachtungenfiber einige Bedingungen Soc. Amer. 36, 1628-1636. P•o•t•, R. (1965). "Detectability Threshold for Combination der Entstehungvon T6nen," Ann. Phys. Chem. 53, 417-436. S•t•LL, A.M., Ja. (1970). "Periodicity Pitch" in Foundationsof Tones," J. Acoust.Soc.Amer. 37, 1110-1123. Modern Auditory Theory, J. R. Tobias, Ed. (Academic, New PLO•t•, R. (1967). "Pitch of ComplexTones," J. Acoust. Soc. York), Vol. I. Amer. 41, 1526-1533. S•tOO•EN]•Ua•,G. F. (1970). "Pitch Perceptionof Two-Frequency PLOMP,R., and LEVELT,W. J. M. (1965). "Tonal Consonance Stimuli," J. Acoust. Soc. Amer. 48, 924-942. and Critical Bandwidth," J. Acoust.Soc.Amer. 38, 548-560. THUaLOW,W. R., and S•t•nL, A.M., Ja. (1955). "Pitch PercepPzo•tv, R., and S•tooaE•auag, G. F., Eds. (1970). Frequency tion for Certain Periodic Auditory Stimuli," J. Acoust. Soc. Analysis and PeriodicityDetectionin Hearing (A. W. Sijthoff, Amer. 27, 132-137. Leiden, The Netherlands). WmTmEL•), I. C. (1970). "Central Nervous Processingin Relation RtTS•tA,R. J. (1962). "ExistenceRegionof the Tonal Residue. to Spatio-Temporal Discrimination of Auditory Patterns," in I," J. Acoust.Soc.Amer. 34, 1224-1229. FrequencyAnalysis and PeriodicityDetectionin Hearing, R. Plomp, R•TS•tA,R. J. (1963). "ExistenceRegionof the Tonal Residue. and G. F. Smoorenburg,Eds. (A. W. Sijthoff, Leiden, The Netherlands). II," J. Acoust.Soc.Amer. 35, 1241-1245. YOUN% R. W. (1955). "Inharmonicity of Plain Wire Strings," R•TS•tA,R. J. (1967a). "FrequenciesDominantin the Perception of the Pitch of Complex Sounds,"J. Acoust. Soc. Amer. 42, J. Acoust.Soc. Amer. 24, 267-273. 191-198. ZWmKE•, E. (1965). "Der Ungew6hnlicheAmplitudengangder R•TS•tA,R. J. (1967b). "FrequenciesDominant in Pitch Percep- nichtlinearenVerzerrungendes Ohres," Acustica5, 67-74. ZW•sLOCKI, J. (1953). "AcousticAttenuationbetweenthe Ears," tion of Periodic Pulses of Alternating Polarity," IPO Ann. Progr. J. Acoust.Soc. Amer. 25, 752-759. Rep. 2, Eindhoven, pp. 14-24. Musical Intervals: Evidencefor the Central Origin of the Pitch The Journalof the AcousticalSocietyof America 520 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 128.97.27.21 On: Tue, 17 Feb 2015 22:42:40