Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Copyright 1991 by the American Psychological Association, Inc. 0096-3445/9!/i3.00 Journal of Experimental Psychology: General 1991, Vol. 120, No. I, 101-105 CHARMed, But Not Convinced: Comment on Metcalfe (1990) D. Stephen Lindsay Williams College Metcalfe (1990) proposed her Composite Holographic Associative Recall Model (CHARM) as a model of eyewitness suggestibility. CHARM has many appealing properties and performed well in simulations of suggestibility. Most important, CHARM provides a formal mechanism by which suggestions impair memory without affecting performance on McCloskey and Zaragoza's (1985a) Modified Test. Nonetheless, a number of shortcomings limit CHARM'S usefulness as a model of suggestibility: (a) in the simulations, control terms differ from those in human studies; (b) the model makes a counterintuitive prediction about performance on a recognition pair composed of an event detail and its control term; (c) CHARM models association, not remembering; and (d) most of the intelligence in the simulations lies in the programmer rather than in CHARM itself, which limits the model's constraint on memory blending. Loftus and her colleagues (e.g., Loftus, Miller, & Burns, 1978) have demonstrated that misleading suggestions about details in an event can impair performance on tests of memory for those details. For example, subjects might view a slide sequence that includes a photo of a man holding a hammer (the event detail) and later receive either a misleading suggestion to the effect that the man had been holding a wrench (the suggested detail) or neutral information to the effect that the man had been holding a tool (the control). On a subsequent forced-choice recognition test of memory for details seen in the slides, subjects more often err on misled than on control items by selecting the suggested detail (e.g., wrench) rather than the event detail (e.g., hammer). Loftus (e.g., Loftus, 1981, 1989; Loftus & Loftus, 1980) interpreted these findings as evidence that misleading suggestions impair memory for event details. McCloskey and Zaragoza (I985a) argued that there are reasons to expect poorer misled than control performance on Loftus's "Standard Test" even if misleading suggestions do not impair ability to remember event details. Their arguments were sharpened by the results of six experiments in which the critical test pair consisted of the event detail and a novel distractor (e.g., hammer vs. screwdriver). McCloskey and Zaragoza argued that if suggestions impair memory for event details, then subjects should do more poorly on misled than control items on this "Modified Test." No such effect was obtained. In a response to McCloskey and Zaragoza's (1985a) critique, Loftus, Schooler, and Wagenaar(I985) argued that the Work on this comment was supported by Williams College and the Bronfman Science Center. 1 thank Janet Metcalfe for engaging in a very productive dialogue concerning an earlier draft of this article. 1 also thank David Levine for help on matrix algebra; William Batchelder, Colleen Kelley, and an anonymous reviewer for helpful and insightful comments on earlier drafts; Jonathan Schooler for bringing Ray Pike's article to my attention; and Marcia Johnson for general support and advice. Correspondence concerning this article should be addressed to D. Stephen Lindsay, Department of Psychology, Williams College, Bronfman Science Center, Williamstown, Massachusetts 01267. 101 Modified Test is insensitive to subtle "blending" effects, in which the suggested detail distorts the memory trace of the event detail. McCloskey and Zaragoza (1985b) rejected this defense and questioned the unexplained selectivity and intelligence of the blending process described by Loftus et al. (1985): "Why, when a stop sign and a yellow yield sign are blended, is the result a red yield sign as opposed to a yellow stop sign, or an orange sign with 5.5 sides and the word YSEOD on it?" (p. 386). They concluded that "the memory blend notion is not sufficiently well-developed to merit serious consideration" (p. 386). Metcalfe (1990) recently presented a well-developed theory of memory blends in the form of her Composite Holographic Associative Recall Model (CHARM). Like suggestibility studies involving human subjects, the CHARM simulations consisted of three phases (see Table 1). First, two vectors were convolved (e.g., a vector called "man" was convolved with a vector called "hammer"). Second, one of the previously stored vectors was convolved with a third vector (e.g., the vector called "man" was convolved with a vector called "wrench" [the misleading suggestion] or with a vector called "tool" [the control term]), creating a composite memory trace including information about both associations. At test, the vector common to both associations ("man") was correlated with the composite memory trace and the resultant vector (a blend of both "hammer" and either "wrench" or "tool") was compared with the vectors in a recognition pair. The Standard Test was simulated by using the event and suggested vectors as the recognition pair, whereas the Modified Test consisted of the event vector and a new vector (e.g., "screwdriver"). The results of these simulations corresponded closely to McCloskey and Zaragoza's (1985a) findings (see Table 1). The correct vector was selected less often when both of the vectors in the recognition test pair had been convolved with the cue (i.e., Misled/Standard Test) than when only the correct vector in the test pair had been convolved with the cue (i.e., Control/Standard Test and Misled/Modified Test and Control/Modified Test). Results of other simulations nicely mirrored Belli's (1989) and Tversky and Tuchin's (1989) findings with yes-no tests; Chandler's (1989) studies in which control terms were not presented in the post-event informa- 102 COMMENTS Table 1 Phases in CHARM Simulations of Suggestibility Phase Event Misled condition Control condition [MAN'HAMMER] [MAN'HAMMER] Narrative [MAN'HAMMER + MAN»WRENCH] [MAN'HAMMER + MANTOOL] Test cue MAN#[MAN*HAMMER + MAN*WRENCH] = MAN#[MAN*HAMMER + MAN'TOOL] = HXOMER 34% correct 64% correct 66% correct 65% correct nwfcBwes Standard test (HAMMER/WRENCH) Modified test (HAMMER/SCREWDRIVER) Note. Percentages are from Metcalfe (1990, Simulation la). The contents of the composite memory trace (which are not themselves accessible to consciousness) are shown in brackets. The symbol * indicates convolved with and the symbol # indicates correlated with. The superimposed words to the right of the equal signs represent the single "blended" vectors produced when the composite trace is cued. tion; Zaragoza, McCloskey, and Jamis's (1987) recall data; and Loftus's (1977) and Belli's (1988) color shifts. Metcalfe's research is valuable for a number of reasons. CHARM'S interactive nature, use of composite storage, and success at simulating a variety of memory phenomena (Eich, 1982, 1985; Metcalfe, 1990) make it an appealing model. At a more specific level, this work helps to integrate the suggestibility literature into the broader context of contemporary theoretical approaches to memory and cognition. This is important because discussions of suggestibility have sometimes been theoretically naive. Most important, CHARM demonstrates a formal mechanism by which suggestions could impair ability to remember event details without affecting performance on the Modified Test. Noninteractive, unidimensional trace strength models of memory would hold that virtually any memory-impairing effect of suggestions would affect performance on the Modified Test (Zaragoza & McCloskey, 1989). CHARM provides a formal demonstration of a system in which the effects of interpolated material on ability to remember event details depends, in part, on how memory is tested. Despite these virtues, CHARM suffers from several shortcomings that limit its usefulness as a model of eyewitness suggestibility.' First, the control terms used in the simulations differ in an important way from those used in studies of human suggestibility. Second, the model makes a counterintuitive prediction about performance on a forced-choice test pair composed of an event detail and its control term. Third, in its current form, CHARM is a model of association, not a model of remembering. Finally, CHARM itself does little to constrain the nature or extent of memory blending because most of the intelligence involved in the simulations lies in the human who runs them rather than in CHARM itself. Prototype Versus Superordinate Control Terms In most of Metcalfe's (1990) simulations, the vectors representing the event detail, suggested detail, and novel detail were constructed to be similar to the control term. This was done by making randomly selected subsets of the features of each of the specific terms ("hammer," "screwdriver," and "wrench") have the same values as they did in the vector that represented the control term ("tool"). Consistent with other researchers' terminology (e.g., Posner & Keele, 1968), Metcalfe referred to these control terms as prototypes. In studies of human suggestibility the controls have been superordinate terms (e.g., "tool"), not prototypes (e.g., ?). Metcalfe identified her control terms as both prototypes and superordinates (e.g., "Item 32 was the prototype 'tool'"; p. 152). CHARM can handle prototypicality, but is is much less clear that it can represent the relationship between basic level and superordinate terms. This is a complex issue, but I would argue that a superordinate term does not reduce to a blend of its subordinates; for example, "tool" does not refer to an object with features typical of hammers, screwdrivers, wrenches, etc.2 The difference between category terms and prototypes is important for many reasons. In the current context, one important difference is that the control terms used in studies of human suggestibility are true and accurate references to the event details (the man was holding a tool, and he was not holding a "hamscrewren"). 1 Metcalfe (1990) suggested that the real-world implications of her model are restricted to cases in which there is a real-world object that closely matches the blend: "There are no real world objects that comprise a blend between a stop sign and a yield sign... [so] a literal blend would be ruled out immediately, even if such were retrieved from memory (as the CHARM model says it is)" (p. 158). This need not limit the real-world implications of the model, because the response selection processes could be made to return a "no response" decision if the retrieved vector does not adequately match a lexical vector. Thus, blending could impair response selection even if the blend does not match any vector in the lexicon. Mn natural categories, a prototype is the most representative exemplar of a category. Rosch and Mervis (1975) found that the items judged to be most prototypical of a category shared the greatest number of features with other category members, and Rosch, Mervis, Gray, Johnson, and Boyes-Braem (1976) found that members of basic level categories have shapes and motor movements in common. These ideas and findings do not suggest, however, that superordinates reduce to prototypes. 103 COMMENTS Counterintuitive Prediction CHARM'S Intelligence Even if CHARM could use superordinate terms analogous to those used in studies with human subjects, its responses to such control terms would likely differ from humans'. Like people, CHARM more often errs on the Standard Test (event detail "hammer" vs. suggested detail "wrench") when the suggested detail had been added to the composite trace than when the control term ("tool") had been added to it. Consider what would happen if the test pair consisted of the event detail and the control term (e.g., "hammer" vs. "tool"). Given such a test pair, CHARM would perform more poorly in the control condition (in which "tool" had been added to the composite trace) than in the misled condition (in which "wrench" had been added to the trace). For CHARM, there is little difference between misleading suggestions and control terms. Indeed, in the simulations in which within-category structure was not represented, the sole difference between control terms and misleading suggestions was that only the latter appeared on the test. In those simulations, "hammer," "wrench," and "tool" were arbitrary labels assigned to unrelated vectors. Therefore simply switching the labels "Misled/ Standard Test" and "Control/Standard Test" on Metcalfe's data tables provides a good prediction of how CHARM would perform on a "hammer" versus "tool" test pair when no within-category structure was represented in the items: The "impairment" observed in the control condition given such a test pair would be the same as the impairment observed in the misled condition given the Standard Test.3 It is not immediately obvious what human subjects would do if tested with a "hammer" versus "tool" pair, but it would be odd indeed if misled performance exceeded control performance on such a test pair to the same (or nearly the same) extent as control performance exceeded misled performance on the Standard Test.4 As was mentioned above, McCloskey andZaragoza (1985b) decried the unexplained intelligence Loftus et al. (1985) attributed to blending, noting that no account had been offered for why blending results in the particular sensible errors used as lures on the test. At first glance, CHARM appears to answer this critique: The nature of blending is specified by the ordered quality of vectors, the extent of blending distortion is determined by the similarity of the vectors stored in the composite trace (and by weights, etc.), and only sensible blends appear as output because the responses are selected by matching the blended output vector to a lexicon of stored vectors. On closer examination, however, it appears that CHARM itself accounts for little of this intelligence. As Craik and Lockhart (1986) pointed out in a comment on Metcalfe's (Eich's) 1985 article, the model assumes that the features of an item are inherent in the stimulus itself, and this assumption is difficult to justify: "While stimulus properties place constraints on possible processing operations, there is little purpose in attempting to list features of a stimulus apart from cognitive operations... [because] without some point of view or conceptualization of a stimulus event, the number of such features is unbounded" (p. 361). Lewandowsky and Hockley (1987) provided empirical support for this criticism in the form of two studies demonstrating effects of orienting task when stimulus materials were held constant. The likelihood and the nature of memory blends in CHARM depend on how objects are represented and on how to-be-associated pairs of items are selected. CHARM tells us neither how stimuli are transformed into ordered and weighted sets of "features" representing discrete "items" nor how pairs of items are selected to be associated. This limitation is not at all unique to CHARM; most if not all formal models of memory sidestep issues of encoding and categorization. In the absence of any account of these processes, however, CHARM itself does relatively little to answer McCloskey and Zaragoza's (1985b) criticism of the unexplained selectivity and intelligence of blending. Association Versus Remembering In its present form, CHARM is a model of association, not a model of remembering. Remembering involves an attribution of aspects of current mental experience to prior experience (cf. Anderson & Bower, 1972; Jacoby, Kelley, & Dywan, 1989; Johnson, 1989;Kintsch, 1974; Mandler, 1972;Tulving, 1984). In the case of eyewitness suggestibility, what is to be modeled is a memory error in which subjects allegedly remember something that was suggested to them as something that they witnessed during a particular past event (Lindsay & Johnson, 1989). People do not recollect only abstract, decontextualized lexical "items," but rather aspects of specific past events. Furthermore, people generate many associations during attempts to recall that they do not emit as recalled items (e.g., Anderson & Bower, 1972). CHARM has no basis for attributing a retrieved vector to a particular past event, because no information about occurrence (source, context, conditions of encounter) is stored. Thus, although association is undoubtedly a very important contributor to and aspect of remembering, CHARM would be a much better model of memory if it were modified to use information about conditions of encounter (source). 3 This effect might be somewhat smaller than a reversal if withincategory structure was represented (i.e., "tool" used as a prototype and "hammer" and "wrench" derived from it). Because of the similarity relations, both the control term and the misleading suggestion would produce less distortion of the event detail in this case, and control terms would produce slightly less distortion of event details than would misleading suggestions. On the other hand, within-category similarity would also make the "tool" versus "hammer" discrimination more difficult than the Standard Test ("hammer" vs. "wrench") discrimination. In any case, as long as "hammer," "wrench," and "tool" were discriminable, CHARM would more often err on a "hammer" versus "tool" test pair if "tool" had been added to the composite trace (control condition) than if "wrench" had been added to it (misled condition). 4 CHARM's prediction that distortion of memory of the event detail increases with the dissimilarity between the event detail and the suggested detail also seems counterintuitive. Would a misleading suggestion to the efTect that the calculator had been hidden under a cloth really impair memory for "hammer" to a greater extent than a suggestion to the effect that the calculator was hidden under a wrench? 104 COMMENTS General Critique of Convolution-Correlation Models At a more general level, vector convolution-correlation models such as CHARM seem problematic for a number of reasons. First, it is difficult to imagine an efficient system in which all items are represented by the same ordered set of features (cf. Craik & Lockhart, 1986; Pike, 1984). To represent ideas as diverse as "hammer," "tool," and "thief with the same ordered set of features, the vectors would have to be very large (with values of zero on most features). Convolution and correlation with large vectors are very time-consuming processes (indeed, even encoding items into such large vectors might take a lot of time). Second, people do not restrict association formation to single pairs of items. In Metcalfe's (1990) studies, memory of a slide that shows a man in an office hiding a stolen calculator under a hammer in his tool box was simulated by convolving vectorial representations of "man" and "hammer." If people associate "man" with "hammer," surely they also associate "man" with "calculator," "tool box," "office," "thief," "slide sequence," and so on. Pair-wise association between particular objects seems an odd way to simulate human memory for a naturalistic scene—if only because a very large (possibly unbounded) number of associations would have to be performed in very little time. Third, given a processor that could perform a large number of convolutions involving large vectors in very little time, the composite trace would quickly become very messy. With the associations mentioned above, for example, the retrieval cue "man" would produce a single vector composed of a superimposed melange of features from "hammer," "calculator," "tool box," "office," "thief," and "slide sequence." Summary In summary, the data produced by the simulations reported in Metcalfe (1990) very closely match the pattern of results obtained with human subjects, and CHARM has several appealing properties. Most important, CHARM'S interactive nature provides a formal mechanism by which misleading suggestions could impair memory for event details without affecting performance on the McCloskey and Zaragoza (1985a) Modified Test. Nonetheless, in its present form the model does little to expand our understanding of eyewitness memory. The control terms used in the simulations differ in an important way from those used in studies with humans, and the model makes a counterintuitive prediction about responses to control terms presented in recognition test pairs. In its present form, CHARM is a model of association, not of remembering, because no information about conditions of encounter is stored. Moreoever, CHARM itself tells us little about the nature or likelihood of memory blends, because most of the intelligence involved in the simulations lies in the human who runs them rather than in the model itself. Finally, the model's key assumptions appear to be problematic. Although these shortcomings might be reduced in modified versions of CHARM or in other formal models, it may be more profitable to make use of some of the general principles that underlie CHARM (e.g., the interactive relation between encoding and retrieval) in research conducted with human subjects. References Anderson, J. R., & Bower, G. H. (1972). Recognition and retrieval processes in free recall. Psychological Review, 79, 97-123. Belli, R. F. (1988). Color blend retrievals: Compromise memories or deliberate compromise responses? Memory and Cognition, 16, 314-326. Belli, R. F. (1989). Influences of misleading postevent information: Misinformation interference and acceptance. Journal of Experimental Psychology: General, US, 72-85. Chandler, C. C. (1989). Specific retroactive interference in modified recognition tests: Evidence for an unknown cause of interference. Journal of Experimental Psychology: Learning, Memory, and Cognition, 15, 256-265. Craik, F. I. M., & Lockhart, R. S. (1986). CHARM is not enough: Comments on Eich's model of cued recall. Psychological Review, 93, 360-364. Eich, J. M. (1982). A composite holographic associative recall model. Psychological Review, 89. 627-661. Eich, J. M. (1985). Levels of processing, encoding specificity, elaboration, and CHARM. Psychological Review, 92, 1-38. Jacoby, L. L., Kelley, C. M., & Dywan, J. (1989). Memory attributions. In H. L. Roediger III & F. I. M. Craik (Eds.), Varieties of memory and consciousness: Essays in honor ofEndel Tulving (pp. 391-422). Hillsdale, NJ: Erlbaum. Johnson, M. K. (1989). Reality monitoring: An experimental phenomenological approach. Journal of Experimental Psychology: General, 117, 390-394. Kintsch, W. (1974). The representation of meaning in memory. Erlbaum: Hillsdale, NJ. Lewandowsky, S., & Hockley, W. E. (1987). Does CHARM need depth? Similarity and levels-of-processing effects in cued recall. Journal of Experimental Psychology: Learning, Memory, and Cognition, 13, 443-455. Lindsay, D. S., & Johnson, M. K. (1989). The eyewitness suggestibility effect and memory for source. Memory and Cognition, 17, 349358. Loftus, E. F. (1977). Shifting human color memory. Memory & Cognition, 5, 696-699. Loftus, E. F. (1981). Mentalmorphosis: Alterations in memory produced by the mental bonding of new information to old. In J. Long & A. Baddeley (Eds.), Attention and performance IX (pp. 417-434). Hillsdale, NJ: Erlbaum. Loftus, E. F. (1989, June). Human memory: Evolution and creation. Paper presented at the annual meeting of the American Psychology Society, Alexandria, VA. Loftus, E. F., & Loftus, G. R. (1980). On the permanence of stored information in the human brain. American Psychologist, 35, 409420. Loftus, E. F., Miller, D. G., & Burns, H. J. (1978). Semantic integration of verbal information into a visual memory. Journal of Experimental Psychology: Hitman Learning and Memory, 4, 19-31. Loftus, E. F., Schooler, J. W., & Wagenaar, W. (1985). The fate of memory: Comment on McCloskey and Zaragoza. Journal of Experimental Psychology: General, 114, 375-380. Mandler, G. (1972). Organization and recognition. In E. Tulving & W. Donaldson (Eds.), Organization of memory (pp. 146-167). San Diego, CA: Academic Press. McCloskey, M., & Zaragoza, M. (1985a). Misleading postevent information and memory for events: Arguments and evidence against memory impairment hypotheses. Journal of Experimental Psychology: General, 114, 1-16. McCloskey, M., & Zaragoza, M. (I985b). Postevent information and memory: Reply to Loftus, Schooler, and Wagenaar. Journal of Experimental Psychology: General, 114, 381-387. COMMENTS Metcalfe, J. (1990). Composite holographic associative recall model (CHARM) and blended memories in eyewitness testimony. Journal of Experimental Psychology: General, 119, 145-160. Pike, R. (1984). Comparison of convolution and matrix distributed memory systems for associative recall and recognition. Psychological Review, 91, 281-294. Posner, M. I., & Keele, S. W. (1968). On the genesis of abstract ideas. Journal of Experimental Psychology, 77, 353-363. Rosch, E. H., & Mervis, C. B. (1975). Family resemblances: Studies in the internal structure of categories. Cognitive Psychology, 7, 573-605. Rosch, E. H., Mervis, C. B., Gray, W. D., Johnson, D. M., & BoyesBraem, P. (1976). Basic objects in natural categories. Cognitive Psychology, 8, 382-439. Tulving, E. (1984). Precis of Elements of Episodic Memory. Brain and Behavioral Sciences, 7, 223-268. 105 Tversky, B., & Tuchin, M. (1989). A reconciliation of the evidence on eyewitness testimony: Comments on McCloskey and Zaragoza (1985). Journal of Experimental Psychology: General, 118, 86-91. Zaragoza, M. S., & McCloskey, M. (1989). Misleading postevent information and the memory impairment hypothesis: Comment on Belli and reply to Tversky and Tuchin. Journal of Experimental Psychology: General, 118, 92-99. Zaragoza, M. S., McCloskey, M., & Jamis, M. (1987). Misleading postevent information and recall of the original event: Further evidence against the memory impairment hypothesis. Journal of Experimental Psychology: Learning, Memory, and Cognition, 13, 36-44. Received May 1, 1990 Revision received July 20, 1990 Accepted August 1, 1990 • Butcher, Geen, Hulse, and Salthouse Appointed New Editors, 1992-1997 The Publications and Communications Board of the American Psychological Association announces the appointments of James N. Butcher, University of Minnesota; Russell G. Geen, University of Missouri; Stewart H. Hulse, John Hopkins University; and Timothy Salthouse, Georgia Institute of Technology as editors of Psychological Assessment: A Journal of Consulting and Clinical Psychology, the Personality Processes and Individual Differences section of the Journal of Personality and Social Psychology, the Journal of Experimental Psychology: Animal Behavior Processes, and Psychology and Aging, respectively. As of January 1, 1991, manuscripts should be directed as follows: • For Psychological Assessment send manuscripts to James N. Butcher, Department of Psychology, Elliott Hall, University of Minnesota, 75 East River Road, Minneapolis, Minnesota 55455. • For JPSP: Personality send manuscripts to Russell G. Geen, Department of Psychology, University of Missouri, Columbia, Missouri 65211. • For JEP: Animal send manuscripts to Stewart H. Hulse, Johns Hopkins University, Department of Psychology, Ames Hall, Baltimore, Maryland 21218. • For Psychology and Aging send manuscripts to Timothy Salthouse, Georgia Institute of Technology, School of Psychology, Atlanta, Georgia 30332.