Download Interactions between attention, context and learning in primary

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Allochiria wikipedia , lookup

Convolutional neural network wikipedia , lookup

Emotion and memory wikipedia , lookup

Neuroeconomics wikipedia , lookup

Synaptic gating wikipedia , lookup

Emotion perception wikipedia , lookup

Perception wikipedia , lookup

Cortical cooling wikipedia , lookup

Neuropsychopharmacology wikipedia , lookup

Aging brain wikipedia , lookup

Environmental enrichment wikipedia , lookup

Executive functions wikipedia , lookup

Perceptual learning wikipedia , lookup

Visual servoing wikipedia , lookup

Response priming wikipedia , lookup

Psychophysics wikipedia , lookup

Eyeblink conditioning wikipedia , lookup

Stimulus (physiology) wikipedia , lookup

Visual search wikipedia , lookup

Time perception wikipedia , lookup

Neural correlates of consciousness wikipedia , lookup

Visual extinction wikipedia , lookup

Attention wikipedia , lookup

Transsaccadic memory wikipedia , lookup

Neuroesthetics wikipedia , lookup

Visual selective attention in dementia wikipedia , lookup

P200 wikipedia , lookup

Visual spatial attention wikipedia , lookup

Feature detection (nervous system) wikipedia , lookup

Visual N1 wikipedia , lookup

C1 and P1 (neuroscience) wikipedia , lookup

Transcript
Vision Research 40 (2000) 1217 – 1226
www.elsevier.com/locate/visres
Interactions between attention, context and learning in primary
visual cortex
Charles Gilbert a,*, Minami Ito b, Mitesh Kapadia a, Gerald Westheimer a
b
a
The Rockefeller Uni6ersity, 1230 York A6enue, New York, NY 10021, USA
Laboratory for Neural Control, National Institute for Physiological Sciences, Myodaiji, Okazaki 444 -8585, Japan
Received 4 June 1999; received in revised form 12 October 1999
Abstract
Attention in early visual processing engages the higher order, context dependent properties of neurons. Even at the earliest
stages of visual cortical processing neurons play a role in intermediate level vision — contour integration and surface
segmentation. The contextual influences mediating this process may be derived from long range connections within primary visual
cortex (V1). These influences are subject to perceptual learning, and are strongly modulated by visuospatial attention, which is
itself a learning dependent process. The attentional influences may involve interactions between feedback and horizontal
connections in V1. V1 is therefore a dynamic and active processor, subject to top-down influences. © 2000 Elsevier Science Ltd.
All rights reserved.
Keywords: Primary visual cortex; Perceptual learning; Contextual influences; Receptive fields; Visuospatial attention
To explore the role of attention in any cortical area
one has to take into account the role that area plays in
the processing of visual information. The primary visual cortex (V1) is engaged in much more complex
processes than previously believed, and the very concept of the receptive field has undergone a substantial
modification. The processes included under the rubric
of intermediate level vision, such as contour integration
and surface segmentation, may be represented at the
earliest stages of the visual cortex pathway. This view is
in sharp contrast to the classical notion about V1
analyzing only the most rudimentary attributes of the
visual world, such as local orientation or the position in
depth of local features. Instead, the responses of neurons in V1 are dependent on the precise geometric
relationships between line segments or texture elements
within the receptive field and contours and surfaces
extended for considerable distances outside the receptive field. The response properties of cells to features in
the visual scene are highly dependent on the context
within which those features are placed, and those con* Corresponding author. Tel.: +1-212-3277670; fax: + 1-2123277844.
E-mail address: [email protected] (C. Gilbert)
textual influences are the properties most modulated by
attention. In addition, the contextual modulation is
influenced by learning, and the attentional influences
themselves are subject to learning effects.
The idea that the response of neurons to local attributes is dependent on the global characteristics of
contours and surfaces is consonant with the basic tenet
of Gestalt psychology. The principle, as put forward by
Wertheimer, holds that the ‘‘behavior of the whole
cannot be derived from its individual elements nor from
the way these elements fit together; rather the opposite
is true: the properties of any of the parts are determined
by the intrinsic structural laws of the whole’’
(Wertheimer, 1923). The contextual modulation of receptive fields, though currently put under the category
of the ‘non-classical’ receptive field, is actually close to
one of the original descriptions of the receptive field by
Stephen Kuffler. He emphasize that ‘‘… not only the
areas from which responses can actually be set up by
retinal illumination may be included in a definition of
the receptive field but also all areas which show a
functional connection, by an inhibitory or excitatory
effect on a ganglion cell. This may well involve areas
which are somewhat remote from a ganglion cell and
by themselves do not set up discharges’’ (Kuffler, 1953).
0042-6989/00/$ - see front matter © 2000 Elsevier Science Ltd. All rights reserved.
PII: S 0 0 4 2 - 6 9 8 9 ( 9 9 ) 0 0 2 3 4 - 5
1218
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
One now can extend the concept of receptive field even
further to include the potential for dynamic changes on
a range of time scales (from seconds to weeks) and
depending on experience, behavioral task and attention
and expectation.
A number of studies have shown that cells in primary
visual cortex are sensitive to stimuli lying outside their
receptive fields. The classical measure of the receptive
field, known as the minimum response field, is obtained
by placing an optimally oriented line in different positions within the visual field. Complex stimuli, composed
of multiple line segments presented simultaneously,
show strong modulation of cells’ responses relative to
the response to a single line within the receptive field.
This has led to the distinction between the ‘classical’
and ‘non-classical’ receptive field, although even those
properties termed non-classical would fall under
Kuffler’s classic description of the receptive field. The
non-classical properties have been implicated in a range
of roles including contour integration, perceptual fill-in,
surface segmentation and orientation contrast (Blakemore & Tobin, 1972; Maffei & Fiorentini, 1976; Nelson
& Frost, 1978; Allman, Miezin & McGuinness, 1985;
Nothdurft & Li, 1985; Tanaka, Hikosaka, Saito, Yukie,
Fukada & Iwai, 1986; Orban, Guylas & Vogels, 1987;
Gilbert & Wiesel, 1990; Knierim & Van Essen, 1992; Li
& Li, 1994; Kapadia, Ito, Gilbert & Westheimer, 1995;
Lamme, 1995; Sillito, Grieve, Jones, Cuderio & Davis,
1995; Rossi, Rittenhouse & Paradiso, 1996; Zipser,
Lamme & Schiller, 1996; Kastner, Nothdurft & Pigarev, 1997; Levitt & Lund, 1997; Kapadia, Westheimer & Gilbert, 1998, 1999).
An important source of contextual influences is a
plexus of long range horizontal connections that link
cells with widely separated receptive fields. This is a
network of connectivity formed by the axons of cortical
pyramidal cells. Pyramidal and spiny stellate cells represent 80% of the neurons in the cortex, each cell can
form lateral connections of varying extents. The longest
connections run for 5 – 6 mm, from one end of the
axonal arbor to another (Gilbert & Wiesel, 1979, 1983,
1989; Rockland & Lund, 1982; Gilbert, 1992; Bosking,
Zhang, Schofield & Fitzpatrick, 1997). From the point
of view of the neurons receiving these connections, the
area of visual space represented by the area of cortex
from which they integrate input can be an order of
magnitude larger than their own receptive fields. The
horizontal connections are distributed in discrete clusters of axon collaterals, showing an underlying specificity in the connections. This specificity is related to the
columnar functional architecture of cortex, with connections linking columns of similar orientation preference and cells with widely separated receptive fields
(Gilbert & Wiesel, 1979, 1983, 1989; Malach, Amir,
Harel & Grinvald, 1993; Weliky, Kandler, Fitzpatrick
& Katz, 1995; Bosking et al., 1997; Kisvarday, Toth,
Rausch & Eysel, 1997). The registration between the
horizontal connections and the columnar functional
architecture mirrors the perceptual phenomenon of
greater saliency for contours made of similarly oriented
line segments, and is reflected in the geometry of contextual influences (Wertheimer, 1923; Grossberg &
Mingolla, 1985; Ullman, 1990; Field, Hayes & Hess,
1993; Wolfson & Landy, 1999).
One must also allow for possible contributions from
feedback connections in the global response properties
of cortical neurons. It has been suggested, by looking at
the effect of inactivation of V2 on center-surround
interactions in V1 (Hupe, James, Payne, Lomber,
Girard & Bullier, 1998) and because of the delayed time
course of responses to texture discontinuities (Lamme,
1995; Zipser et al., 1996) that at least some of the
contextual influences are derived from feedback connections from higher order cortical areas to area V1. As
shown below, however, the contextual facilitation we
observe follows the full time course of the response. In
any event, it is not clear that routing information
through V2 back to V1 requires a substantial additional
delay, and there might be equivalent delays in the
lateral influences originating from V1. The spatial characteristics and columnar specificity of horizontal connections and the rules of perceptual interactions
(contour saliency and contrast enhancement) are quite
comparable. The interplay between attention and contextual influences, as we will develop, does argue for an
interaction between intrinsic horizontal connections
and feedback connections in the final outcome of visuospatial integration.
The contextual influences for cells in V1 show a
powerful facilitation that depends on the precise relative placement of line segments inside and outside the
receptive field. Psychophysical experiments measuring
the detectability of a line segment show a similar facilitation when the line is associated with a nearby,
collinear line segment (Dresp, 1993; Polat & Sagi, 1993,
1994; Kapadia et al., 1995). Subjects can detect a line at
40% lower contrast when paired with a collinear line
than when presented in isolation. The response of cells
in the superficial layers of the primary visual cortex of
alert monkeys to a single line inside the receptive field
can be facilitated as much as 3-fold by adding a line
segment outside the receptive field (Fig. 1, see Kapadia
et al., 1995; Polat, Mizobi, Pettet, Kasamatsu & Norcia, 1998). The facilitation is maximal when the second
line is collinear, adjacent to and of the same orientation
as the first. The fall off in facilitation in contrast
detection thresholds and the facilitation in the responses of cells to contextual stimuli, when measured at
the same eccentricity, follow the same dependency on
the separation between the lines along the collinear
axis, on the lateral offset between the lines, and on the
relative orientation of the two lines. The spatial scale of
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
these effects also mirror the scale of the long range
horizontal connections. The fact that a line outside the
receptive field by itself does not activate the cell, yet
when presented in conjunction with a line within the
receptive field can triple the response of the cell, indicates a substantial degree of non-linearity in cell responses. The facilitation can be blocked by placing a
perpendicular line between the two collinear lines, suggesting a role of the facilitation in linkage of line
segments that compose a contour.
The contextual modulation of cell responses demonstrates that one cannot predict the response of a cell to
a complex visual scene from its responses to the individual elements of the scene presented in isolation. This
non-linearity can be accounted for by the nature of the
e.p.s.p. generated by the horizontal connections, which
is much larger when the cell is depolarized than when it
Fig. 1. Facilitation in detection and responses to a target line by a
collinear flanking line. (A) Subjects were asked to report the presence
of a line presented in the near periphery (at 4° eccentricity) at a range
of luminances, when presented in isolation and in conjunction with a
flanking line. As measured by the psychometric curves, when the
target was flanked by a colinear line, it could be detected at much
lower contrast than when it was presented alone. This effect was
diminished as the flanking line was separated from the target, shifted
along an axis orthogonal to its orientation, or changed in relative
orientation. (B) Cells were recorded in the superficial layers of area
V1. One line was placed within the receptive field and a second line
was placed outside the receptive field. The cell did not respond to the
second line presented by itself, but its response was increased 2- or
3-fold when the second line was presented in conjunction with the line
within the receptive field. The facilitation dropped off with increasing
separation along the collinear axis, by a lateral offset between the two
lines, and by changing the orientation of the flanking line. At
matched eccentricities the contrast reduction by a flanking line measured psychophysically and the facilitation measured physiologically
showed dependency on line separation and relative orientation over
similar spatial scales (Kapadia et al., 1995).
1219
Fig. 2. Averaged response histograms among cells showing contextual
facilitation The response to contextual stimuli at the relative angle
and attentional state showing the highest facilitation was selected for
each cell. The response of each cell was normalized to the same unit
area. The facilitation extended throughout the entire time course of
the cells’ responses. The black underlining indicates stimulus presentation time. The gray bar indicates the time period of significant
facilitation (P B0.05). From Ito and Gilbert (1999).
is at rest (Hirsch & Gilbert, 1991). As a result, the
horizontal input, activated by a contextual stimulus, is
stronger when the cell is simultaneously activated by
interlaminar connections, which would be activated by
stimuli lying within the receptive field.
In the presence of more complex visual environments, and under distributed attention (see below) the
facilitation is seen not just with stimuli presented at
threshold contrast but with suprathreshold stimuli as
well (Ito & Gilbert, 1999). The time course of the
facilitation follows that of the response itself, and the
effect of attention, which we will outline below, is
exerted on the earliest components of the response.
Over a recorded population of cells in the superficial
layers of primary visual cortex, the presence of a contextual line produces an average 2-fold facilitation, and
this extends throughout the full time-course of the
response (Fig. 2). In this series of experiments, the data
were selected from the animal’s attentional state that
showed the maximal amount of facilitation for each
cell.
The idea that the facilitation relates to the integration
and saliency of contours is supported by experiments in
which a contour is embedded in a complex background
(Fig. 3). Here, a line was placed within the receptive
field at the optimum orientation for the cell, and this
line was surrounded by an array of randomly positioned and oriented lines. Individual line segments in
the surrounding array were then moved into alignment
with the line within the receptive field, forming a long
straight line extending well beyond the receptive field.
When there is a single line within the receptive field
surrounded by the complex background, the response
of the cell is greatly inhibited by the surround. As lines
1220
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
are shifted into alignment with the line within the
receptive field, the cell is lifted from the inhibition. As
the line is further extended by adding line segments, the
response can be further facilitated to levels much
greater than those seen when the line within the receptive field is presented in isolation. This shows that the
responses of neurons within V1 are as dependent on the
global characteristics of contours extending well beyond the boundary of the classical receptive field as
they are upon the attributes of features lying within the
receptive field. It also indicates that the process of
intermediate level vision may find its substrate as early
in the visual cortical pathway as the primary visual
cortex (Kapadia et al., 1995; Polat et al., 1998). The
evidence for V1 participation in contour integration, in
the sense of contour saliency, extends the evidence for
responses in V2 to illusory contours (von der Heydt
and Peterhans, 1989).
With this as background, one can then pursue the
question of the role of attention in modulating the
responses of cells in V1. Since the responses of cells in
V1 depend on the precise geometric juxtaposition of
lines inside and outside the receptive field, it is important to take the specificity of their responses to stimuli
Fig. 3. Facilitation in responses by collinear stimuli in a complex background. Top, the range of stimuli used, from a single bar within the
receptive field (1), one bar inside and one or two flanking bars (2, 3), a single flanking bar alone (4), the inside bar embedded in a complex
background of randomly positioned and oriented bars (5), within that background adding one, two and 4 collinear flanking bars (6, 7, 8). Bottom,
the cells show varying amounts of facilitation to the flanking bars, but this facilitation is greater in the presence of the complex background. The
background inhibits the response of the cell, but the presence of a global contour extending beyond the receptive field removes the inhibition and
produces responses greater than that obtained by the isolated bar within the receptive field. From Kapadia et al. (1995).
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
Fig. 4. Stimulus array used to explore the role of visuospatial
attention on the contextual facilitation in perceived brightness and on
the responses of cells in primary visual cortex. Four sets of target and
flanking lines were presented symmetrically around the fixation point
(the target stimuli, to which the subject had to respond, were the
inner ones of the pairs). A reference line was presented near the
fixation spot, and the subject had to report whether one of the target
lines was brighter or dimmer than the reference. On some trials the
subject was cued to the position of the odd target prior to the
presentation of the stimulus array (focal attention) and on others was
cued to all four positions (distributed attention). For the physiological experiments, the focal attention trials were further divided into
trials in which attention was focussed on the receptive field position
and trials in which attention was focussed onto a position away from
the receptive field. The orientation and positions of the bars were
adjusted such that the target bar was centered within the receptive
field and oriented at the optimum orientation of the cell, with the
remaining bars distributing symmetrically around the fixation point.
From Ito, Westheimer and Gilbert (1998) and Ito and Gilbert (1999).
Fig. 5. Effect of attention on contrast discrimination threshold and
contextual facilitation. Top, the stimulus array was either a set of
four target lines and a reference line near the fixation spot, or four
pairs of target and flanking lines. Bottom, there were marked differences in both the threshold and facilitation between distributed
attention and focal attention trials. The most marked difference was
observed in the facilitation in perceived brightness by the flanking
line. From Ito et al. (1998).
of higher order complexity when exploring attentional
effects. We therefore designed a stimulus array and a
behavioral paradigm that would probe the effect of
visuospatial attention on contextual influences as well
as upon single line stimuli within the receptive field.
The experiments were designed to provide a behavioral
measure of these effects as well as to explore the effect
1221
of attention on cell responses. Here we used
suprathreshold stimuli and a discrimination task rather
than the detection task used initially. The subjects, both
human and non-human primate, were asked to report
whether a target line, presented at 3.5° eccentricity, was
brighter or dimmer than a reference line presented
adjacent to the fixation spot. In addition to the target
line, on some trials there was a flanking collinear line to
which the subjects did not attend. Four sets of target
and flanking lines were presented, placed symmetrically
around the fixation point at the 45/135/225/315° meridia (Fig. 4). On some trials, termed the distributed
attention trials, the subject was cued to all four target
positions. In these the subject had to determine whether
any of the four target lines were brighter or dimmer
than the reference line. On other trials, termed the focal
attention trials, the subject was cued to the single
position in which the odd target would be presented. In
this task, even though the target stimuli were presented
at suprathreshold stimuli, there was still a large facilitatory effect of the flanking line. In the presence of the
flanking line, the psychometric curve on the perceived
brightness of the target line relative to the reference line
was shifted to the left. The shift of the mean in the
curve provided a measure of the change in perceived
brightness, and the slope provided a measure of the
discrimination threshold. One can then determine the
relative effects of distributed versus focal attention on
the threshold and on the facilitation.
For naı̈ve subjects, there was a substantial difference
in the perceptual facilitation of a flanking line between
distributed and focal attention (Fig. 5). The effect was
much stronger under distributed attention. In effect, if
one is already attending to a stimulus, there is little
additional saliency derived from the stimulus characteristics themselves. There was also an influence of attention on discrimination threshold, consistent with earlier
studies (Beck & Ambler, 1973; Cohn & Lasley, 1974;
LaBerge, 1983; Eriksen & St. James, 1986; Krose &
Julesz, 1989; Lindblom & Westheimer, 1992a,b; Balz &
Hock, 1997). In examining the effect of attention on the
responses of cells, one can further subdivide the category of focal attention into attention to the receptive
field position and attention away from the receptive
field position. An example of a cell recorded in one
monkey, SA, is shown in Fig. 6. Under distributed
attention the cell showed the nearly 3-fold facilitation
by contextual stimuli that we had seen previously.
Under focal attention, the facilitation disappeared.
Because the animals receive different cues under focal
and distributed attention, and because the sizes of
receptive fields in V1 are relatively small, it is natural to
ask whether predictive eye movements might influence
the results. The animals were highly trained on the task
of fixation, and any movement outside of the fixation
window (0.5° radius) aborted the trial. The fixation had
1222
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
to be more precise than that allowed by the window,
since the animals had to detect a slight dimming of the
fixation spot. Measurements of eye position with a
scleral search coil bore out this argument. There was
less than 0.1° difference in eye position between the
different attentional states, which was not significant
and with no systematic relationship between mean eye
position and the cue position (Fig. 7). There was also
no difference in the variance of eye position between
the different attentional states. Shifting eye position on
the order of 0.1° produced no change in the amount of
facilitation.
An important factor in the attentional influences is
effect of learning. One can train subjects not just on the
attributes of a visual stimulus, but even an attentional
task is itself subject to learning. This was seen in the
focal vs. distributed attention task, where the performance observed under distributed attention converged
to that seen under focal attention (Fig. 8). That is, the
performance under distributed attention after training
improved to the level of focal attention before training.
The improvement effectively represents an ability to
perform ‘multi-focal’ attention, by increasing the number of attentional foci maintained in parallel. This
learning might be equivalent to that described for the
transformation from serial to parallel search tasks by
learning (Sireteanu & Rettenbach, 1995). The improvement shows an interesting specificity for the stimulus
configuration used in training. If the number of targets
and flanks are increased from four to eight (e.g. eight
targets presented at the 0/45/90/135/180/225/270/315°
meridia, equidistant from the fixation point), there is a
return to the original level of performance on the
distributed attention task, but only for the new stimulus
positions — the original positions retain the improved
performance. The improvement is not, however, visuotopically specific (it transfers to visual field locations
Fig. 6. Influence of attention on responses in primary visual cortex.
For this cell, recorded in the superficial layers of primary visual
cortex, when the animal distributed attention to all four target
locations there was a severalfold increase in the response to the target
plus flanking line compared to the target alone response. This facilitation disappeared when the animal was cued to focus attention to one
of the target lines. From Ito and Gilbert (1999).
Fig. 7. Measurement of eye position, receptive field size and contextual facilitation. (A) Minimum response field profile and stimulus
placement. In this example, a single bar was placed at half bar length
intervals, providing a measure of receptive field boundary. The target
stimulus was centered over the position of highest response, and a
second stimulus of equal size was placed outside the receptive field to
ensure that it would not activate the cell by itself. (B) Attentional
modulation of responses of cell shown in A. Contextual facilitation
was maximal when the monkey attended to the receptive field position (focal on), and disappeared when the animal distributed his
attention to all four sites or attended to positions away from the
receptive field. Note that the target alone response was the same for
focal attention on and focal attention away, even though the contextual modulation was very different. (C) There was no predictive eye
drift towards the cued location while we collected the responses
shown in B. The mean eye position under distributed attention is
represented as the center of the graph, and the eye positions measured
under the other conditions are shown relative to this point. For each
attentional state, the averaged position and standard deviation are
indicated. The outer circle represents a distance 0.5° from the center.
The arrow indicates the direction toward the receptive field (RF) of
this cell. (D) Eye positions measured in the entire day’s session,
during which we recorded from the cell shown in this Figure. We
divided data into six groups for each cued location. Description was
same as in C. Arrow and mark indicate corresponding direction
toward cued-location. Since the receptive field position was different
for each cell, directions towards the receptive field are rotated to the
same direction (arrow RF) and the other cued locations are shown
relative to the RF direction. (E) The data shown in B (focal on) was
divided into two groups, one half taken from trials where the eye
fixations were closer to the receptive field of the cell (near) and the
other half from trials was the eye was farther from the receptive field
(far). Though the eye position in these two groups differed by 0.11°,
there was no difference in the degree of facilitation. Taken from Ito
and Gilbert (1999), Fig. 3.
other than those in which the training stimuli were
located). If the original four stimulus array is rotated to
the new positions (e.g. from the 45/135/225/315° meridia to the 0/90/180/270° meridia), one sees the improved level of performance in the new positions.
Effectively the subjects are capable of performing a
mental rotation of the stimulus configuration. The specificity for stimulus configuration, rather than for
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
Fig. 8. Effect of training on attentional task. Top, results for seven
subjects, including to Macaque monkeys. The facilitation under distributed attention converged to that measured under focal attention
after several weeks of training. Bottom, in the Macaques, as a result
of overtraining, the endpoint of training showed opposite effects in
the two monkeys. Monkey SA showed greater facilitation under
distributed attention, and monkey UM showed greater facilitation
under focal attention. From Ito et al. (1998) and Ito and Gilbert
(1999).
visuotopic locus, speaks to an interaction between areas
which represent a global representation of the complex
array and V1, where precise location is represented.
1223
Perceptual learning can originate from a combination
of top-down influences, as suggested from its configuration-dependency, and from local, early cortical processing, as as suggested from its visuotopic specificity (Shiu
& Pashler, 1992; Treisman, Veira & Hayes, 1992;
Ahissar & Hochstein, 1993; Sagi & Tanne, 1994;
Gilbert, 1994; Fahle & Morgan, 1996; Crist, Kapadia,
Westheimer & Gilbert, 1997). In addition to the role of
attention in enabling perceptual learning of stimulus
attributes, it is important to emphasize the role of
perceptual learning of attentional tasks. One cannot,
therefore, expect an equivalency between subjects in
correlating attention with physiology — it is important
to measure performance on the same individuals from
which physiological data are obtained (e.g. one must do
psychophysics with the monkeys used for the microelectrode recording studies).
In the non-human primates we studied, there were
further learning dependent changes as a result of overtraining. As a result, one animal, SA, showed more
contextual facilitation under distributed attention and
the other animal, UM, showed greater contextual facilitation under focal attention (Fig. 8). The fact that the
animals were overtrained on the task leads to the
different effects of attention on facilitation than that
observed before recording and to differences relative to
the results obtained from human subjects. This led to
differences in the attentional effects measured in area
V1, as seen over the population of recorded cells (Fig.
9). For subject SA, cells showed more contextual facilitation under distributed attention than under focal
attention. In contrast, there was little difference in the
Fig. 9. Effect of attention over the population of recorded V1 neurons. The data were collected for the two monkeys in which the psychophysical
measurements were made, SA (A,C) and UM (B,D). Two properties were measured: the mean response to the test line alone under the three
attentional states (A,B) and the mean percentage facilitation by a flanking bar (0% indicates no facilitation, 100% indicates a doubling of the
response). For SA, the facilitation was larger under distributed attention than under focal attention, and for UM the relationship was reversed,
with greater facilitation under focal attention. Note that for both animals the facilitation with focal attention away from the receptive field
position was equivalent to that seen under distributed attention, indicating that the modulation by focal attention was restricted to sites near the
attentional focus. The behavioral conditions in the upper graphs follow the same order as those in the lower graphs. From Ito and Gilbert (1999).
1224
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
target-alone response over the three attentional states.
For subject UM, on the other hand, cells showed more
contextual facilitation under focal than under distributed attention. Again, the target alone responses
were unchanged by attention. Another point worth
emphasizing is that for both animals the facilitation
seen under focal attention away from the receptive field
was similar to that seen under distributed attention, so
that the effect of focal attention was restricted to
positions near the receptive field. The attentional effects
on contextual facilitation within primary visual cortex
are therefore pronounced, changing by a factor of two
or more according to attentional state. The effects on
simple visual stimuli, however, are negligible.
The small effect of attention on the target alone
response, compared with the powerful effect of attention on contextual influences, might account for the
discrepancy with respect to earlier reports showing little
effects of attention in V1 (Wurtz & Mohler, 1976;
Haenny & Schiller, 1988; Motter, 1993; Luck, Chelazzi,
Hillyard & Desimone, 1997). However, the size of
attentional effects has been shown to increase with the
number of distractors (Motter, 1993; Vidyasagar, 1998).
This is consistent with ideas that attention is important
when a target must be found in a cluttered field, that
attention represents a competition between stimuli, and
that attending to one location reduces processing of the
local surround (Engel, 1971; Desimone & Duncan,
1995; Bahcall and Kowler, 1999). The earlier studies
failing to activate V1 generally used quite sparse visual
stimulation and simple tasks. Physiological studies involving a curve tracing task (Roelfesma, Lamme &
Spekreijse, 1998) and several fMRI and EEG studies
(Brefczynski & DeYoe, 1999; Gandhi, Heeger & Boynton, 1999; Martinez, Anllo-Vento, Sereno, Frank, Buxton, Dubowitz, et al., 1999; Somers, Dale, Seiffert &
Tootell, 1999) now show substantial attention effects in
V1. Subjects asked to imagine the appearance of an
object in the absence of visual input (visual imagery)
show activation of V1 (Kosslyn, Thompson, Kim,
Rauch & Alpert, 1995). Taken together, these findings
and those in the current work demonstrate the importance of top-down influences in the activation of V1
when subjects are actively engaged in interpreting the
visual environment. The degree of attentional modulation of responses in V1 is also dependent on the behavioral context. The amount of attentional resources
demanded by the task, for example, plays an important
role (Nakayama & Joseph, 1997). Attention is therefore
not an all or none phenomenon, rather it is dependent
upon the requirements of the task and the nature of the
attended stimuli.
Our results show that attentional effects can be quite
large, by a factor of two or more, when one looks at the
proper psychophysical measurement — contextual facilitation. It is clear, however, that the attentional ef-
fects show several dependencies that must be taken into
account when studying the attention effects in any
cortical area. First, it is important to select the correct
visual stimulus, taking into account the higher order
functional properties of cells and the manner in which
they are analyzing the visual scene. Second, it is also
necessary to take into account the appropriate behavioral context, where the attentional task, both in terms
of kind and difficulty, will determine the nature and
extent of modulation exerted in a given cortical area.
Third, the fact that these effects are subject to learning
emphasizes the importance of taking into account, and
measuring, the performance of each individual subject
in the discrimination task that is under attentional
control. Since these findings, as well as others, emphasize the emergence of attentional influences by an interaction between connections intrinsic to V1 and
interactions between multiple cortical areas, it will be
difficult to isolate the origin of these effects. To the
extent that contextual effects, originating in V1, are
gated by feedback connections, as suggested by the
attention effects, one could imagine that inactivating
the source of feedback would also influence the lateral
interactions within V1. It is important to emphasize,
however, that the time course of the contextual facilitation, and that of the attentional modulation of contextual facilitation (when attention is cued before stimulus
presentation), starts from the beginning of the sensory
response. Therefore the output from V1, possibly from
the very first spike, reflects both the contextual and
attentional influences, decreasing the likelihood that
they originate elsewhere.
The contextual influences on cells in primary visual
cortex indicate that they are selective for much more
complex features in the visual environment than previously thought, and that they play a central role in
contour integration and surface segmentation. These
influences depend on the precise geometric relationships
between stimuli lying inside and outside the receptive
field core, and the interactions between stimuli placed
in the extended receptive field and those lying within
the classical receptive field are most subject to attentional modulation. The response to a simple stimulus,
such as a single oriented line segment, is mediated by
feedforward connections such as thalamocortical and
vertical interlaminar connections within V1. The modulation of these responses by the global characteristics of
contours and surface boundaries extending beyond the
receptive field is likely to be mediated at least in part by
the plexus of long range horizontal connections. One
also must consider feedback connections from higher
cortical areas as potentially playing a part in certain
contextual influences. One possible mechanism underlying the attention effects is a gating or modulation of the
synaptic effects of long range horizontal connections by
feedback connection from higher cortical areas. If so,
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
then one must view the contextual interactions as
emerging from an interaction between the various inputs converging onto neurons in the primary visual
cortex.
The fundamental conclusion from these studies is
that V1 is not simply a passive filter subject to feedforward interactions, but is an active processor, dynamically changing its processes according to top down
influences. To see these influences operate, however,
one must take into account the higher order, integrative
properties of cells in V1, the task dependent nature of
these properties, and the role of learning in modulating
these influences.
Acknowledgements
This work was supported by NIH grant EY07968 to
C.D.G.
References
Ahissar, M., & Hochstein, S. (1993). Attentional control of early
perceptual learning. Proceedings of the National Academy of Sciences USA, 90, 5718 –5722.
Allman, J., Miezin, F., & McGuinness, E. (1985). Stimulus specific
responses from beyond the classical receptive field–neurophysiological mechanisms for local global comparisons in visual neurons. Annual Re6iew of Neuroscience, 8, 407–430.
Bahcall, D. O., & Kowler, E. (1999). Attentional interference at small
spatial separations. Vision Research, 39, 71–86.
Balz, G. Y., & Hock, H. S. (1997). The effect of attentional spread on
spatial resolution. Vision Research, 37, 1499–1510.
Beck, J., & Ambler, B. (1973). The effects of concentrated and
distributed attention on peripheral acuity. Perception and Psychophysics, 14, 225 – 230.
Blakemore, C., & Tobin, E. A. (1972). Lateral inhibition between
orientation detectors in the cat’s visual cortex. Experimental Brain
Research, 15, 439 – 440.
Bosking, W. H., Zhang, Y., Schofield, B., & Fitzpatrick, D. (1997).
Orientation selectivity and arrangement of horizontal connections
in tree shrew striate cortex. Journal of Neuroscience, 17, 2112 –
2127.
Brefczynski, J. A., & DeYoe, E. A. (1999). A physiological correlate
of the ‘spotlight’ of visual attention. Nature Neuroscience, 2,
370 – 374.
Cohn, T. E., & Lasley, D. J. (1974). Detectability of a luminance
increment: effect of spatial uncertainty. Journal of the Optical
Society of America, 64, 1715–1718.
Crist, R. E., Kapadia, M. K., Westheimer, G., & Gilbert, C. D.
(1997). Perceptual learning of spatial localization: specificity for
orientation, position and context. Journal of Neurophysiology, 78,
2889 – 2894.
Desimone, R., & Duncan, J. (1995). Neural mechanisms of selective
attention. Annual Re6iew of Neuroscience, 18, 193–222.
Dresp, B. (1993). Bright lines and edges facilitate the detection of
small line targets. Spatial Vision, 7, 213–225.
Engel, F. L. (1971). Visual conspicuity, directed attention and retinal
locus. Vision Research, 11, 563–576.
Eriksen, C. W., & St. James, J. D. (1986). Visual attention within and
around the field of focal attention: a zoom lens model. Perception
and Psychophysics, 40, 225–240.
1225
Fahle, M., & Morgan, M. (1996). No transfer of perceptual learning
between similar stimuli in the same retinal position. Current
Biology, 6, 292 – 297.
Field, D. J., Hayes, A., & Hess, R. F. (1993). Contour integration by
the human visual system: evidence for a local ‘association field’.
Vision Research, 33, 173 – 193.
Gandhi, S. P., Heeger, D. J., & Boynton, G. M. (1999). Spatial
attention affects brain activity in human primary visual cortex.
Proceedings of the National Academy of Sciences, 96, 3314–3319.
Gilbert, C. D. (1992). Horizontal integration and cortical dynamics.
Neuron, 9, 1 – 13.
Gilbert, C. D., & Wiesel, T. N. (1979). Morphological and intracortical projections of functionally characterized neurons in the cat
visual cortex. Nature, 280, 120 – 125.
Gilbert, C. D., & Wiesel, T. N. (1983). Clustered intrinsic connections
in cat visual cortex. Journal of Neuroscience, 3, 1116 – 1133.
Gilbert, C. D., & Wiesel, T. N. (1989). Columnar specificity of
intrinsic horizontal and corticocortical connections in cat visual
cortex. Journal of Neuroscience, 9, 2432 – 2442.
Gilbert, C. D., & Wiesel, T. N. (1990). The influence of contextual
stimuli on the orientation selectivity of cells in primary visual
cortex. Vision Research, 30, 1689 – 1701.
Gilbert, C. D. (1994). Early perceptual learning. Proceedings of the
National Academy of Sciences, 91, 1195 – 1197.
Grossberg, S, & Mingolla, E. (1985). Neural dynamics of perceptual
grouping: textures, boundaries, and emergent segmentations. Perception and Psychophysics, 38, 141 – 171.
Haenny, P. E., & Schiller, P. H. (1988). State dependent activity in
monkey visual cortex. 1. Single cell activity in V1 and V4 on
visual tasks. Experimental Brain Research, 69, 225 – 244.
von der Heydt, R., & Peterhans, E. (1989). Mechanisms of contour
perception in monkey visual cortex. I. Lines of pattern discontinuity. Journal of Neuroscience, 9, 1731 – 1748.
Hirsch, J. A., & Gilbert, C. D. (1991). Synaptic physiology of
horizontal connections in the cat’s visual cortex. Journal of Neuroscience, 11, 1800 – 1809.
Hupe, J. M., James, A. C., Payne, B. R., Lomber, S. G., Girard, P.,
& Bullier, J. (1998). Cortical feedback improves discrimination
between figure and background by V1, V2 and V3 neurons.
Nature, 394, 784 – 787.
Ito, M., Westheimer, G., & Gilbert, C. D. (1998). Attention and
perceptual learning modulate contextual influences on visual perception. Neuron, 20, 1191 – 1197.
Ito, M., & Gilbert, C. D. (1999). Attention modulates contextual
influences in the primary visual cortex of alert monkeys. Neuron,
22, 593 – 604.
Kapadia, M. K., Ito, M., Gilbert, C. D., & Westheimer, G. (1995).
Improvement in visual sensitivity by changes in local context:
parallel studies in human observers and in V1 of alert monkeys.
Neuron, 15, 843 – 856.
Kapadia, M. K., Westheimer, G., & Gilbert, C. D. (1998). Spatial
distribution and dynamics of contextual interactions in cortical
area V1. Society for Neuroscience Abstracts, 24, 1980.
Kapadia, M. K., Westheimer, G., & Gilbert, C. D. (1999). Dynamics
of spatial summation in primary visual cortex of alert monkeys.
Proceedings of the National Academy of Sciences USA, 96, 12073–
12078.
Kastner, S., Nothdurft, H.-C., & Pigarev, I. N. (1997). Neural
correlates of pop-out in cat striate cortex. Vision Research, 37,
371 – 376.
Kisvarday, Z. F., Toth, E., Rausch, M., & Eysel, U. T. (1997).
Orientation specific relationship between populations of excitatory and inhibitory lateral connections in the visual cortex of the
cat. Cerebral Cortex, 7, 605 – 618.
Knierim, J. J., & Van Essen, D. C. (1992). Neuronal responses to
static texture patterns in are V1 of the alert macaque monkey.
Journal of Neuroscience, 67, 961 – 980.
1226
C. Gilbert et al. / Vision Research 40 (2000) 1217–1226
Kosslyn, S. M., Thompson, W. L, Kim, I. J., Rauch, S. L., & Alpert,
N. M. (1995). Topographical representations of mental images in
primary visual cortex. Nature, 378, 496–498.
Krose, B. J., & Julesz, B. (1989). The control and speed of shifts of
attention. Vision Research, 29, 1607–1619.
Kuffler, S. W. (1953). Discharge patterns and functional organization
of mammalian retina. Journal of Neurophysiology, 16, 37– 68.
LaBerge, D. (1983). Spatial extent of attention to letters and words.
Journal of Experimental Psychology: Human Perception & Performance, 9, 371 – 379.
Lamme, V. A. F. (1995). The neurophysiology of figure-ground
segregation in primary visual cortex. Journal of Neuroscience, 15,
1605 – 1615.
Levitt, J. B., & Lund, J. S. (1997). Contrast dependence of contextual
effects in primate visual cortex. Nature, 387, 73–76.
Li, C.-Y., & Li, W. (1994). Extensive integration field beyond the
classical receptive field of cat’s striate cortical neurons – Classification and tuning properties. Vision Research, 34, 2337–2355.
Lindblom, B., & Westheimer, G. (1992a). Uncertainty effects in
orientation discrimination of foveally seen lines in human observers. Journal of Physiology (London), 454, 1–8.
Lindblom, B., & Westheimer, G. (1992b). Spatial uncertainty in
stereoacuity tests: implications for clinical vision test design. Acta
Opthalmology, 70, 60–65.
Luck, S. J., Chelazzi, L., Hillyard, S. A., & Desimone, R. (1997).
Neural mechanisms of spatial attention in areas V1, V2 and V4 of
macaque visual cortex. Journal of Neurophysiology, 77, 24 – 42.
Maffei, L., & Fiorentini, A. (1976). The unresponsive regions of
visual cortical receptive fields. Vision Research, 16, 1131– 1139.
Malach, R., Amir, Y., Harel, M., & Grinvald, A. (1993). Relationship between intrinsic connections and functional architecture
revealed by optical imaging and in vivo targeted biocytin injections in primate striate cortex. Proceedings of the National
Academy of Sciences, 90, 10469–10473.
Martinez, A., Anllo-Vento, L, Sereno, M. I., Frank, L. R., Buxton,
R. B., Dubowitz, D. J., Wong, E. C., Hinrichs, H. J., & Hillyard,
S. A. (1999). Involvement of striate and extrastriate cortical areas
in spatial attention. Nature Neuroscience, 2, 364–369.
Motter, B. C. (1993). Focal attention produces spatially selective
processing in visual cortical areas V1, V2, and V4 in the presence
of competing stimuli. Journal of Neurophysiology, 70, 909 – 919.
Nakayama, K., & Joseph, J. S. (1997). Attention, pattern recognition,
and popout in visual search. In R. Parasuraman, The attenti6e
brain (pp. 279 – 298). Cambridge, MA: MIT Press.
Nelson, J. I., & Frost, B. J. (1978). Orientation-selective inhibition
from beyond the classic visual receptive field. Brain Research, 139,
359 – 365.
Nothdurft, H. C., & Li, C. Y. (1985). Texture discrimination: representation of orientation and luminance differences in cells of the
cat striate cortex. Vision Research, 25, 99–113.
Orban, G. A., Guylas, B., & Vogels, R. (1987). Influence of a moving
textural background on direction selectivity of cat striate neurons.
Journal of Neurophysiology, 57, 1767–1791.
Polat, U., & Sagi, D. (1993). Lateral interactions between spatial
channels: suppression and facilitation revealed by lateral masking
experiments. Vision Research, 33, 993–999.
.
Polat, U., & Sagi, D. (1994). The architecture of perceptual spatial
interactions. Vision Research, 28, 115 – 132.
Polat, U., Mizobi, K., Pettet, M. W., Kasamatsu, T., & Norcia, T.
(1998). Collinear stimuli regulate visual responses depending on
cell’s contrast threshold. Nature, 391, 580 – 584.
Rockland, K. S., & Lund, J. S. (1982). Widespread periodic intrinsic
connections in the tree shrew visual cortex. Science, 215, 1532–
1534.
Roelfesma, P. R., Lamme, V. A., & Spekreijse, H. (1998). Objectbased attention in the primary visual cortex of the macaque
monkey. Nature, 395, 376 – 381.
Rossi, A. F., Rittenhouse, C. D., & Paradiso, M. A. (1996). The
representation of brightness in primary visual cortex. Science, 273,
1104 – 1107.
Sagi, D., & Tanne, D. (1994). Perceptual learning: learning to see.
Current Opinion in Neurobiology, 4, 195 – 199.
Shiu, L.-P., & Pashler, H. (1992). Improvement in line orientation
discrimination is retinally local but dependent on cognitive set.
Perception & Psychophysics, 52, 582 – 588.
Sillito, A. M., Grieve, K. L., Jones, H. L., Cuderio, J., & Davis, J.
(1995). Visual cortical mechanisms detecting focal orientation
discontinuities. Nature, 378, 492 – 496.
Sireteanu, R., & Rettenbach, R. (1995). Perceptual learning in visual
search: fast, enduring, but non-specific. Vision Research, 35,
2037 – 2043.
Somers, D. C., Dale, A. M., Seiffert, A. E., & Tootell, R. B. H.
(1999). Functional MRI reveals spatially specific attentional modulation in human primary visual cortex. Proceedings of the National Academy of Sciences USA, 96, 1663 – 1668.
Tanaka, K., Hikosaka, K., Saito, H., Yukie, M., Fukada, Y., & Iwai,
E. (1986). Analysis of local and wide-field movements in the
superior temporal areas of the macaque monkey. Journal of
Neuroscience, 6, 134 – 144.
Treisman, A., Veira, A., & Hayes, A. (1992). Automaticity and
preattentive processing. American Journal of Psychology, 105,
341 – 362.
Ullman, S. (1990). Three-dimensional object recognition. Cold Spring
Harbor Symposium Quantitati6e Biology, 55, 889 – 898.
Vidyasagar, T. R. (1998). Gating of neuronal responses in macaque
primary visual cortex by an attentional spotlight. NeuroReport, 9,
1947 – 1952.
Weliky, M., Kandler, K., Fitzpatrick, D., & Katz, L. C. (1995).
Patterns of excitation and inhibition evoked by horizontal connections in visual cortex share a common relationship to orientation
columns. Neuron, 15, 541 – 552.
Wertheimer, M. (1923). Untersuchungen zur Lehre von der Gestalt
II, Psychologische Forshung, 4, 301 – 350 [translated as Laws of
organization in perceptual forms. In W. D. Ellis, A Source Book
of Gestalt Psychology (pp. 71 – 88). London, Harcourt, Brace and
Javanovich.
Wolfson, S. S., & Landy, M. S. (1999). Long range interactions
between oriented texture elements. Vision Research, 39, 933–945.
Wurtz, R. H., & Mohler, C. W. (1976). Enhancement of visual
responses in monkey striate cortex and frontal eye fields. Journal
of Neurophysiology, 39, 766 – 772.
Zipser, K., Lamme, V. A. F., & Schiller, P. H. (1996). Contextual
modulation in primary visual cortex. Journal of Neuroscience, 15,
7376 – 7389.