Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Journal of Physiology - Paris xxx (2012) xxx–xxx Contents lists available at SciVerse ScienceDirect Journal of Physiology - Paris journal homepage: www.elsevier.com/locate/jphysparis Building a mechanistic model of the development and function of the primary visual cortex James A. Bednar Institute for Adaptive and Neural Computation, The University of Edinburgh, 10 Crichton St., EH8 9AB Edinburgh, UK a r t i c l e i n f o Article history: Available online xxxx Keywords: Topographic map Development Computational model Orientation map Visual system Cortical microcircuit Cortical column a b s t r a c t Researchers have used a very wide range of different experimental and theoretical approaches to help understand mammalian visual systems. These approaches tend to have quite different assumptions, strengths, and weaknesses. Computational models of the visual cortex, in particular, have typically implemented either a proposed circuit for part of the visual cortex of the adult, assuming a very specific wiring pattern based on findings from adults, or else attempted to explain the long-term development of a visual cortex region from an initially undifferentiated starting point. Previous models of adult V1 have been able to account for many of the measured properties of V1 neurons, while not explaining how these properties arise or why neurons have those properties in particular. Previous developmental models have been able to reproduce the overall organization of specific feature maps in V1, such as orientation maps, but are generally formulated at an abstract level that does not allow testing with real images or analysis of detailed neural properties relevant for visual function. In this review of results from a large set of new, integrative models developed from shared principles and a set of shared software components, I show how these models now represent a single, consistent explanation for a wide body of experimental evidence, and form a compact hypothesis for much of the development and behavior of neurons in the visual cortex. The models are the first developmental models with wiring consistent with V1, the first to have realistic behavior with respect to visual contrast, and the first to include all of the demonstrated visual feature dimensions. The goal is to have a comprehensive explanation for why V1 is wired as it is in the adult, and how that circuitry leads to the observed behavior of the neurons during visual tasks. Ó 2012 Published by Elsevier Ltd. 1. Introduction Understanding how we see remains an elusive goal, despite more than half a century of intensive work using a wide array of experimental and theoretical techniques. Because each of these techniques has different assumptions, strengths, and weaknesses, it can be difficult to establish clear principles and conclusive evidence. To make significant progress in this area, it is important to consider how the existing data can be synthesized into a coherent explanation for a wide variety of phenomena. Computational models of the primary visual cortex (V1) could provide a platform for achieving such a synthesis, integrating results across levels to provide an overall explanation for the main body of results. However, existing models typically fall into one of two categories with different aims, neither of which achieves this goal: (1) narrowly constrained models of specific aspects of adult cortical circuitry or function, or (2) abstract models of E-mail address: [email protected] URLs: http://homepages.inf.ed.ac.uk/jbednar large-scale visual area development, accounting for only a few of the response properties of individual neurons within these areas. Existing models of type (1) (e.g. Chance et al., 1999; Adelson and Bergen, 1985; Albrecht and Geisler, 1991) have been able to show how a variety of specialized circuits (often mutually incompatible) can account for most of the major observed functional properties of V1 neurons, but do not attempt to show how a single, general-purpose circuit could explain most of them at the same time. Because each specific phenomenon can often be explained by many different specialized models, it can be difficult or impossible to distinguish between different explanations. Moreover, just showing an example of how the property can be implemented does little to explain why neurons are arranged in this way, and what the circuit might contribute to the process of vision. Similarly, many existing models of type (2) have been able to account for the large-scale organization of V1, such as its arrangement into topographic orientation maps (reviewed in Swindale (1996), Goodhill (2007), and Erwin et al. (1995)). Yet because the developmental models are formulated at an abstract level, they address only a few of the observed properties of V1 neurons (such as their orientation or eye preference), and it is again difficult to decide between 0928-4257/$ - see front matter Ó 2012 Published by Elsevier Ltd. http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 2 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx the various explanations. Thus despite the many thousands of experimental and computational papers about the visual cortex, methods for integrating, interpreting, and evaluating the overall body of evidence to build a coherent explanation remain frustratingly scarce. This paper outlines and reviews a large set of closely interrelated computational models of the visual cortex that together are beginning to form a consistent, biologically grounded, computationally simple explanation of the bulk of V1 development and function. Specifically, the models develop: 1. Neurons with receptive fields (RFs) selective for retinotopy (X, Y), orientation (OR), ocular dominance (OD), motion direction (DR), spatial frequency (SF), temporal frequency (TF), disparity (DY), and color (CR).1 2. Preferences for each of these organized into realistic spatially topographic maps. 3. Lateral connections between these neurons that reflect the structure of the maps. 4. Realistic surround modulation effects, including their diversity, caused by interactions between these neurons. 5. Contrast-gain control and contrast-invariant tuning for the individual neurons, ensuring that they retain selectivity robustly. 6. Both simple and complex cells, to account for the major response types of V1 neurons. 7. Long-term and short-term plasticity (e.g. aftereffects), emerging from mechanisms originally implemented for development. Together, these phenomena arguably represent the bulk of the generally agreed stimulus-driven response properties of V1 neurons. Accounting for such a diverse set of phenomena could have required an extremely complex model, e.g. the union of the many previously proposed models for each of the individual phenomena. Yet our results show that it is possible to account for all of these using only a small set of plausible principles and mechanisms, within a consistent biologically grounded framework: 1. Single-compartment (point) firing-rate (non-spiking) RGC, LGN, and V1 neurons. 2. Hardwired subcortical pathways to V1 including the main LGN (lateral geniculate nucleus) or RGC (retinal ganglion cell) types that have been identified. 3. Initially isotropic, topographic connectivity within and between neurons in layers in V1. 4. Natural images and spontaneous activity patterns that lead to V1 responses. 5. Hebbian learning with normalization for V1 neurons. 6. A large number of parameters associated with each of these mechanisms. Properties not necessary to explain the phenomena above, such as spiking and detailed neuronal morphology, have been omitted, to clearly focus on the most relevant aspects of the system. The overall hypothesis is that much of the complex structure and properties observed in the visual cortex emerges from interactions between relatively simple but highly interconnected computing elements, with connection strengths and patterns self-organizing in response to visual input and other sources of neural activity. 1 Abbreviations: OR: orientation, OD: ocular dominance, DR: motion direction, SF: spatial frequency, TF: temporal frequency, DY: disparity, CR: color, RF: receptive field, CF: connection field, RGC: retinal ganglion cell, LGN: lateral geniculate nucleus, V1: primary visual cortex, GCAL: gain-control, adaptive laterally connected (model), LISSOM: Laterally interconnected synergetically self-organizing map, LMS: long, medium, short (wavelength cone photoreceptors), GR: medium-center, long-surround retinal ganglion cell, RG: long-center, medium-surround retinal ganglion cell, BY: short-center, long and medium surround retinal ganglion cell, OCTC: orientationcontrast tuning curve. Through visual experience, the geometry and statistical regularities of the visual world become encoded into the structure and connectivity of the visual cortex, leading to a complex functional cortical architecture that reflects the physical and statistical properties of the visual world. At present, many of the results have been obtained independently in a wide variety of separate projects performed by different collaborators at different times. However, all of the models share the same underlying principles outlined above, and all are implemented using the same simulator and a small number of underlying components. This review shows how each of these modeling studies, previously reported separately, fits into a consistent and compact framework for explaining a very wide range of data. The models are the first developmental models of V1 maps with wiring consistent with V1, and the first to have realistic behavior with respect to visual contrast, and together they account for all of the various spatial feature dimensions for which topographic maps have been reported using imaging in mammals. Preliminary work into developing an implementation combining each of the models into a single, working model visual system is also reported, although doing so is still a long-term work in progress. The unified model will include all of the visual feature dimensions, as well as all of the major sources of connectivity that affect V1 neuron responses. The goal is to have the first comprehensive, mechanistic explanation for why V1 becomes wired as it is in the adult, and how that circuitry leads to the observed behavior of the neurons during visual tasks. That is, the model will be the first that starts from an initially undifferentiated state, to wire itself into a collection of neurons that behave, at a first approximation, like those in V1. Because such a model starts with no specializations (at the cortical level) specific to vision and would organize very differently when given different inputs, it would also represent a general explanation for the development and function of sensory and motor areas throughout the cortex. 2. Material and methods All of the models whose results are presented here are implemented in the Topographica simulator, and are freely available along with the simulator from www.topographica.org. This section describes the complete, unified architecture under development, with each currently implemented model representing a subset or simplification of this architecture (as specified for each set of results below). The proposed model is a generalization of the GCAL (gain-controlled, adaptive, laterally connected) map model (Law et al., 2011), extended to cover results from a large family of related models. The unified GCAL model is intended to represent the visual system of the macaque monkey, but relies on data from studies of cats, ferrets, tree shrews, or other mammalian species where clear results are not yet available from monkeys. 2.1. Sheets and projections Each Topographica model consists of a set of sheets of neurons and projections (sets of topographically mapped connections) between them. For each model discussed in this paper, there are sheets representing the visual input (as a set of activations in photoreceptor cells), the transformation from the photoreceptors to inputs driving V1 (expressed as a set of LGN cell activations), and neurons in V1. Fig. 1 shows the sheets and connections in the unified GCAL model, which is described for the first time here as a single, combined model. Each sheet is implemented as a two-dimensional array of firingrate neurons. The Topographica simulator allows parameters for Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx 3 tially localized set of connections from neurons in an input sheet near the corresponding topographic location of the target neuron. Fig. 1 shows one sample connection field (CF) for each projection, visualized as an oval of the corresponding radius on the input sheet (drawn to scale), connected by a cone to the neuron on the target sheet. The connections and their weights determine the specific properties of each neuron in the network, by differentially weighting RGC/LGN inputs of different types and/or locations. Each of the specific types of sheets and projections is described in the following sections. 2.2. Images and photoreceptor sheets Fig. 1. Comprehensive GCAL model architecture. Unified GCAL model for simple and complex cells with surround modulation and (X, Y), OR, OD, DY, DR, TF, SF, and CR maps (see list of abbreviations on page 4). The model consists of 29 neural sheets and 123 separate projections between them. Each sheet is drawn to scale, with larger sheets subcortically to avoid edge effects, and an actual sample activity pattern on each subcortical sheet. Each projection is illustrated with an oval, also drawn to scale, showing the extent of the connection field in that projection, with lines converging on the target of the projection. Sheets below V1 are hardwired to cover the range of response types found in the retina and LGN. Connections to V1 neurons adapt via Hebbian learning, allowing initially unselective V1 neurons to exhibit the range of response types seen experimentally, by differentially weighting each of the subcortical and intracortical inputs. sheets and projections to be specified in measurement units that are independent of the specific grid sizes used in a particular run of the simulation. To achieve this, Topographica sheets provide multiple spatial coordinate systems, called sheet and matrix coordinates. Where possible, parameters (e.g. sheet dimensions or connection radii) are expressed in sheet coordinates, expressed as if the sheet were a continuous neural field rather than a finite grid. In practice, of course, sheets are always implemented using some finite matrix of units. Each sheet has a parameter called its density, which specifies how many units (matrix elements) in the matrix correspond to a length of 1.0 in continuous sheet coordinates, which allows conversion between sheet and matrix coordinates. In all simulations shown, sheets of V1 neurons have dimensions (in sheet coordinates) 1.0 1.0. If every other sheet in the model were to have a 1.0 1.0 area, units near the border of higher-level sheets like V1 would have afferent connections that extend past the border of lower-level sheets like the RGC/LGN cells. This cropping of connections will result in artifacts in the behavior of units near the border. To avoid such artifacts, lower-level sheets have areas larger than 1.0 1.0. (Alternatively, one could avoid cropping of connections by imposing periodic boundary conditions, but doing so would create further artifacts by combining unrelated portions of the visual field into the connection fields of V1 neurons.) In Fig. 1 each sheet is plotted at the same scale in terms of degrees of visual angle covered, and thus the photoreceptor and RGC/LGN sheets appear larger. Sheet dimensions were chosen to ensure that each unit in the receiving sheet has a complete set of connections, where possible, minimizing edge effects in the RGC/ LGN and V1 (Miikkulainen et al., 2005). When sizes are scaled appropriately (Bednar et al., 2004), results are independent of the density used, except at very low densities or for simulations with complex cells, where complexity increases with density (as described below). Larger areas can be simulated easily (Bednar et al., 2004), but require more memory and simulation time. A projection to an m m sheet of neurons consists of m2 separate connection fields, one per target neuron, each of which is a spa- The unified GCAL model contains six input sheets, representing the Long, Medium, and Short (LMS) wavelength cone photoreceptors in the retinas of the left and right eyes (illustrated in Fig. 1). The density of photoreceptors is uniform across the sheets, because only a relatively small portion of the visual field is being modeled, but the non-uniform spacing from fovea to periphery can be added for a model with a larger input sheet. Given a color image appearing in one eye, the estimated activations of the LMS sheets are calculated from the cone sensitivity functions for each photoreceptor type (Stockman et al., 1999; Stockman and Sharpe, 2000) using calibrated color images (Olmos and Kingdom, 2004), following the method described in De Paula (2007). Input image pairs (left, right) were generated by choosing one image randomly from a database of single calibrated images, selecting a random patch within the image, a random nearly horizontal offset between patterns in each eye (as described in Ramtohul (2006)), a random direction of motion translation with a fixed speed (described in Bednar and Miikkulainen (2003)), and a random brightness difference between the two eyes (described in Miikkulainen et al. (2005)). These modifications are intended as a simple model of motion and eye differences, to allow development of direction preference, ocular dominance, and disparity maps, until suitable full-motion stereo calibrated-color video datasets of natural scenes are available. Simulated retinal waves can also be used as inputs, to provide initial RF and map structure before eye opening, but are not required for RF or map development in the model (Bednar and Miikkulainen, 2004). 2.3. Subcortical sheets The subcortical pathway from the photoreceptors to the thalamorecipient cells in V1 is represented as a set of hardwired subcortical cells with fixed connection fields (CFs) that determine the response properties of each cell. These cells represent the complete processing pathway to V1, including circuitry in the retina (including the retinal ganglion cells), optic nerve, lateral geniculate nucleus, and optic radiations to V1. Because the focus of the model is to explain cortical development given its thalamic input, the properties of these RGC/LGN cells are kept fixed throughout development, for simplicity and clarity of analysis. Each distinct RGC/LGN cell type is grouped into a separate sheet, each of which contains a topographically organized set of cells with identical properties but responding to a different region of the retinal photoreceptor input sheet. Fig. 1 shows examples of each of the different response types suitable for the development of the full range of V1 RFs: SF1 (achromatic cells with large receptive fields, as in the magnocellular pathway), SF2 (achromatic cells with small receptive fields), BY cells (color opponent cells with blue (short) cone photoreceptor center, and red (long) and green (short) surround), and similarly for medium-center (GR) and long-center (RG) chromatic L/M RFs. Each such opponent cell comes in two types, On (with an excitatory center) and Off (with an excitatory surround). Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 4 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx All of these cells have Difference-of-Gaussian RFs, and thus perform edge enhancement at a particular size scale (SF1 and SF2) and/or color enhancement (e.g. RG and GR). Each of these RF types has been reported in the macaque retina or LGN (Vidyasagar et al., 2002; Dacey, 2000; Klug et al., 2003), and together they cover the range of typically reported spatial response types. Additional such cell classes can easily be added as needed, e.g. to provide more than two sizes of RGC/LGN cells with different spatial frequency preferences (Palmer, 2009), although doing so increases memory and computation requirements. Not all of the cell types currently included are necessarily required for these results, but so far they have been found to be sufficient. For each of the RGC/LGN sheets, multiple projections with different delays connect them to the V1 sheet. These delays represent the different latencies in the lagged vs. non-lagged cells found in cat LGN (Saul and Humphrey, 1992; Wolfe and Palmer, 1998), and allow V1 neurons to become selective for the direction of motion. Lagged cells could instead be implemented using separate LGN sheets, as in Bednar and Miikkulainen (2003), if differences from non-lagged cells other than temporal delay need to be taken into account. Many other sources of temporal delays would also lead to direction preferences, but have not been tested specifically. Apart from these delays, the detailed temporal properties of the subcortical neural responses and of signal propagation along the various types of connections elsewhere in the network have not been modeled. Instead, the model RGC/LGN neurons have a constant, sustained output, and all connections in each projection have a constant delay, independent of the physical length of that connection. Modeling the subcortical temporal response properties and simulating non-uniform delays would greatly increase the number of timesteps needed to simulate each input presentation, but should otherwise be feasible in future work. 2.4. Cortical sheets Many of the simulations use only a single V1 sheet for simplicity, but in the full unified model, V1 is represented by three cortical sheets. First, cells with direct thalamic input are labeled V1 L4 in Fig. 1, and nominally correspond to pyramidal simple cells in macaque V1 layer 4Cb. Second, pyramidal cells in layer 2/3 (labeled V1 L2/3E) receive input from a small topographically corresponding (columnar) region of V1 L4. These cells (as discussed below) become complex-cell–like, i.e. relatively insensitive to spatial phase, by pooling across nearby L4 simple cells of different preferred spatial phases (Antolik and Bednar, 2011). The third V1 sheet L2/3I models inhibitory interneurons in layer 2/3. Together, these sheets will be used to show how simple cells can develop in V1 L4, how complex cells can develop in L2/3, and how interactions between these cells can vary depending on contrast, which affects the balance between excitation and inhibition (Antolik, 2010). The behavior of the V1 sheets is primarily determined by the projections to, within, and between them. Fig. 2 illustrates and describes each of these projections. The overall pattern of projections was chosen based on anatomical tracing of the V1 microcircuit in cat (Binzegger et al., 2009), providing the first model of V1 selforganization where the long-range connections are excitatory (Law, 2009; Antolik, 2010), as found in animals (Gilbert and Wiesel, 1989). Each of these projections is initially non-specific, and becomes selective only through the process of self-organization (described below), which increases some connection weights at the expense of others. 2.5. Activation At each training iteration, a new retinal input image is presented and the activation of each unit in each sheet is updated in Fig. 2. Projections in model V1. Each cone represents a projection to the neuron to which it points. V1 L4 neurons receive numerous direct projections from the RGC/ LGN cells shown in Fig. 1 (green projections along the bottom of the figure), along with weak short-range lateral excitatory connections (blue ovals) and feedback connections from V1 L2/3E (light blue and red cones pointing to V1 L4). V1 L2/3E neurons receive narrow afferent projections from V1 L4 (red cone on left), narrow inhibitory projections from V1 L2/3I (red cone in top right), and both short-range and long-range lateral excitatory projections (blue ovals). V1 L2/3I neurons receive both short-range and long-range afferent excitation from V1 L2/3E (green and purple cones), and make short-range lateral inhibitory connections to other V1 L2/3I neurons (blue oval). If projections are visualized as outgoing rather than incoming as drawn here, L2/3E cells will make wide-ranging excitatory connections to cells of both L2/3E and L2/3I, and L2/3I cells will make short-ranging inhibitory connections to cells of both sheets. This pattern of connectivity is a simplified but consistent implementation of the known connectivity patterns between V1 layers (Binzegger et al., 2009). a series of steps. One training iteration represents one visual fixation (for natural images) or a snapshot of the relatively slowly changing spatial pattern of spontaneous activity (e.g. for retinal waves (Wong, 1999)). I.e., an iteration consists of a constant retinal activation, followed by recurrent processing at the LGN and cortical levels. For one iteration, assume that input is drawn on the photoreceptors at time t and the connection delay (constant for all projections) is defined as dt = 0.05 (roughly corresponding to 10– 20 ms). Then at t + 0.05 the RGC/LGN cells compute their responses, and at t + 0.10 the thalamic output is delivered to V1, where it similarly propagates through the cortical sheets. Images are presented to the model by activating the retinal photoreceptor units. The activation value Wi,P of unit i in photoreceptor sheet P is given by the calibrated-color estimate of the L, M, or S cone activation in the chosen image at that point. For each model neuron in the other sheets, the activation value is computed based on the combined activity contributions to that neuron from each of the sheet’s incoming projections. The activity contribution from a projection is recalculated whenever its input sheet activity changes, after the corresponding connection delay. For unit j in the target sheet, the activity contribution Cjp to j from projection p is a dot product of the relevant input with the weights: C jp ðt þ dtÞ ¼ P X isp ðtÞxij;p ð1Þ i2F jp where Xis is the activation of unit i on this projection’s input sheet sp, taken from the set of all input neurons from which target unit j receives connections in that projection (its connection field Fjp), and xij,p is the connection weight from i to j in that projection. Across all projections, multiple direct connections between the same pair of neurons are possible, but each projection p contains at most one connection between i and j, denoted by xij,p. For a given subcortical or cortical unit j in the separate models reported in this paper (except those containing complex cells), the Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 5 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx activity gj(t) is calculated from a rectified weighted sum of the activity contributions Cjp(t): gj ðtÞ ¼ f P p ! cp C jp ðtÞ ð2Þ f is a half-wave rectifying function with a variable threshold point (h) dependent on the average activity of the unit, as described in the next subsection. Each cp is an arbitrary multiplier for the overall strength of connections in projection p. The cp values are set in the approximate range 0.5–3.0 for excitatory projections and 0.5 to 3.0 for inhibitory projections. For afferent connections, the cp value is chosen to map average V1 activation levels into the range 0– 1.0 by convention, for ease of interconnecting and analyzing multiple sheets. For lateral and feedback connections, the cp values are then chosen to provide a balance between feedforward, lateral, and feedback drive, and between excitation and inhibition; these parameters are crucial for making the network operate in a useful regime. For the full unified model, RGC/LGN neuron activity is computed similarly to Eq. (2), except that they have a separate divisive lateral inhibitory projection: 0P gjL ðtÞ ¼ f @ p cp C jp ðtÞ 1 cS C jS ðtÞ þ k A ð3Þ where L stands for one of the RGC/LGN sheets. Projection S consists of a set of isotropic Gaussian-shaped lateral inhibitory connections (see Eq. (7), evaluated with u = 1), and p ranges over all the other projections to that sheet. k is a small constant to make the output well-defined for weak inputs. The divisive inhibition implements the contrast gain control mechanisms found in RGC and LGN neurons (Antolik, 2010; Felisberti and Derrington, 1999; Bonin et al., 2005; Alitto and Usrey, 2008). For the unified model and individual models with complex cells, cortical neuron activity is computed similarly to Eq. (2), except to add firing-rate-fluctuation noise and exponential smoothing of the recurrent dynamics: gjV ðt þ dtÞ ¼ kf P p ! cp C jp ðt þ dtÞ þ ð1 kÞgjV ðtÞ þ rn x ð4Þ where V stands for one of the cortical sheets, p ranges over all projections to that sheet, k = 0.5 is a time constant parameter that defines the strength of smoothing of the recurrent dynamics in the network, x is a normally distributed zero-mean unit-variance random variable, and rn scales x to determine the amount of noise. The smoothing ensures that the system remains numerically stable, without spurious oscillations caused by simulating only discrete time steps, for the relatively coarse time steps that are used here for computational efficiency. For any of the models, each time the activity is computed using Eq. (2), (3), or (4), the new activity values are sent to each of the outgoing projections, where they arrive after the projection delay (typically 0.05). The process of activity computation then begins again, with a new contribution C computed as in Eq. (1), leading to new activation values by Eqs. (2), (3), or (4). Activity thus spreads recurrently throughout the network, and can change, die out, or be strengthened, depending on the parameters. With typical parameters that lead to realistic topographic map patterns, initial blurry patterns of afferent-driven activity are sharpened into well-defined ‘‘activity bubbles’’ through locally cooperative and more distantly competitive lateral interactions (Miikkulainen et al., 2005). Nearby neurons are thus influenced to respond more similarly, while more distant neurons receive net inhibition and thus learn to respond to different input patterns. The competitive interactions ‘‘sparsify’’ the cortical response into patches, in a process that can be compared to the explicit sparseness constraints in non-mechanistic models (Olshausen and Field, 1996), while the local facilitatory interactions encourage spatial locality so that smooth topographic maps will be developed. Whenever the time reaches an integer multiple (e.g. 1.0 or 2.0), the V1 response is used to update the threshold point (h) of V1 neurons (using the adaptation process described in the next section) and to update the afferent weights via Hebbian learning (as described in the following section). Both adaptation and learning could also be performed at each settling step, but doing so would greatly decrease computational efficiency. Because the settling (sparsification) process typically leaves only small patches of the cortical neurons responding strongly, those neurons will be the ones that learn the current input pattern, while other nearby neurons will learn other input patterns, eventually covering the complete range of typical input variation. Overall, through a combination of the network dynamics that achieve sparsification along with local similarity, plus Hebbian learning that leads to feature preferences, the network will learn smooth, topographic maps with good coverage of the space of input patterns, thereby developing into a functioning system for processing patterns of visual input. 2.6. Homeostatic adaptation For this model, the threshold for activation of each neuron is a very important quantity, because it directly determines how much the neuron will fire in response to a given input. To set the threshold, each neuron unit j in V1 calculates a smoothed exponential average of its activity (gj ): gj ðtÞ ¼ ð1 bÞgj ðtÞ þ bgj ðt 1Þ ð5Þ The smoothing parameter (b = 0.999) determines the degree of smoothing in the calculation of the average. gj is initialized to the target average V1 unit activity (l), which for all simulations is gjA ð0Þ ¼ l ¼ 0:024. The threshold is updated as follows: hðtÞ ¼ hðt 1Þ þ kðgj ðtÞ lÞ ð6Þ where k = 0.0001 is the homeostatic learning rate. The effect of this scaling mechanism is to bring the average activity of each V1 unit closer to the specified target. If the activity in a V1 unit moves away from the target during training, the threshold for activation is thus automatically raised or lowered in order to bring it closer to the target. Note that an alternative rule with only a single smoothing parameter (rather than b and k) could be formulated, but the rule as presented here makes it simple for the modeler to set a desired target activity l, and is in any case relatively insensitive to the values of the smoothing parameters. 2.7. Learning Initial connection field weights are random within a twodimensional Gaussian envelope. E.g., for a postsynaptic (target) neuron j located at sheet coordinate (0, 0), the weight xij,p from presynaptic unit i in projection p is: xij;p 1 x2 þ y 2 ¼ u exp Z xp 2r2p ! ð7Þ where (x, y) is the sheet-coordinate location of the presynaptic neuron i, u is a scalar value drawn from a uniform random distribution for the afferent and lateral inhibitory projections (p = A, I), rp determines the width of the Gaussian in sheet coordinates, and Zx is a constant normalizing term that ensures that the total of all weights xij to neuron j in projection p is 1.0. Weights for each projection are Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 6 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx only defined within a specific maximum circular radius rp; they are considered zero outside that radius. Each connection weight xij from unit i to unit j is adjusted using a simple Hebbian learning rule. This rule results in connections that reflect correlations between the presynaptic activity and the postsynaptic response. Hebbian connection weight adjustment for unit j is dependent on the presynaptic activity gi, the postsynaptic response gj, and the Hebbian learning rate ap: xij;p ðt 1Þ þ ap gj gi ðxkj;p ðt 1Þ þ ap gj gk Þ xij;p ðtÞ ¼ P ð8Þ k Unless it is constrained, Hebbian learning will lead to ever-increasing (and thus unstable) values of the weights. The weights are constrained using divisive postsynaptic weight normalization (Eq. (8)), which is a simple and well understood mechanism. All afferent connection weights from RGC/LGN sheets are normalized together in the model, which allows V1 neurons to become selective for any subset of the RGC/LGN inputs. Weights are normalized separately for each of the other projections, to ensure that Hebbian learning does not disrupt the balance between feedforward drive, lateral and feedback excitation, and lateral and feedback inhibition. Subtractive normalization with upper and lower bounds could be used instead, but it would lead to binary weights (Miller and MacKay, 1994; Miller, 1994), which is not desirable for a firing-rate model whose connections represent averages over multiple physical connections. More biologically motivated homeostatic mechanisms for normalization such as multiplicative synaptic scaling (Turrigiano, 1999) or a sliding threshold for plasticity (Bienenstock et al., 1982) could be implemented instead, but these have not been tested so far. 3. Results In each of the subsections below, results are shown from previously reported simulations using partial or simplified versions of the full unified GCAL model described above. Models were typically run for 10,000 iterations (where an iteration corresponds to one complete image presentation, e.g. a visual saccade, covering a simulation time of 1.0), with structured feature maps and receptive fields gradually appearing as more input patterns are presented. By 10,000, the maps and connection fields have come to a ‘‘dynamic equilibrium’’, where they are stable as long as the input statistics are stationary (Miikkulainen et al., 2005), and these are the results that are shown here. 3.1. Feature maps For each of the topographic feature maps reported for V1 in imaging experiments, Fig. 3 shows a typical imaging result from an animal, plotted above a typical final result from a simulation using the above framework and including that feature value. For OR, DR, DY, and CR, black indicates neurons not selective for the indicated feature, and bright colors indicate highly selective neurons. For CR the color in the plot indicates the preferred color; the others are false-color plots showing the feature value for each neuron using the color key adjacent to that panel. Maps were measured by collating responses to sine gratings covering ranges of values of each feature (Miikkulainen et al., 2005), reproducing typical methods of map measurement in animals (Blasdel, 1992a). Each of these specific simulations was done with an earlier model named LISSOM (Miikkulainen et al., 2005), which is similar to the full GCAL model described above but simplified to include only a single cortical layer, with direct inhibitory long-range lateral connections, and with modeler-determined thresholds rather than the homeostatic mechanisms described above (Eq. (6)). Each result is for a model including only a subset of the subcortical sheets shown in Fig. 1. Specifically, the model (X, Y) and OR map simulations use only a single pair of monochromatic On, Off RGC/LGN sheets at a fixed size, the OD and DY maps are similar to OR but include an additional pair for the other eye, the DR and TF maps are similar to OR but include three additional pairs with different delays, the SF map is similar to OR but includes three additional pairs of On, Off cells with different Difference-Of-Gaussians CFs, and the CR map is similar to OR but includes five additional sheets of color opponent cells (BY On, RG On, RG Off, GR On, and GR Off). As described in the indicated original source for each model, the model results for (X, Y), OR, OD, DR, and SF have been evaluated against the available animal data, and capture the main aspects of the feature value coverage and the spatial organization of the maps (Miikkulainen et al., 2005; Palmer, 2009). For instance, the OR and DR maps show iso-feature domains, pinwheel centers, fractures, saddle points, linear zones, and an overall ring-shaped Fourier transform, as in the animal maps (Blasdel, 1992a; Weliky et al., 1996). The maps simulated together (e.g. OR and OD) also tend to intersect at right angles, such that high-gradient regions in one map avoid high-gradient regions in others (Miikkulainen et al., 2005). These patterns primarily emerge from geometric constraints on smoothly mapping the range of values for the indicated feature, within a two-dimensional retinotopic map (Miikkulainen et al., 2005). They are also affected by the relative amount by which each feature varies in the input dataset, how often each feature appears, and other aspects of the input image statistics (Miikkulainen et al., 2005). For instance, orientation maps trained on natural image inputs develop a preponderance of neurons with horizontal and vertical orientation preferences, as seen in ferret maps and in natural images (Bednar and Miikkulainen, 2004; Coppola et al., 1998). For DY, the model results (Ramtohul, 2006) have not been compared systematically with the small amount of data (barely visible in the plot) from the one available experimental report on the organization for disparity preferences (Kara and Boyd, 2009), because the model predated the experiments by several years. A preliminary analysis suggests that the model organization is comparable in that neurons for disparity tend to occur in small, local regions sensitive to horizontal disparity, but further evaluation is necessary. The results for color (CR) are preliminary due to ongoing work on this topic, but do show that color-selective neurons are found in spatially segregated blobs that include multiple color preferences, as suggested by the imaging data currently available (Xiao et al., 2007). Results for TF are also preliminary, again because they predated experimental TF maps and have not yet been characterized against the experimental results. Overall, where it has been possible to make comparisons, the separate models have been shown to reproduce the main features of the experimental data, using a small set of assumptions. In each case, the model demonstrates how the experimentally measured map can emerge from Hebbian learning of corresponding patterns of subcortical and cortical activity. The models thus illustrate how the same basic, general-purpose adaptive mechanism will lead to very different organizations, depending on the geometrical and statistical properties of that feature. So far, only a subset of these features have been combined into a single simulation, such as (X, Y), OR, OD, DR, and TF (Bednar and Miikkulainen, 2006). Future work will focus on showing how all or nearly all of these results could emerge simultaneously in the full unified GCAL model. However, note that only a few of these maps have ever been measured in the same animal, or even the same species, and thus it is not yet known whether all such maps are actually present in any single animal. The unified model can be used to understand how the maps interact, and to make Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 7 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx 1 5 6 10 2 3 5 5 5 8 9 7 11 12 4 5 Fig. 3. Simulated vs. real animal V1 maps. Imaging results for 4 mm 4 mm of V1 of the indicated species and from the corresponding LISSOM models of retinotopy (X, Y), orientation (OR), ocular dominance (OD), motion direction (DR), spatial frequency (SF), temporal frequency (TF), disparity (DY), and color (CR). See the main text for description. Reprinted from references indicated: 1Bosking et al. (2002). 2Blasdel (1992a). 3Blasdel (1992b). 4Weliky et al. (1996). 5Miikkulainen et al. (2005). 6Xu et al. (2007). 7 Kara and Boyd (2009). 8Xiao et al. (2007). 9Purushothaman et al. (2009). 10Palmer (2009). 11Ramtohul (2006). 12Ball and Bednar (2009). predictions for the expected organization for maps not yet measured in a particular species. 3.2. Connection patterns The feature maps described in the previous subsection are summaries of the properties of a set of neurons embedded in a network. To understand how these properties come about, it is necessary to look at the patterns of connectivity that underlie them. Fig. 4 shows examples of such connections from an (X, Y) and OR GCAL simulation using a single pair of On, Off LGN/RGC sheets (Antolik, 2010). Note that this model focuses only on the emergence of orientation preferences, not on other dimensions such as spatial frequency, which would require additional sheets as shown in Fig. 1. The afferent connections that develop reflect the feature preference of the target neuron, with elongated Gabor-shaped connection fields that respond to oriented edges in the input. Only a single orientation edge size is represented, because this simulation includes only a single RGC/LGN cell RF size. The orientation preference suggested by the afferent weight pattern strongly correlates with the measured orientation map (Antolik, 2010; not shown). Because the lateral weights, like the afferent weights, are modified by Hebbian learning, they reflect the correlation patterns between V1 neurons, which are determined both by the emerging map patterns and by the input statistics. For this OR map, retinotopy and OR strongly determine the correlations, Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 8 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx Fig. 4. Self-organized projections to V1 L2/3. Results from a GCAL model orientation map with separate V1 L4 and L2/3 regions allowing the emergence of complex cells; other dimensions like OD, DR, DY, SF, and CR are not included here. (a and b) Connection fields from the LGN On and Off channels to every 20th neuron in the model L4 show that the neurons develop OR preferences that cover the full range at each retinotopic location. (c) Long-range excitatory lateral connections to those neurons preferentially come from neurons with similar OR preferences. Here strong weights are colored with the OR preference of the source neuron. Strong weights occur in clumps (appearing as small dots here) corresponding to an iso-orientation domain (each approximately 0.2–0.3 mm wide); the fact that most of the dots are similar in color for any given neuron shows that the connections are orientation specific. (d) Enlarged plot from (c) for a typical OR domain neuron that prefers horizontal patterns and receives connections primarily from other horizontal-preferring neurons (appearing as blobs of red or nearly red colors). (e) OR pinwheel neurons receive connections from neurons with many different OR preferences. Reprinted from Antolik (2010). and thus the lateral connection patterns respect both the retinotopic and the orientation maps. Lateral connection patterns are thus patchy and orientation specific, as seen in tree shrews and monkeys (Bosking et al., 1997; Sincich and Blasdel, 2001) (see Fig. 5). For neurons in orientation domains, long-range connections primarily come from neurons with similar orientation preference, because those neurons were often coactivated with this neuron during self-organization using natural images. Interestingly, a prediction is that cells near pinwheel fractures, which are less selective for orientation in the model, will have a much broader range of input connections, because their activity is correlated with neurons with a wide, non-specific range of orientation preferences. The degree of orientation selectivity of neurons near pinwheel centers has been controversial, but current evidence is in line with the model results (Nauhaus et al., 2008). Connectivity patterns in pinwheels have not yet been investigated experimentally, and so the model results represent predictions for future experiments. When multiple maps are simulated in the same model, the connection patterns respect all maps at once, because there is only one Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx 9 Fig. 5. Lateral connections across maps. LISSOM/GCAL neurons each participate in multiple functional maps, but have only a single set of lateral connections. Connections are strongest from other neurons with similar properties, respecting each of the maps to the degree to which that map affects correlation between neurons. Maps for a combined LISSOM OR/OD/DR simulation are shown above, with the black outlines indicating the connections to the central neuron (marked with a small black outline) that remain after weak connections have been pruned. Model neurons connect to other model neurons with similar orientation preference (a) (as in tree shrew, (d)) but even more strongly respect the direction map (c). This highly monocular unit also connects strongly to the same eye (b), but the more typical binocular cells have wider connection distributions. (a–c) Reprinted from Bednar and Miikkulainen (2006); (d) reprinted from Bosking et al. (1997). set of V1 neurons and one set of lateral connections, each determined by Hebbian learning. Fig. 5 shows an example from a combined (X, Y), OR, OD, DR map simulation, which reproduces the observed connection dependence on the OR map but predicts that connections will also respect the DR map (as reported by Roerig and Kao (1999)), and for highly monocular neurons will respect the OD map as well. The model strongly predicts that lateral connection patterns will respect all other maps that account for a significant fraction of the response variance of the neurons, because each of those features will thus affect the correlation between neurons. 3.3. Orientation and phase tuning Typical map development models focus on reproducing the map patterns, and are otherwise highly abstract (for review see Swindale (1996), Goodhill (2007), and Erwin et al. (1995)). These models are very useful for understanding the process of map formation in isolation, and to explain the geometric properties of maps. However, a map pattern measured in a real animal is simply a summary of one property of a complete system that processes visual information, and so a full explanation of the map pattern requires a demonstration of how the map patterns emerge from a set of neurons that process visual information in a realistic way. For such a model, the map patterns can be a way to determine if the underlying model circuit is operating like those in animals, which is a very different goal than in abstract models of map patterns in isolation. Once such a model exhibits realistic maps, it can then be tested to determine whether the feature preferences summarized in the map actually represent realistic responses to a visual feature, such as the orientation and phase of a sine grating test pattern. For instance, neurons in a model orientation map can be tested to see if they are actually selective for the orientation of an input stimulus, and retain that selectivity as contrast is varied (i.e., have robust contrast-invariant tuning (Sclar and Freeman, 1982)). Many developmental models (e.g. those based on the elastic net or using correlation-based learning) cannot be tested with a specific bitmap input image, and so cannot directly be extended into a model visual system of the type considered here. The results from others that do allow bitmap input are not typically compared with single-unit data from animals, again because the papers focus on the map patterns. Yet without contrast-invariant tuning, the response of a neuron would be ambiguous – a strong response would could indicate either that the preferred orientation is present, or else that the input simply has very high contrast. Fig. 6 shows that GCAL model neurons, thanks to the lateral inhibition implemented at the RGC/LGN level, do retain their orientation tuning as contrast is varied (Antolik and Bednar, 2011). Earlier models such as LISSOM (Miikkulainen et al., 2005) did not have this property and were thus valid only for a small range of contrasts. In GCAL, lateral inhibition acts like divisive normalization of the population response, giving similar patterns and levels of activity regardless of the contrast, which leads to contrast-invariant tuning (Antolik and Bednar, 2011). In the animal V1 data, tuning for spatial phase (the specific position of an oriented sine grating) is complicated, with simple cells highly selective for both orientation and spatial phase, and complex cells selective for orientation but not spatial phase (Hubel et al., 1962). Moreover, the spatial organization is smooth for orientation (Blasdel, 1992a), such that nearby neurons have similar orientation preferences, but disordered for phase, with nearby neurons having a variety of phase preferences (Aronov et al., 2003; Liu et al., 1992; Jin et al., 2011) (though the detailed organization for spatial phase is not yet clear and consistent between studies). Disorder in the phase map provides a simple, local way to construct complex cells—simply pool outputs from several local simple cells, each with similar orientation preferences but one of a variety of phase preferences, to construct an orientation-selective cell relatively invariant to phase (as originally proposed by Hubel et al. (1962)). However, it is difficult to develop a random phase map in simulation, because existing developmental models group neurons by similarity in responses to a visual pattern, while neurons with different phase preferences will by definition respond to different visual patterns (Antolik and Bednar, 2011). Previous models for the development of random phase preferences have relied on unrealistic mechanisms such as squaring negative activation levels (Hyvärinen and Hoyer, 2001; Weber et al., 2001), which force opposite phases to become correlated (and thus grouped together by developmental models) but have no clear physical interpretation. Instead, the GCAL models (Antolik and Bednar, 2011; Antolik, 2010) show how random phase maps could arise in a set of simple cells (nominally in layer 4) through a small amount of initial disorder in the mapping from the thalamus, which persists because the model includes strong long-range lateral connectivity only in the layer 2/3 complex cells rather than in layer 4 (see Fig. 2). The layer 2/3 cells pool from a local patch of layer 4 cells, thus becoming unselective for phase, but have strong lateral interactions that lead to well-organized maps in layer 2/3 (which is primarily what is measured in most optical imaging experiments). Feedback from layer 2/3 to layer 4 then causes layer 4 cells to develop an orientation map, without disrupting the random phase map. Fig. 7 plots Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 10 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx Fig. 6. Contrast-invariant tuning. Unlike other developmental models such as LISSOM, GCAL shows contrast-invariant tuning, i.e., similar orientation tuning width at different contrasts, as found in animals (Sclar and Freeman, 1982). Reprinted from Antolik and Bednar (2011). the results of this process, showing the resulting maps and modulation ratios for each layer. The model predicts a strong spatial organization for (weak) phase preferences in layer 2/3, and that the orientation map in layer 4 (not currently measurable with imaging techniques due to its depth beneath the cortical surface) is less well ordered than in layer 2/3. Fig. 6 shows that the complex cells in layer 2/3 retain orientation tuning and contrast invariance, as expected. very large range of possible input patterns suffices to develop map patterns—orientation maps (but not realistic lateral connection patterns) can develop from random noise inputs, abstract spatially localized patterns, retinal wave model patterns, and other two-dimensional patterns (Miikkulainen et al., 2005). Thus the emergence of maps and RFs is a robust phenomenon, but the specific patterns of response properties and neural connectivity directly reflect the input scene statistics. 3.4. Dependence on input statistics 3.5. Surround modulation Because development in the model is driven by input patterns (whether externally or intrinsically generated), the results depend crucially on the specific patterns used. At the extreme, models with two eyes having identical inputs do not develop ocular dominance maps, and models trained only on static images do not develop direction maps (Miikkulainen et al., 2005). The specific map patterns also reflect the input statistics, with more neurons responding to horizontal and vertical contours because of the prevalence of such contours in natural images (Bednar and Miikkulainen, 2004). Relationships between the maps also reflect the input statistics— direction maps dominate the overall organization when input patterns are all moving quickly (Miikkulainen et al., 2005), and ocular dominance maps interact orthogonally with orientation maps only for some types of eye-specific input differences (Jegelka et al., 2006). Finally, the lateral connection patterns depend crucially on the input statistics, with long-range orientation specific connectivity developing only for input datasets with long-range orientation-specific correlations (Miikkulainen et al., 2005). Even so, a Given a model with realistically patchy, specific lateral connectivity and realistic single-neuron properties, as outlined above, the patterns of interaction between neurons can be compared with neurophysiological evidence for surround modulation—influences on neural responses from distant patterns in the visual field. These studies can help validate the underlying model circuit, while helping understand how the visual cortex will respond to complicated patterns such as natural images. For instance, as the size of a patch of grating is increased, the response of a V1 neuron typically increases at first, reaches a peak, and then decreases (Sceniak et al., 1999; Sengpiel et al., 1997; Wang et al., 2009). Similar patterns can be observed in a GCAL-based model orientation map with complex cells and separate inhibitory and excitatory subpopulations (Fig. 8 from Antolik (2010); see connectivity in Fig. 2). Small patterns initially activate neurons weakly, due to low overlap with the afferent receptive fields of layer 4 cells, but the response increases with larger patterns. For large enough patterns, lateral interactions are strong and in most locations net Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx 11 Fig. 7. Complex and simple cells. Models with layer 4C and layer 2/3 develop both complex and simple cells (Antolik and Bednar, 2011). The layer 2/3 orientation map in (b) is a good match to the experimental data from imaging the superficial layers. The matching but slightly disordered OR map in (a), the heavily disordered L4C phase map in (c), and the weak but highly ordered L2/3 phase map in (d) are all predictions of the model, as none of those have been measured in animals. (e) The layer 4 cells are highly sensitive to phase (and thus ‘‘simple cells’’), while the layer 2/3 cells respond to a wide (though not entirely flat) range of phases (with response plotted on the vertical axis and spatial phase on the horizontal). The overall histogram of modulation ratios (ranging from perfectly ‘‘complex’’, i.e., insensitive to phase, to perfectly ‘‘simple’’, responding to one phase only), is bimodal (f) as in macaque (g). Most model simple cells are in layer 4, and most complex cells are in layer 2/3, but feedback causes the two categories to overlap. (a–f) Reprinted from Antolik and Bednar (2011); (g) reprinted from Ringach et al. (2002). Fig. 8. Size tuning. Responses from a particular neuron vary as the input pattern size increases. For a small sine grating (radius 0.2), responses are weak due to low overlap between the pattern and the neuron’s afferent connection field, even when the position and phase are optimal for that neuron (as here). An intermediate size (0.8) leads to a peak in response for this neuron, with the maximum ratio between excitation and inhibition. Larger sizes activate a larger area of V1, but responses are sparser due to the strong lateral inhibition recruited by this salient pattern. For this particular neuron, the response happens to be suppressed for radius 1.8, but note that many other neurons are still highly active, which is one reason that surround modulation properties are highly variable. The dashed line indicates the 1.0 1.0 area of the retina that is topographically mapped to the V1 layer 2/3 sheet. Reprinted from Antolik (2010). inhibitory, causing many neurons to be suppressed (leading to a subsequent dip in response). These patterns are visualized and compared with the experimental data in Fig. 9. The model demonstrates that the lateral interactions are sufficient to account for typical size tuning effects, and also accounts for less commonly reported effects that result from neurons with different specific Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 12 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx Fig. 9. Diversity in size tuning. Plots A–F reprinted from Antolik (2010) show how the neural response varies as the sine grating radius increases, for 50% (blue) and 100% (red) contrasts, for six example neurons. Those on the top row are the most common type (37% of model neurons measured), and are a good match to the phenomena reported in experimental studies in macaque (G, reprinted from Sceniak et al., 1999) and cat (H, reprinted from Sengpiel et al., 1997, and I, reprinted from Wang et al., 2009), I (Wang et al., 2009)). Those in the middle row occur less often and are less commonly reported in experimental studies, such as larger responses to low contrasts at high radii, but examples of each such pattern can be found in the experimental results shown here. The model predicts that these variations in surround tuning properties are real, not just noise or an experimental artifact, and that they derive from the many possible interactions between a diverse set of neurons in the map. Reprinted from Antolik (2010). self-organized patterns of lateral connectivity. The model thus accounts both for the typical pattern of size tuning, and explains why such a diversity of patterns is observed in animals. The effects of the self-organized lateral connections are even more evident when orientation-specific surround modulation is considered. Because the connections respect the orientation map (Figs. 4 and 5), interactions change depending on the relative orientation of center and surround elements (Fig. 10). Moreover, because these patterns also vary depending on the location of the neuron in the map, a variety of patterns of interaction are seen, just as has been reported in experimental studies. Fig. 11 illustrates some of these relationships, but many more such relationships are possible—any property of the neurons that varies systematically across the cortical surface will affect the pattern of interactions, as long as it changes the correlation between neurons during the history of visual experience. These results suggest both that lateral interactions may underlie many of the observed surround modulation effects, and also that the diversity of observed effects can at least in part be traced to the diversity of lateral connection patterns, which in turn is a result of the sequences of activations of the neurons during development. 3.6. Aftereffects The previous sections have focused on the network organization and operation after Hebbian learning can be considered to be completed. However, the visual system is continually adapting to the visual input even during normal visual experience, resulting in phenomena such as visual aftereffects (Thompson and Burr, 2009). To investigate whether and how this adaptation differs from long-term self-organization, we tested LISSOM and GCALbased models with stimuli used in visual aftereffect experiments (Bednar and Miikkulainen, 2000; Ciroux, 2005). Surprisingly, the same Hebbian equations that allow neurons and maps to develop selectivity also lead to realistic aftereffects, such as for orientation and color (Fig. 12). In the model, we assume that connections adapt during normal visual experience just as they do in simulated long-term development, albeit with a lower learning rate appropriate for adult vision. If so, neurons that are coactive during a particular visual stimulus (such as a vertical grating) will become slightly more strongly laterally connected as they adapt to that pattern. Subsequently, the response to that pattern will be reduced, due to increased lateral excitation that leads to net (disynaptic) lateral inhibition for high contrast patterns like those in the aftereffect studies. Assuming a population decoding model such as the vector sum (Bednar and Miikkulainen, 2000), there will be no change in the perceived orientation of the adaptation pattern, but the perceived value of a nearby orientation will be repelled away from the adapting stimulus, because the neurons activated during adaptation now inhibit each other more strongly, shifting the population response. These changes are the direct result of Hebbian learning of intracortical connections, as can be shown by disabling learning for all other connections and observing no change in the overall behavior. Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx 13 Fig. 10. Orientation-contrast tuning curves (OCTCs). Surround modulation changes as the orientation of a surrounding annulus is varied relative to a center sine-grating patch (as in the sample stimulus at the bottom right). In each graph A–F reprinted from Antolik (2010), red is the orientation tuning curve (as in Fig. 6) for the given neuron (with just the center grating patch), blue is for surround contrast 50%, and green is for surround contrast 100%. Top row: typically (51% of model neurons tested), a collinear surround is suppressive for these contrasts, but the surround becomes less suppressive as the surround orientation is varied (as for cat (G, reprinted from Sengpiel et al., 1997) and macaque (H, reprinted from Jones et al., 2002)). Middle row: Other patterns seen in the model include high responses at diagonals (D, 20%, as seen in Sengpiel et al. (1997)), strongest suppression not collinear (E, as seen in Jones et al. (2002)), and facilitation for all orientations (F, 5%). The relatively rare pattern in F has not been reported in existing studies, and thus constitutes a prediction. In each case the observed variability is a consequence of the model’s Hebbian learning that leads to a diversity of patterns of lateral connectivity, rather than noise or experimental artifacts. Interestingly, for distant orientations, the human data suggests an attractive effect, with a perceived orientation shifted towards the adaptation orientation (Mitchell and Muir, 1976). The model reproduces this feature as well, and provides the novel explanation that this indirect effect is due to the divisive normalization term in the Hebbian learning equation (Eq. (8)). Specifically, when the neurons activated during adaptation increase their mutual inhibition, the normalization term forces this increase to come at the expense of connections to other neurons not (or only weakly) activated during adaptation. Those neurons are thus disinhibited, and can respond more strongly than before, shifting the response towards the adaptation stimulus. Similar patterns occur for the McCollough Effect (McCollough, 1965) (Fig. 12). Here the adaptation stimulus coactivates neurons selective for orientation, color, or both, and again the lateral interactions between all these neurons are strengthened. Subsequent stimuli then appear different in both color and orientation, in patterns similar to the human data. Interestingly, the McCollough effect can last for months, which suggests that the modeled changes in lateral connectivity can become essentially permanent, though the effects of short-term exposure typically fade in darkness or in subsequent visual experience. Overall, the model suggests that the same process of Hebbian learning could explain both long-term development and shortterm adaptation, unifying phenomena previously considered distinct. Of course, the biophysical mechanisms may indeed be distinct, potentially operating at different time scales and being largely temporary rather than the permanent changes found early in development. Even so, the results here suggest that both early development and adult short-term adaptation may operate using similar mathematical principles. How mechanisms for long-term and short-term plasticity may interact, including possible transitions from long-term to short term plasticity during so-called ‘‘critical periods’’, is an important area for future modeling and experimental studies. Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 14 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx Fig. 11. Surround population effects. Some of the observed variance in surround modulation effects can be explained based on the properties of the measured model neuron or its neighbors. These measurements are based only on a relatively small number of model neurons, because each data point requires a several-hour-long computational experiment to choose the optimal stimuli for testing, but some trends are clear in the data so far. For instance, the modulation ratio and suppression index are somewhat correlated (r = 0.226, p < 0.07) – i.e., simple cells appear to be suppressed a bit more strongly (left). The amount of local homogeneity is a measure of how slowly the map is changing around a given neuron. Model neurons in homogeneous regions of the map exhibit lower orientation-contrast suppression (r = 0.362, p < 0.01; middle) but greater overall surround suppression (r = 0.327, p < 0.05). Although preliminary, these predictions can be tested in animal experiments. Moreover, these analyses suggest that much of the observed variance in surround modulation properties could be due to network and map effects like these, caused by Hebbian learning, with potentially many more possible interactions for neurons embedded in multiple overlaid functional maps. Reprinted from Antolik (2010). Fig. 12. Aftereffects as short-term self-organization. (a) While the fully organized network is repeatedly presented patterns with the same orientation, connection strengths are updated by Hebbian learning (as during development, but at a lower learning rate). The net effect is increased inhibition, which causes the neurons that responded during adaptation to respond less afterwards. When the overall response is summarized as a ‘‘perceived value’’ using a vector average, the result is systematic shifts in perception, such that a previously similar orientation will now seem very different in orientation, while more distant orientations will be unchanged or go in the opposite direction. These patterns are a close match to results from humans (Mitchell and Muir, 1976), suggesting that short-term and long-term adaptation share similar rules. (b) Similar explanations apply to the McCollough effect (McCollough, 1965), an orientation-contingent color aftereffect; here the model predicts that lateral connections between orientation and color-selective neurons cause this effect (Ellis, 1977) (and many others in other maps, such as motion aftereffects). (a) Reprinted from Bednar and Miikkulainen (2000) and replotting data from Mitchell and Muir (1976); (b) reprinted from Ciroux (2005) and replotting data from Ellis (1977). 4. Discussion and future work The results reviewed above illustrate a general approach to understanding the large-scale development, organization, and function of cortical areas. The models show that a relatively small number of basic and largely uncontroversial assumptions and principles may be sufficient to explain a very wide range of experimental results from the visual cortex. Even very simple neural units, i.e., firing-rate point neurons, generically connected into topographic maps with initially random or isotropic weights, can form a wide range of specific feature preferences and maps via unsupervised normalized Hebbian learning of natural images and spontaneous activity patterns. The resulting maps consist of neurons with realistic visual response properties, with variability due to visual context and recent history that explains significant aspects of surround modulation and visual aftereffects. The simulator and example simulations are freely downloadable from topographica.org (see Fig. 13), allowing any interested researcher to build on this work. The long-term goal of this project is to understand whether and how a single, generic cortical circuit and developmental process can account for the major known information-processing properties of the visual cortex. If so, such a model would be a general-purpose model of cortical computation, with potentially many applications beyond computational neuroscience. Similar models have already been used for other cortical regions, such as rodent barrel cortex (Wilson et al., 2010). Combining the existing models into a single, runnable visual system is very much a work in progress, but the results so far suggest that doing so will be both feasible and valuable. Once a full unified model is feasible, an important test will be to determine if it can replicate both the feature-based analyses described in most of the sections above, and also findings that the visual cortex acts as a unified map of spatiotemporal energy (Basole et al., 2003). Although the model results are compatible with the feature-based view, the actual underlying elements of the model are better described as nonlinear filters, responding to some local region of the high-dimensional input space. Replicating the results Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx 15 Fig. 13. Topographica simulator. This Topographica session shows a user analyzing a simple three-level model with two LGN/RGC sheets and one V1 sheet. (Clockwise from top) Displays for activity patterns, gradients, Fourier transforms, CFs for one neuron, the overall network, histograms of orientation preference, orientation maps, and projections (center) are shown. The simulator allows any number of sheets to be defined, interconnected, and analyzed easily, for any simulation defined as a network of interacting two-dimensional sheets of neurons. from Basole et al. (2003) in the model would help unify these two competing explanations for cortical function, showing how the same underlying system could have responses that are related both to stimulus energy and to stimulus features. Building realistic models of this kind depends on having a realistic model of a subcortical pathway. The approach illustrated in Fig. 1 allows arbitrarily detailed implementation of subcortical populations, and allows clear specification of model properties. However, it also results in a large number of subcortical neural sheets that can be difficult to simulate, and these also introduce a significant number of new parameters. An alternative approach is to model the retina as primarily driven by random wiring, an idea which has recently been verified to a large extent (Field et al., 2010). In this approach, a regular hexagonal grid of photoreceptors would be set up initially, and then a wide and continuous range of retinal ganglion cell types would be created by randomly connecting photoreceptors over local areas of the grid. This process would require some parameter tuning to ensure that the results cover the range of properties seen in animals, but could automatically generate realistic RGCs to drive V1 development. At present, all of the models reviewed contain feedforward and lateral connections, but no feedback from higher cortical areas to V1 or from V1 to the LGN, because such feedback has not been found necessary to replicate the features surveyed. However, note that nearly all of the physiological data considered was from anesthetized animals not engaged in any visually mediated behaviors. Under those conditions, it is not surprising that feedback would have relatively little effect. Corticocortical and corticothalamic feedback is likely to be crucial to explain how these circuits oper- ate during natural vision (Sillito et al., 2006; Thiele et al., 2009), and determining the form and function of this feedback is an important aspect of developing a general-purpose cortical model. Because they focus on long-term development, the models discussed here implement time at a relatively coarse level. Most of the models update the retinal image only a few times each second, and thus are not suitable for studying the detailed time course of neural responses. A fully detailed model would need to include spiking, which makes most of the analyses presented here more difficult and much more time consuming, but it is possible to simulate even quite fine time scales at the level of a peri-stimulus-time histogram (PSTH). I.e., rather than simulating individual spike events, one can calibrate model neurons against a PSTH from a single neuron, thus matching its average temporal response over time, or against the average population response (e.g. measured by voltage-sensitive dye (VSD) imaging). Recent work shows that it is possible to replicate quite detailed transient-onset LGN and V1 PSTHs in GCAL using the same neural architecture as in the models discussed here, simply by adding hysteresis to control how fast neurons can become excited and by simulating at a detailed 0.5-ms time step (Stevens, 2011). Importantly, the transient responses are a natural consequence of lateral connections in the model, and do not require a more complex model of each neuron. Future work will calibrate the model responses to the full time course of population activity using VSD imaging, and investigate how self-organization is affected by using this finer time scale during long-term development. Because GCAL models are driven by afferent activity, the type and properties of the visual input patterns assumed are important Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 16 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx aspects of each model. Ideally, a model of visual system development in primates would be driven by color, stereo, foveated video streams replicating typical patterns of eye movements, movements of an animal in its environment, and responses to visual patterns. Collecting data of this sort is difficult, and moreover cannot capture any causal or contingent relationships between the current visual input and the current neural organization that can affect future eye and organism movements that will then change the visual input. In the long run, to account for more complex aspects of visual system development such as visual object recognition and optic flow processing, it will be necessary to implement the models as embodied, situated agents (Pylyshyn, 2000; Weng et al., 2001) embedded in the real world or in realistic 3D virtual environments. Building such robotic or virtual agents will add significant additional complexity, however, so it is important first to see how much of the behavior of V1 neurons can be addressed by the present ‘‘open-loop’’, non-situated approach. As discussed throughout, the main focus of this modeling work has been on replicating experimental data using a small number of computational primitives and mechanisms, with a goal of providing a concise, concrete, and relatively simple explanation for a wide and complex range of experimental findings. A complete explanation of visual cortex development and function would go even further, demonstrating more clearly why the cortex should be built in this way, and precisely what information-processing purpose this circuit performs. For instance, realistic RFs can be obtained from ‘‘normative’’ models embodying the idea that the cortex is developing a set of basis functions to represent input patterns faithfully, with only a few active neurons (Olshausen and Field, 1996; Bell and Sejnowski, 1997; Hyvärinen and Hoyer, 2001; Rehn and Sommer, 2007), maps can emerge by minimizing connection lengths in the cortex (Koulakov and Chklovskii, 2001), and lateral connections can be modeled as decorrelating the input patterns (Barlow and Földiák, 1989; Dong, 1995). The GCAL model can be seen as a concrete, mechanistic implementation of these ideas, showing how a physically realizable local circuit could develop RFs with good coverage of the input space, via lateral interactions that also implement sparsification via decorrelation (Miikkulainen et al., 2005). Making more explicit links between mechanistic models like GCAL and normative theories is an important goal for future work. Meanwhile, there are many aspects of cortical function not explained by current normative models. The focus of the current line of research is on first capturing those phenomena in a mechanistic model, so that researchers can then build deeper explanations for why these computations are useful for the organism. As previously emphasized, many of the individual results found with GCAL can also be obtained using other modeling approaches, which can be complementary to the processes modeled by GCAL. For instance, it is possible to generate orientation maps without any activity-dependent plasticity, through the initial wiring pattern between the retina and the cortex (Ringach, 2007; Paik and Ringach, 2011) or within the cortex itself (Grabska-Barwinska and von der Malsburg, 2008). Such an approach cannot explain subsequent experience-dependent development, whereas the Hebbian approach of GCAL can explain both the initial map and later plasticity, but it is of course possible that the initial map and plasticity occur via different mechanisms. Other models are based on abstractions of some of the mechanisms in GCAL (Yu et al., 2005; Farley et al., 2007; Obermayer et al., 1990; Wolf and Geisel, 2003), operating similarly but at a higher level. GCAL is not meant as a competitor to such models, but as a concrete, physically realizable implementation of those ideas, forming a prototype of both the biological system and potential future artificial vision systems. 5. Conclusions The GCAL model results suggest that it will soon be feasible to build a single model visual system that will account for a very large fraction of the visual response properties, at the firing rate level, of V1 neurons in a particular species. Such a model will help researchers make testable predictions to drive future experiments to understand cortical processing, as well as determine which properties require more complex approaches, such as feedback, attention, and detailed neural geometry and dynamics. The model suggests that cortical neurons develop to cover the typical range of variation in their thalamic inputs, within the context of a smooth, multidimensional topographic map, and that lateral connections store pairwise correlations and use this information to modulate responses to natural scenes, dynamically adapting to both long-term and short-term visual input statistics. Because the model cortex starts without any specialization for vision, it represents a general model for any cortical region, and is also an implementation for a generic information processing device that could have important applications outside of neuroscience. By integrating and unifying a wide range of experimental results, the model should thus help advance our understanding of cortical processing and biological information processing in general. Acknowledgements Thanks to all of the collaborators whose modelling work is reviewed here, and to the members of the Developmental Computational Neuroscience research group, the Institute for Adaptive and Neural Computation, and the Doctoral Training Centre in Neuroinformatics, at the University of Edinburgh, for discussions and feedback on many of the models. This work was supported in part by the UK EPSRC and BBSRC Doctoral Training Centre in Neuroinformatics, under grants EP/F500385/1 and BB/F529254/1, and by the US NIMH grant R01-MH66991. Computational resources were provided by the Edinburgh Compute and Data Facility (ECDF). References Adelson, E.H., Bergen, J.R., 1985. Spatiotemporal energy models for the perception of motion. Journal of the Optical Society of America A 2, 284–299, <http:// web.mit.edu/persci/people/adelson/pub_pdfs/spatio85.pdf>. Albrecht, D.G., Geisler, W.S., 1991. Motion selectivity and the contrast-response function of simple cells in the visual cortex. Visual Neuroscience 7, 531–546, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=1772804>. Alitto, H.J., Usrey, W.M., 2008. Origin and dynamics of extraclassical suppression in the lateral geniculate nucleus of the macaque monkey. Neuron 57 (1), 135–146, <http://dx.doi.org/10.1016/j.neuron.2007.11.019>. Antolik, J., 2010. Unified developmental model of maps, complex cells and surround modulation in the primary visual cortex. Ph.D. thesis, School of Informatics, The University of Edinburgh, Edinburgh, UK. <http://hdl.handle.net/1842/4875>. Antolik, J., Bednar, J.A., 2011. Development of maps of simple and complex cells in the primary visual cortex. Frontiers in Computational Neuroscience 5, 17, <http://dx.doi.org/10.3389/fncom.2011.00017>. Aronov, D., Reich, D.S., Mechler, F., Victor, J.D., 2003. Neural coding of spatial phase in V1 of the macaque monkey. Journal of Neurophysiology 89 (6), 3304–3327, <http://dx.doi.org/10.1152/jn.00826.2002>. Ball, C.E., Bednar, J.A., 2009. A self-organizing model of color, ocular dominance, and orientation selectivity in the primary visual cortex. In: Society for Neuroscience Abstracts, Society for Neuroscience. <http://www.sfn.org> (Program No. 756.9). Barlow, H.B., Földiák, P., 1989. Adaptation and decorrelation in the cortex. In: Durbin, R., Miall, C., Mitchison, G. (Eds.), The Computing Neuron. AddisonWesley, Reading, MA, pp. 54–72. Basole, A., White, L.E., Fitzpatrick, D., 2003. Mapping multiple features in the population response of visual cortex. Nature 424 (6943), 986–990, <http:// dx.doi.org/10.1038/nature01721>. Bednar, J.A., Miikkulainen, R., 2000. Tilt aftereffects in a self-organizing model of the primary visual cortex. Neural Computation 12 (7), 1721–1740, <http:// nn.cs.utexas.edu/keyword?bednar:nc00>. Bednar, J.A., Miikkulainen, R., 2003. Self-organization of spatiotemporal receptive fields and laterally connected direction and orientation maps. Neurocomputing 52–54, 473–480, <http://nn.cs.utexas.edu/keyword? bednar :neuro computing03>. Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx Bednar, J.A., Miikkulainen, R., 2004. Prenatal and postnatal development of laterally connected orientation maps. In: Computational Neuroscience: Trends in Research, 2004, pp. 985–992. <http://nn.cs.utexas.edu/keyword?bednar: neurocomputing04-or>. Bednar, J.A., Miikkulainen, R., 2004. Prenatal and postnatal development of laterally connected orientation maps. Neurocomputing 58–60, 985–992, <http:// nn.cs.utexas.edu/keyword?bednar:neurocomputing04-or>. Bednar, J.A., Miikkulainen, R., 2006. Joint maps for orientation, eye, and direction preference in a self-organizing model of V1. Neurocomputing 69 (10–12), 1272–1276, <http://nn.cs.utexas.edu/keyword?bednar:neurocomputing06>. Bednar, J.A., Kelkar, A., Miikkulainen, R., 2004. Scaling self-organizing maps to model large cortical networks. Neuroinformatics 2, 275–302, <http:// nn.cs.utexas.edu/keyword?bednar:neuroinformatics04>. Bell, A.J., Sejnowski, T.J., 1997. The ‘‘independent components’’ of natural scenes are edge filters. Vision Research 37, 3327–3338, <http://citeseer.nj.nec.com/ bell97independent.html>. Bienenstock, E.L., Cooper, L.N., Munro, P.W., 1982. Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex. The Journal of Neuroscience 2, 32–48, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_ uids=7054394>. Binzegger, T., Douglas, R.J., Martin, K.A.C., 2009. Topology and dynamics of the canonical circuit of cat V1. Neural Networks 22 (8), 1071–1078, <http:// dx.doi.org/10.1016/j.neunet.2009.07.011>. Blasdel, G.G., 1992a. Orientation selectivity, preference, and continuity in monkey striate cortex. The Journal of Neuroscience 12, 3139–3161, <http:// www.jneurosci.org/cgi/content/abstract/12/8/3139>. Bonin, V., Mante, V., Carandini, M., 2005. The suppressive field of neurons in lateral geniculate nucleus. Journal of Neuroscience 25, 10844–10856, <http:// dx.doi.org/10.1523/JNEUROSCI.3562-05.2005>. Bosking, W.H., Zhang, Y., Schofield, B.R., Fitzpatrick, D., 1997. Orientation selectivity and the arrangement of horizontal connections in tree shrew striate cortex. The Journal of Neuroscience 17 (6), 2112–2127, <http://www.jneurosci.org/cgi/ content/full/17/6/2112>. Bosking, W.H., Crowley, J.C., Fitzpatrick, D., 2002. Spatial coding of position and orientation in primary visual cortex. Nature Neuroscience 5 (9), 874–882, <http://dx.doi.org/10.1038/nn908>. Chance, F.S., Nelson, S.B., Abbott, L.F., 1999. Complex cells as cortically amplified simple cells. Nature Neuroscience 2 (3), 277–282, <http:// www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=10195222>. Ciroux, J., 2005. Simulating the McCollough effect in a self-organizing model of the primary visual cortex, Master’s thesis, The University of Edinburgh, Scotland, UK. <http://www.inf.ed.ac.uk/publications/thesis/online/IM050308.pdf>. Coppola, D.M., White, L.E., Fitzpatrick, D., Purves, D., 1998. Unequal representation of cardinal and oblique contours in ferret visual cortex. Proceedings of the National Academy of Sciences of United States of America 95 (5), 2621–2623, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=9482936>. Dacey, D.M., 2000. Parallel pathways for spectral coding in primate retina. Annual Review of Neuroscience 23, 743–775, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=10845080>. De Paula, J.B. 2007. Modeling the self-organization of color selectivity in the visual cortex, Ph.D. thesis, Department of Computer Sciences, The University of Texas at Austin, Austin, TX. <http://nn.cs.utexas.edu/keyword?depaula:phd07> Dong, D.W., 1995. Associative decorrelation dynamics: a theory of self-organization and optimization in feedback networks. In: Tesauro, G., Touretzky, D.S., Leen, T.K. (Eds.), Advances in Neural Information Processing Systems, vol. 7. MIT Press, Cambridge, MA, pp. 925–932, <ftp://ftp.ci.tuwien.ac.at/pub/texmf/bibtex/ nips-7.bib>. Ellis, S.R., 1977. Orientation selectivity of the McCollough effect: analysis by equivalent contrast transformation. Perception and Psychophysics 22 (6), 539– 544. Erwin, E., Obermayer, K., Schulten, K.J., 1995. Models of orientation and ocular dominance columns in the visual cortex: a critical comparison. Neural Computation 7 (3), 425–468, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=8935959>. Farley, B.J., Yu, H., Jin, D.Z., Sur, M., 2007. Alteration of visual input results in a coordinated reorganization of multiple visual cortex maps. The Journal of Neuroscience 27 (38), 10299–10310, <http://dx.doi.org/10.1523/ JNEUROSCI.2257-07.2007>. Felisberti, F., Derrington, A.M., 1999. Long-range interactions modulate the contrast gain in the lateral geniculate nucleus of cats. Visual Neuroscience 16, 943–956. Field, G.D., Gauthier, J.L., Sher, A., Greschner, M., Machado, T.A., Jepson, L.H., Shlens, J., Gunning, D.E., Mathieson, K., Dabrowski, W., Paninski, L., Litke, A.M., Chichilnisky, E.J., 2010. Functional connectivity in the retina at the resolution of photoreceptors. Nature 467 (7316), 673–677, <http://dx.doi.org/10.1038/ nature09424>. Gilbert, C.D., Wiesel, T.N., 1989. Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex. The Journal of Neuroscience 9, 2432–2442, <http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=retrieve& db=pubmed&dopt=abstract&list_uids=2746337>. Goodhill, G.J., 2007. Contributions of theoretical modeling to the understanding of neural map development. Neuron 56 (2), 301–311, <http://dx.doi.org/10.1016/ j.neuron.2007.09.027>. Grabska-Barwinska, A., von der Malsburg, C., 2008. Establishment of a scaffold for orientation maps in primary visual cortex of higher mammals. The Journal of 17 Neuroscience 28 (1), 249–257, <http://dx.doi.org/10.1523/JNEUROSCI.551406.2008>. Hubel, D.H., Wiesel, T.N., 1962. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. The Journal of Physiology 160, 106–154, <http://www.pubmedcentral.nih.gov/articlerender.fcgi? tool= pubmed&pubmedid=14449617>. Hyvärinen, A., Hoyer, P.O., 2001. A two-layer sparse coding model learns simple and complex cell receptive fields and topography from natural images. Vision Research 41 (18), 2413–2423, <http://www.sciencedirect.com/science/article/ B6T0W-43GCBN2-B/1/94ef0f6b0d8cf1d97998c9e4c392e0c2>. Jegelka, S., Bednar, J.A., Miikkulainen, R., 2006. Prenatal development of ocular dominance and orientation maps in a self-organizing model of V1. Neurocomputing 69, 1291– 1296, <http://nn.cs.utexas.edu/keyword? jegelka:cns05>. Jin, J., Wang, Y., Swadlow, H.A., Alonso, J.M., 2011. Population receptive fields of ON and OFF thalamic inputs to an orientation column in visual cortex. Nature Neuroscience 14 (2), 232–238, <http://dx.doi.org/10.1038/nn.2729>. Jones, H.E., Wang, W., Sillito, A.M., 2002. Spatial organization and magnitude of orientation contrast interactions in primate V1. Journal of Neurophysiology 88 (5), 2796–2808, <http://dx.doi.org/10.1152/jn.00403.2001>. Kara, P., Boyd, J.D., 2009. A micro-architecture for binocular disparity and ocular dominance in visual cortex. Nature 458 (7238), 627–631, <http://dx.doi.org/ 10.1038/nature07721>. Klug, K., Herr, S., Ngo, I.T., Sterling, P., Schein, S., 2003. Macaque retina contains an Scone OFF midget pathway. Journal of Neuroscience 23, 9881–9887. Koulakov, A.A., Chklovskii, D.B., 2001. Orientation preference patterns in mammalian visual cortex: a wire length minimization approach. Neuron 29, 519–527. U, <http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=retrieve &db=pubmed&dopt=abstract&list_uids=11239440>. Law, J.S., 2009. Modeling the development of organization for orientation preference in primary visual cortex. Ph.D. thesis, School of Informatics, The University of Edinburgh, Edinburgh, UK. <http://hdl.handle.net/1842/3935>. Law, J.S., Antolik, J., Bednar, J.A., 2011. Mechanisms for stable and robust development of orientation maps and receptive fields. Tech. rep., School of Informatics, The University of Edinburgh, EDI-INF-RR-1404. <http:// www.inf.ed.ac.uk/publications/report/1404.html>. Liu, Z., Gaska, J.P., Jacobson, L.D., Pollen, D.A., 1992. Interneuronal interaction between members of quadrature phase and anti-phase pairs in the cat’s visual cortex. Vision Research 32 (7), 1193–1198. McCollough, C., 1965. Color adaptation of edge-detectors in the human visual system. Science 149 (3688), 1115–1116, <http://links.jstor.org/sici?sici=00368075%2819650903%293%3A149%3A3688%3C1115%3ACAOEIT%3E2.0.CO%3B2-7>. Miikkulainen, R., Bednar, J.A., Choe, Y., Sirosh, J., 2005. Computational Maps in the Visual Cortex. Springer, Berlin, <http://computationalmaps.org>. Miller, K.D., 1994. A model for the development of simple cell receptive fields and the ordered arrangement of orientation columns through activity-dependent competition between ON- and OFF-center inputs. The Journal of Neuroscience 14, 409–441, <http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd =retrieve&db=pubmed&dopt=abstract&list_uids=8283248>. Miller, K.D., MacKay, D.J.C., 1994. The role of constraints in Hebbian learning. Neural Computation 6, 100–126, <http://wol.ra.phy.cam.ac.uk/mackay/abstracts/ constraints.html>. Mitchell, D.E., Muir, D.W., 1976. Does the tilt aftereffect occur in the oblique meridian? Vision Research 16, 609–613, <http://dx.doi.org/10.1016/00426989(76)90007-9>. Nauhaus, I., Benucci, A., Carandini, M., Ringach, D.L., 2008. Neuronal selectivity and local map structure in visual cortex. Neuron 57 (5), 673–679, <http://dx.doi.org/ 10.1016/j.neuron.2008.01.020>. Obermayer, K., Ritter, H., Schulten, K.J., 1990. A principle for the formation of the spatial structure of cortical feature maps. Proceedings of the National Academy of Sciences of United States of America 87, 8345–8349, <http://www.pnas.org/ cgi/content/abstract/87/21/8345>. Olmos, A., Kingdom, F.A.A., 2004. McGill calibrated colour image database. <http:// tabby.vision.mcgill.ca>. Olshausen, B.A., Field, D.J., 1996. Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381, 607–609, <http:// www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=8637596>. Paik, S.-B., Ringach, D.L., 2011. Retinal origin of orientation maps in visual cortex. Nature Neuroscience 14 (7), 919–925, <http://dx.doi.org/10.1038/nn.2824>. Palmer, C.M., 2009. Topographic and laminar models for the development and organisation of spatial frequency and orientation in V1, Ph.D. thesis, School of Informatics, The University of Edinburgh, Edinburgh, UK. <http:// hdl.handle.net/1842/4114>. Purushothaman, G., Khaytin, I., Casagrande, V.A., 2009. Quantification of optical images of cortical responses for inferring functional maps. Journal of Neurophysiology 101 (5), 2708–2724, <http://dx.doi.org/10.1152/jn.90696.2008>. Pylyshyn, Z.W., 2000. Situating vision in the world. Trends in Cognitive Sciences 4 (5), 197–207, <http://www.biomednet.com/library/abstract/TICS.etd00309_ 13646 613_v0004i05_00001477>. Ramtohul, T. 2006. A self-organizing model of disparity maps in the primary visual cortex. Master’s thesis, The University of Edinburgh, Scotland, UK. <http:// www.inf.ed.ac.uk/publications/thesis/online/IM060400.pdf>. Rehn, M., Sommer, F.T., 2007. A network that uses few active neurones to code visual input predicts the diverse shapes of cortical receptive fields. Journal of Computational Neuroscience 22 (2), 135–146, <http://dx.doi.org/10.1007/ s10827-006-0003-9>. Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001 18 J.A. Bednar / Journal of Physiology - Paris xxx (2012) xxx–xxx Ringach, D.L., 2007. On the origin of the functional architecture of the cortex. PLoS One 2 (2), e251, <http://dx.doi.org/10.1371/journal.pone.0000251>. Ringach, D.L., Shapley, R.M., Hawken, M.J., 2002. Orientation selectivity in macaque V1: diversity and laminar dependence. The Journal of Neuroscience 22 (13), 5639–5651, <http://www.jneurosci.org/content/22/13/5639.long>. Roerig, B., Kao, J.P., 1999. Organization of intracortical circuits in relation to direction preference maps in ferret visual cortex. The Journal of Neuroscience 19 (24), RC44, <http://www.jneurosci.org/content/19/24/RC44.long>. Saul, A.B., Humphrey, A.L., 1992. Evidence of input from lagged cells in the lateral geniculate nucleus to simple cells in cortical area 17 of the cat. Journal of Neurophysiology 68 (4), 1190–1208, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=1432077>. Sceniak, M.P., Ringach, D.L., Hawken, M.J., Shapley, R., 1999. Contrast’s effect on spatial summation by macaque V1 neurons. Nature Neuroscience 2, 733–739. Sclar, G., Freeman, R.D., 1982. Orientation selectivity in the cat’s striate cortex is invariant with stimulus contrast. Experimental Brain Research 46, 457–461, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=7095050>. Sengpiel, F., Sen, A., Blakemore, C., 1997. Characteristics of surround inhibition in cat area 17. Experimental Brain Research 116 (2), 216–228. Sillito, A.M., Cudeiro, J., Jones, H.E., 2006. Always returning: feedback and sensory processing in visual cortex and thalamus. Trends in Neuroscience 29 (6), 307– 316, <http://dx.doi.org/10.1016/j.tins.2006.05.001>. Sincich, L.C., Blasdel, G.G., 2001. Oriented axon projections in primary visual cortex of the monkey. The Journal of Neuroscience 21, 4416–4426, <http:// www.jneurosci.org/cgi/content/abstract/21/12/4416>. Stevens, J.-L., 2011. A temporal model of neural activity and VSD response in the primary visual cortex. Master’s thesis, The University of Edinburgh, Scotland, UK. <http://www.inf.ed.ac.uk/publications/thesis/online/IT111096.pdf>. Stockman, A., Sharpe, L.T., 2000. The spectral sensitivities of the middle- and longwavelength-sensitive cones derived from measurements in observers of known genotype. Vision Research 40, 1711–1737. Stockman, A., Sharpe, L.T., Fach, C., 1999. The spectral sensitivity of the human short-wavelength sensitive cones derived from thresholds and color matches. Vision Research 39, 2901–2927. Swindale, N.V., 1996. The development of topography in the visual cortex: a review of models. Network: Computation in Neural Systems 7, 161–247, <http:// www.iop.org/EJ/S/3/251/kxtGlJZKth350bhDwPkibw/article/0954-898X/7/2/ 002/ne62r2.pdf>. Thiele, A., Pooresmaeili, A., Delicato, L.S., Herrero, J.L., Roelfsema, P.R., 2009. Additive effects of attention and stimulus contrast in primary visual cortex. Cerebral Cortex 19 (12), 2970–2981, <http://dx.doi.org/10.1093/cercor/bhp070>. Thompson, P., Burr, D., 2009. Visual aftereffects. Current Biology 19 (1), R11–14, <http://dx.doi.org/10.1016/j.cub.2008.10.014>. Turrigiano, G.G., 1999. Homeostatic plasticity in neuronal networks: the more things change, the more they stay the same. Trends in Neurosciences 22 (5), 221–227, <http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd =retrieve&db= pubmed&dopt=abstract&list_uids=10322495>. Vidyasagar, T.R., Kulikowski, J.J., Lipnicki, D.M., Dreher, B., 2002. Convergence of parvocellular and magnocellular information channels in the primary visual cortex of the macaque. European Journal of Neuroscience 16 (5), 945–956, <http://dx.doi.org/10.1046/j.1460-9568.2002.02137.x>. Wang, C., Bardy, C., Huang, J.Y., FitzGibbon, T., Dreher, B., 2009. Contrast dependence of center and surround integration in primary visual cortex of the cat. Journal of Vision 9 (1), 20.1–1, <http://dx.doi.org/10.1167/9.1.20>. Weber, C., maps, 2001. Self-organization of orientation maps, lateral connections, and dynamic receptive fields in the primary visual cortex. In: Proceedings of the International Conference on Artificial Neural Networks. Lecture Notes in Computer Science, vol. 2130. Springer, Berlin, pp. 1147–1152, <http:// www.bcs.rochester.edu/cweber/publications/ICANN01.ps>. Weliky, M., Bosking, W.H., Fitzpatrick, D., 1996. A systematic map of direction preference in primary visual cortex. Nature 379, 725–728, <http:// www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=8602218>. Weng, J., McClelland, J.L., Pentland, A., Sporns, O., Stockman, I., Sur, M., Thelen, E., 2001. Autonomous mental development by robots and animals. Science 291 (5504), 599– 600, <http://www.sciencemag.org/cgi/content/full/291/5504/599>. Wilson, S.P., Law, J.S., Mitchinson, B., Prescott, T.J., Bednar, J.A., 2010. Modeling the emergence of whisker direction maps in rat barrel cortex. PLoS One 5 (1), e8778, <http://dx.doi.org/10.1371/journal.pone.0008778>. Wolf, F., Geisel, T., 2003. Universality in visual cortical pattern formation. Journal of Physiology – Paris 97 (2–3), 253–264, <http://dx.doi.org/10.1016/ j.jphysparis.2003.09.018>. Wolfe, J., Palmer, L.A., 1998. Temporal diversity in the lateral geniculate nucleus of cat. Visual Neuroscience 15 (4), 653–675, <http://www.journals.cambridge.org/ issue_VisualNeuroscience/Vol15No04>. Wong, R.O.L., 1999. Retinal waves and visual system development. Annual Review of Neuroscience 22, 29–47, <http://www.ncbi.nlm.nih.gov/entrez/ query.fcgi?cmd=retrieve&db=pubmed&dopt=abstract&list_uids=10202531>. Xiao, Y., Casti, A., Xiao, J., Kaplan, E., 2007. Hue maps in primate striate cortex. Neuroimage 35 (2), 771–786, <http://dx.doi.org/10.1016/ j.neuroimage.2006.11.059>. Xu, X., Anderson, T.J., Casagrande, V.A., 2007. How do functional maps in primary visual cortex vary with eccentricity? Journal of Comparative Neurology 501 (5), 741–755, <http://dx.doi.org/10.1002/cne.21277>. Yu, H., Farley, B.J., Jin, D.Z., Sur, M., 2005. The coordinated mapping of visual space and response features in visual cortex. Neuron 47 (2), 267–280, <http:// dx.doi.org/10.1016/j.neuron.2005.06.011>. Please cite this article in press as: Bednar, J.A. Building a mechanistic model of the development and function of the primary visual cortex. J. Physiol. (2012), http://dx.doi.org/10.1016/j.jphysparis.2011.12.001