1760 lines
102 KiB
Plaintext
1760 lines
102 KiB
Plaintext
Canonical microcircuits for predictive coding
|
||
|
||
Andre M. Bastos1,2,6, W. Martin Usrey1,3,4, Rick A. Adams8, George R. Mangun2,3,5, Pascal
|
||
Fries6,7, and Karl J. Friston8
|
||
|
||
1Center for Neuroscience, University of California-Davis, Davis, CA 95618 USA. 2Center for Mind
|
||
and Brain, University of California-Davis, Davis, CA 95618 USA. 3Department of Neurology,
|
||
University of California-Davis, Davis, CA 95618 USA. 4Department of Neurobiology, Physiology
|
||
and Behavior, University of California-Davis, Davis, CA 95618 USA. 5Department of Psychology,
|
||
University of California-Davis, Davis, CA 95618 USA. 6Ernst Strüngmann Institute (ESI) for
|
||
Neuroscience in Cooperation with Max Planck Society, Deutschordenstraße 46, 60528 Frankfurt,
|
||
Germany. 7Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen,
|
||
Kapittelweg 29, 6525 EN Nijmegen, Netherlands. 8The Wellcome Trust Centre for Neuroimaging,
|
||
University College London, Queen Square, London WC1N 3BG, UK.
|
||
|
||
Summary
|
||
|
||
This review considers the influential notion of a canonical (cortical) microcircuit in light of recent
|
||
theories about neuronal processing. Specifically, we conciliate quantitative studies of
|
||
microcircuitry and the functional logic of neuronal computations. We revisit the established idea
|
||
that message passing among hierarchical cortical areas implements a form of Bayesian inference –
|
||
paying careful attention to the implications for intrinsic connections among neuronal populations.
|
||
By deriving canonical forms for these computations, one can associate specific neuronal
|
||
populations with specific computational roles. This analysis discloses a remarkable
|
||
correspondence between the microcircuitry of the cortical column and the connectivity implied by
|
||
predictive coding. Furthermore, it provides some intuitive insights into the functional asymmetries
|
||
between feedforward and feedback connections and the characteristic frequencies over which they
|
||
operate.
|
||
|
||
Keywords
|
||
|
||
neuronal; connectivity; cortical; microcircuit; computation; predictive coding; free energy
|
||
principle; gamma oscillations; beta oscillations
|
||
|
||
Introduction
|
||
|
||
The idea that the brain actively constructs explanations for its sensory inputs is now
|
||
generally accepted. This notion builds on a long history of proposals that the brain uses
|
||
internal or generative models to make inferences about the causes of its sensorium
|
||
(Helmholtz, 1860; Gregory 1968, 1980; Dayan et al., 1995). In terms of implementation,
|
||
predictive coding is, arguably, the most plausible neurobiological candidate for making
|
||
these inferences (Srinivasan et al., 1982; Mumford, 1992; Rao and Ballard, 1999). This
|
||
review considers the canonical microcircuit in light of predictive coding. We focus on the
|
||
intrinsic connectivity within a cortical column and the extrinsic connections between
|
||
|
||
Correspondence: Karl Friston The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, 12
|
||
Queen Square, London, WC1N 3BG, UK. Tel (44) 207 833 7454 k.friston@ucl.ac.uk.
|
||
|
||
NIH Public Access
|
||
Author Manuscript
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
Published in final edited form as:
|
||
Neuron. 2012 November 21; 76(4): 695–711. doi:10.1016/j.neuron.2012.10.038.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
columns in different cortical areas. We try to relate this circuitry to neuronal computations
|
||
by showing the computational dependencies – implied by predictive coding – recapitulate
|
||
the physiological dependencies implied by quantitative studies of intrinsic connectivity. This
|
||
issue is important as distinct neuronal dynamics in different cortical layers are becoming
|
||
increasingly apparent (de Kock et al., 2007; Sakata and Harris, 2009; Maier et al., 2010;
|
||
Bollimunta et al., 2011). For example, recent findings suggest that the superficial layers of
|
||
cortex show neuronal synchronization and spike-field coherence predominantly in the
|
||
gamma frequencies, while deep layers prefer lower (alpha or beta) frequencies (Roopun et
|
||
al., 2006, 2008; Maier et al., 2010; Buffalo et al., 2011). Since feedforward connections
|
||
originate predominately from superficial layers and feedback connections from deep layers,
|
||
these differences suggest that feedforward connections use relatively high frequencies,
|
||
compared to feedback connections, as recently demonstrated empirically (Bosman et al.,
|
||
2012). These asymmetries call for something quite remarkable: namely, a synthesis of
|
||
spectrally distinct inputs to a cortical column and the segregation of its outputs. This
|
||
segregation can only arise from local neuronal computations that are structured and
|
||
precisely interconnected. It is the nature of this intrinsic connectivity – and the dynamics it
|
||
supports – that we consider. The aim of this review is to speculate about the functional roles
|
||
of neuronal populations in specific cortical layers in terms of predictive coding. Our long-
|
||
term aim is to create computationally informed models of microcircuitry that can be tested
|
||
with dynamic causal modelling (David et al., 2006; Moran et al., 2008, 2011).
|
||
|
||
This review comprises three sections. We start with an overview of the anatomy and
|
||
physiology of cortical connections – with an emphasis on quantitative advances. The second
|
||
section considers the computational role of the canonical microcircuit that emerges from
|
||
these studies. The third section provides a formal treatment of predictive coding and defines
|
||
the requisite computations in terms of differential equations. We then associate the form of
|
||
these equations with the canonical microcircuit to define a computational architecture. We
|
||
conclude with some predictions about intrinsic connections and note some important
|
||
asymmetries in feedforward and feedback connections that emerge from this treatment.
|
||
|
||
The anatomy and physiology of cortical connections
|
||
|
||
This section reviews laminar-specific connections that underlie the notion of a canonical
|
||
microcircuit (Douglas et al., 1989; Douglas and Martin, 1991, 2004). We first focus on
|
||
mammalian visual cortex and then consider whether visual microcircuitry can be generalized
|
||
to a canonical circuit for the entire cortex. Both functional and anatomical techniques have
|
||
been applied to study intrinsic (intracortical) and extrinsic connections. We will emphasise
|
||
the insights from recent studies that combine both techniques.
|
||
|
||
Intrinsic connections and the canonical microcircuit
|
||
|
||
The seminal work of Douglas and Martin (1991), in the cat visual system, produced a model
|
||
of how information flows through the cortical column. Douglas and Martin recorded
|
||
intracellular potentials from cells in primary visual cortex during electrical stimulation of its
|
||
thalamic afferents. They noted a stereotypical pattern of fast excitation, followed by slower
|
||
and longer-lasting inhibition. The latency of the ensuing hyperpolarisation distinguished
|
||
responses in supragranular and infragranular layers. Using conductance-based models, they
|
||
showed that a simple model could reproduce these responses. Their model contained
|
||
superficial and deep pyramidal cells with a common pool of inhibitory cells. All three
|
||
neuronal populations received thalamic drive and were fully interconnected. The deep
|
||
pyramidal cells received relatively weak thalamic drive but strong inhibition (Figure 1).
|
||
These interconnections allowed the circuit to amplify transient thalamic inputs to generate
|
||
sustained activity in the cortex, while maintaining a balance between excitation and
|
||
inhibition, two tasks that must be solved by any cortical circuit. Their circuit, although based
|
||
|
||
Bastos et al.
|
||
Page 2
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
on recordings from cat visual cortex, was also proposed as a basic theme that might be
|
||
present and replicated, with minor variations, throughout the cortical sheet (Douglas et al.,
|
||
1989).
|
||
|
||
Subsequent studies have used intracellular recordings and histology to measure spikes (and
|
||
depolarisation) in pre and post-synaptic cells, whose cellular morphology can be determined.
|
||
This approach quantifies both the connection probability – defined as the number of
|
||
observed connections divided by total number of pairs recorded – and connection strength –
|
||
defined in terms of post-synaptic responses. Thomson et al (2002) used these techniques to
|
||
study layers 2 to 5 (L2 to L5) of the cat and rat visual systems. The most frequently
|
||
connected cells were located in the same cortical layer, where the largest interlaminar
|
||
projections were the ‘feedforward’ connections from L4 to L3 and from L3 to L5. Excitatory
|
||
reciprocal ‘feedback’ connections were not observed (L3 to L4) or less commonly observed
|
||
(L5 to L3), suggesting that excitation spreads within the column in a feedforward fashion.
|
||
Feedback connections were typically seen when pyramidal cells in one layer targeted
|
||
inhibitory cells in another (see Thomson and Bannister, 2003 for a review).
|
||
|
||
While many studies have focused on excitatory connections, a few have examined inhibitory
|
||
connections. These are more difficult to study, because inhibitory cells are less common
|
||
than excitatory cells, and because there are at least seven distinct morphological classes
|
||
(Salin and Bullier, 1995). However, recent advances in optogenetics have made it possible
|
||
to more easily target inhibitory cells: Kätzel and colleagues combined optogenetics and
|
||
whole-cell recording to investigate the intrinsic connectivity of inhibitory cells in mouse
|
||
cortical areas M1, S1, and V1 (Kätzel et al., 2010). They transgenically expressed
|
||
channelrhodopsin in inhibitory neurons and activated them, while recording from pyramidal
|
||
cells. This allowed them to assess the effect of inhibition as a function of laminar position
|
||
relative to the recorded neuron.
|
||
|
||
Several conclusions can be drawn from this approach (Kätzel et al., 2010): first, L4
|
||
inhibitory connections are more restricted in their lateral extent, relative to other layers. This
|
||
supports the notion that L4 responses are dominated by thalamic inputs, while the remaining
|
||
laminae integrate afferents from a wider cortical patch. Second, the primary source of
|
||
inhibition originates from cells in the same layer, reflecting the prevalence of inhibitory
|
||
intralaminar connections. Third, several interlaminar motifs appeared to be general – at least
|
||
in granular cortex: principally, a strong inhibitory connection from L4 onto supragranular
|
||
L2/3 and from infragranular layers onto L4. For more information on the cell-type
|
||
specificity of inhibitory connections, see Yoshimura and Callaway, (2005). Figure 2
|
||
provides a summary of key excitatory and inhibitory intralaminar connections.
|
||
|
||
Microcircuits in the sensorimotor cortex
|
||
|
||
Do the features of visual microcircuits generalize to other cortical areas? Recently, two
|
||
studies have mapped the intrinsic connectivity of mouse sensory and motor cortices: Lefort
|
||
and colleagues (2009) used multiple whole-cell recordings in mouse barrel cortex to
|
||
determine the probability of monosynaptic connections – and the corresponding connection
|
||
strength. As in visual cortex, the strongest connections were intralaminar and the strongest
|
||
interlaminar connections were the ascending L4 to L2, and descending L3 to L5.
|
||
|
||
One puzzle about canonical microcircuits is whether motor cortex has a local circuitry that is
|
||
qualitatively similar to sensory cortex. This question is important because motor cortex lacks
|
||
a clearly defined granular L4 (a property that earns it the name “agranular cortex”). Weiler
|
||
and colleagues combined whole-cell recordings in mouse motor cortex with photo-
|
||
stimulation to uncage Glutamate (Weiler et al., 2008). This allowed them to systematically
|
||
stimulate the cortical column in a grid – centred on the pyramidal neuron from which they
|
||
|
||
Bastos et al.
|
||
Page 3
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
recorded. By recording from pyramidal neurons in L2-6 (L1 lacks pyramidal cells), the
|
||
authors mapped the excitatory influence that each layer exerts over the others. They found
|
||
that the L2/3 to L5A/B was the strongest connection – accounting for one-third of the total
|
||
synaptic current in the circuit. The second strongest interlaminar connection was the
|
||
reciprocal L5A to L2/3 connection. This pathway may be homologous to the prominent
|
||
L4/5A to L2/3 pathway in sensory cortex. Also – as in sensory cortex – recurrent
|
||
(intralaminar) connections were prominent, particularly in L2, L5A/B and L6. The largest
|
||
fraction of synaptic input arrived in L5A/B, consistent with its key role in accumulating
|
||
information from a wide range of afferents – before sending its output to the corticospinal
|
||
tract. In summary, strong input-layer to superficial, and superficial to deep connectivity,
|
||
together with strong intralaminar connectivity, suggests that the intrinsic circuitry of motor
|
||
cortex is similar to other cortical areas.
|
||
|
||
The anatomy and physiology of extrinsic connections
|
||
|
||
Clearly, an account of microcircuits must refer to the layers of origin of extrinsic
|
||
connections and their laminar targets. Although the majority of presynaptic inputs arise from
|
||
intrinsic connections, cortical areas are also richly interconnected, where the balance
|
||
between intrinsic and extrinsic processing mediates functional integration among specialised
|
||
cortical areas (Engel et al., 2010). By numbers alone, intrinsic connections appear to
|
||
dominate – 95% of all neurons labelled with a retrograde tracer lie within about 2 mm of the
|
||
injection site (Markov et al., 2011). The remaining 5% represent cells giving rise to extrinsic
|
||
connections, which – although sparse – can be extremely effective in driving their targets. A
|
||
case in point is the LGN to V1 connection: although it is only the sixth strongest connection
|
||
to V1, LGN afferents have a substantial effect on V1 responses (Markov et al., 2011).
|
||
|
||
Hierarchies and functional asymmetries—Current dogma holds that the cortex is
|
||
hierarchically organized. The idea of a cortical hierarchy rests on the distinction between
|
||
three types of extrinsic connections: feedforward connections, that link an earlier area to a
|
||
higher area, feedback connections, that link a higher to an earlier area, and lateral
|
||
connections, that link areas at the same level (reviewed in Felleman and Van Essen, 1991).
|
||
These connections are distinguished by their laminar origins and targets. Feedforward
|
||
connections originate largely from superficial pyramidal cells and target L4, while feedback
|
||
connections originate largely from deep pyramidal cells and terminate outside of L4
|
||
(Felleman and van Essen 1991). Clearly, this description of cortical hierarchies is a
|
||
simplification and can be nuanced in many ways: for example, as the hierarchical distance
|
||
between two areas increases, the percentage of cells that send feedforward (resp. feedback)
|
||
projections from a lower (resp. higher) level becomes increasingly biased towards the
|
||
superficial (resp. deep) layers (Barone et al., 2000; Vezoli, 2004) .
|
||
|
||
In addition to the laminar specificity of their origins and targets, feedforward and feedback
|
||
connections also differ in their synaptic physiology. The traditional view holds that
|
||
feedforward connections are strong and driving, capable of eliciting spiking activity in their
|
||
targets and conferring classical receptive field properties – the prototypical example being
|
||
the synaptic connection between LGN and V1 (Sherman and Guillery, 1998). Feedback
|
||
connections are thought to modulate (extra-classical) receptive field characteristics
|
||
according to the current context; e.g., visual occlusion, attention, salience, etc. The
|
||
prototypical example of a feedback connection is the cortical L6 to LGN connection.
|
||
Sherman and Guillery identified several properties that distinguish drivers from modulators.
|
||
Driving connections tend to show a strong ionotropic component in their synaptic response,
|
||
evoke large EPSPs, and respond to multiple EPSPs with depressing synaptic effects.
|
||
Modulatory connections produce metabotropic and ionotropic responses when stimulated,
|
||
evoke weak EPSPs, and show paired-pulse facilitation (Sherman and Guillery, 1998; 2011).
|
||
|
||
Bastos et al.
|
||
Page 4
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
These distinctions were based upon the inputs to the LGN, where retinal input is driving and
|
||
cortical input is modulatory. Until recently, little data were available to assess whether a
|
||
similar distinction applies to cortico-cortical feedforward and feedback connections.
|
||
However, recent studies show that cortical feedback connections express not only
|
||
modulatory but also driving characteristics:
|
||
|
||
Are feedback connections driving, modulatory or both?—Although it is generally
|
||
thought that feedback connections are weak and modulatory (Crick and Koch, 1998;
|
||
Sherman and Guillery, 1998), recent evidence suggests that feedback connections do more
|
||
than modulate lower level responses: Sherman and colleagues recorded cells in mouse area
|
||
V1/V2 and A1/A2, while stimulating feedforward or feedback afferents. In both cases,
|
||
driving-like responses as well as modulatory-like responses were observed (Covic and
|
||
Sherman, 2011; De Pasquale and Sherman, 2011). This indicates that – for these
|
||
hierarchically proximate areas – feedback connections can drive their targets just as strongly
|
||
as feedforward connections. This is consistent with earlier studies showing that feedback
|
||
connections can be driving: Mignard and Malpeli (1991) studied the feedback connection
|
||
between areas 18 and 17, while layer A of the LGN was pharmacologically inactivated. This
|
||
silenced the cells in L4 in area 17 but spared activity in superficial layers. However,
|
||
superficial cells were silenced when area 18 was lesioned. This is consistent with a driving
|
||
effect of feedback connections from area 18, in the absence of geniculate input. In summary,
|
||
feedback connections can mediate modulatory and driving effects. This is important from
|
||
the point of view of predictive coding, because top-down predictions have to elicit
|
||
obligatory responses in their targets (cells reporting prediction errors):
|
||
|
||
In predictive coding, feedforward connections convey prediction errors, while feedback
|
||
connections convey predictions from higher cortical areas to suppress prediction errors in
|
||
lower areas. In this scheme, feedback connections should therefore be capable of exerting
|
||
strong (driving) influences on earlier areas to suppress or counter feedforward driving
|
||
inputs. However, as we will see later, these influences also need to exert nonlinear or
|
||
modulatory effects. This is because top-down predictions are necessarily context sensitive:
|
||
e.g., the occlusion of one visual object by another. In short, predictive coding requires
|
||
feedback connections to drive cells in lower levels in a context sensitive fashion, which
|
||
necessitates a modulatory aspect to their postsynaptic effects.
|
||
|
||
Are feedback connections excitatory or inhibitory?—Crucially, because feedback
|
||
connections convey predictions – that serve to explain and thereby reduce prediction errors
|
||
in lower levels – their effective (polysynaptic) connectivity is generally assumed to be
|
||
inhibitory. An overall inhibitory effect of feedback connections is consistent with in vivo
|
||
studies. For example, electrophysiological studies of the mismatch negativity suggest that
|
||
neural responses to deviant stimuli – that violate sensory predictions established by a regular
|
||
stimulus sequence – are enhanced relative to predicted stimuli (Garrido et al., 2009).
|
||
Similarly, violating expectations of auditory repetition causes enhanced gamma-band
|
||
responses in early auditory cortex (Todorovic et al., 2011). These enhanced responses are
|
||
thought to reflect an inability of higher cortical areas to predict, and thereby suppress, the
|
||
activity of populations encoding prediction error (Garrido et al., 2007; Wacongne et al.,
|
||
2011). The suppression of predictable responses can also be regarded as repetition
|
||
suppression, observed in single unit recordings from the inferior temporal cortex of macaque
|
||
monkeys (Desimone, 1996). Furthermore, neurons in monkey inferotemporal cortex respond
|
||
significantly less to a predicted sequence of natural images, compared to an unpredicted
|
||
sequence (Meyer and Olson, 2011).
|
||
|
||
The inhibitory effect of feedback connections is further supported by neuroimaging studies
|
||
(Murray et al., 2002; Murray, 2005; Harrison et al., 2007; Summerfield et al., 2008, 2011;
|
||
|
||
Bastos et al.
|
||
Page 5
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Alink et al., 2010). These studies show that predictable stimuli evoke smaller responses in
|
||
early cortical areas. Crucially, this suppression cannot be explained in terms of local
|
||
adaptation, because the attributes of the stimuli that can be predicted are not represented in
|
||
early sensory cortex (e.g., Harrison et al. 2007). It should be noted that the suppression of
|
||
responses to predictable stimuli can coexist with (top-down) attentional enhancement of
|
||
signal processing (Wyart et al., 2012): in predictive coding, attention is mediated by
|
||
increasing the gain of populations encoding prediction error (Spratling, 2008; Feldman and
|
||
Friston, 2010). The resulting attentional modulation (e.g., Hopfinger et al., 2000) can
|
||
interact with top-down predictions to override their suppressive influence – as demonstrated
|
||
empirically (Kok et al., 2011). See Buschman and Miller, (2007), Saalmann et al., (2007),
|
||
Anderson et al. (2011), and Armstrong et al. (2012) for further discussion of top-down
|
||
connections in attention.
|
||
|
||
Further evidence for the inhibitory (suppressive) effect of feedback connections comes from
|
||
neuropsychology: Patients with damage to the prefrontal cortex (PFC) show disinhibition of
|
||
event related potential responses (ERP) to repeating stimuli (Knight et al., 1989; Yamaguchi
|
||
and Knight, 1990; but see Barceló et al., 2000). In contrast, they show reduced-amplitude
|
||
P300 ERPs in response to novel stimuli – as if there were a failure to communicate top-
|
||
down predictions to sensory cortex (Knight, 1984). Furthermore, normal subjects show a
|
||
rapid adaptation to deviant stimuli as they become predictable – an effect not seen in
|
||
prefrontal patients.
|
||
|
||
Several invasive studies complement these human studies in suggesting an overall inhibitory
|
||
role for feedback connections. In a recent seminal study, Olsen et al. studied corticothalamic
|
||
feedback between L6 of V1 and the LGN using transgenic expression of channel rhodopsin
|
||
in L6 cells of V1. By driving these cells optogenetically – while recording units in V1 and
|
||
the LGN – the authors showed that deep L6 principal cells inhibited their extrinsic targets in
|
||
the LGN and their intrinsic targets in cortical layers 2 to 5 (Olsen et al., 2012). This
|
||
suppression was powerful – in the LGN, visual responses were suppressed by 76%.
|
||
Suppression was also high in V1, around 80-84% (Olsen et al., 2012). This evidence is in
|
||
line with classical studies of corticogeniculate contributions to length tuning in the LGN,
|
||
showing that cortical feedback contributes to the surround suppression of feline LGN cells:
|
||
without feedback, LGN cells are disinhibited and show weaker surround suppression
|
||
(Murphy and Sillito, 1987; Sillito et al., 1993; but see Alitto and Usrey, 2008).
|
||
|
||
While these studies provide convincing evidence that cortical feedback to the LGN is
|
||
inhibitory, the evidence is more complicated for corticocortical feedback connections
|
||
(Sandell and Schiller, 1982; Johnson and Burkhalter, 1996, 1997). Hupé and colleagues
|
||
cooled area V5/MT while recording from areas V1, V2, and V3 in the monkey. When visual
|
||
stimuli were presented in the classical receptive field (CRF), cooling of area V5/MT
|
||
decreased unit activity in earlier areas, suggesting an excitatory effect of extrinsic feedback
|
||
(Hupé et al., 1998). However, when the authors used a stimulus that spanned the extra-
|
||
classical RF the responses of V1 neurons were – on average – enhanced after cooling area
|
||
V5, consistent with the suppressive role of feedback connections. These results indicate that
|
||
the inhibitory effects of feedback connections may depend on (natural) stimuli that require
|
||
integration over the visual field. Similar effects were observed when area V2 was cooled and
|
||
neurons were measured in V1: when stimuli were presented only to the CRF, cooling V2
|
||
decreased V1 spiking activity; however, when stimuli were present in the CRF and the
|
||
surround, cooling V2 increased V1 activity (Bullier et al., 1996). Finally, others have argued
|
||
for an inhibitory effect of feedback based on the timing and spatial extent of surround
|
||
suppression in monkey V1 – concluding that the far surround suppression effects were most
|
||
likely mediated by feedback (Bair et al., 2003).
|
||
|
||
Bastos et al.
|
||
Page 6
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
The empirical finding that feedback connections can both facilitate and suppress firing in
|
||
lower hierarchical areas – depending on the content of classical and extra-classical receptive
|
||
fields – is consistent with predictive coding: Rao and Ballard (1999) trained a hierarchical
|
||
predictive coding network to recognise natural images. They showed that higher levels in
|
||
the hierarchy learn to predict visual features that extend across many CRFs in the lower
|
||
levels (e.g. tree trunks or horizons). Hence, higher visual areas come to predict that visual
|
||
stimuli will span the receptive fields of cells in lower visual areas. In this setting, a stimulus
|
||
that is confined to a CRF would elicit a strong prediction error signal (because it cannot be
|
||
predicted). This provides a simple explanation for the findings of Hupé et al (1998) and
|
||
Bullier et al (1996): when feedback connections are deactivated, there are no top-down
|
||
predictions to explain responses in lower areas – leading to a disinhibition of responses in
|
||
earlier areas, when – and only when – stimuli can be predicted over multiple CRFs.
|
||
|
||
Feedback connections and layer 1—How might the inhibitory effect of feedback
|
||
connections be mediated? The established view is that extrinsic corticocortical connections
|
||
are exclusively excitatory (using glutamate as their excitatory neurotransmitter) – although
|
||
recent evidence suggests that inhibitory extrinsic connections exist and may play an
|
||
important role in synchronizing distant regions (Melzer et al., 2012). However, one
|
||
important route by which feedback connections could mediate selective inhibition is via
|
||
their termination in L1 (Anderson and Martin, 2006; Shipp, 2007): layer 1 is sometimes
|
||
referred to as acellular due to its pale appearance with Nissl staining (the classical method
|
||
for separating layers that selectively labels cell bodies). Indeed, a recent study concluded
|
||
that L1 contains less than 0.5% of all cells in a cortical column (Meyer et al., 2011). These
|
||
L1 cells are almost all inhibitory and interconnect strongly with each other, via electrical
|
||
connections and chemical synapses (Chu et al., 2003). Simultaneous whole cell patch clamp
|
||
recordings show that they provide strong monosynaptic inhibition to L2/3 pyramidal cells,
|
||
whose apical dendrites project into L1 (Chu et al., 2003; Wozny and Williams, 2011). This
|
||
means L1 inhibitory cells are in a prime position to mediate inhibitory effects of extrinsic
|
||
feedback. The laminar location highlighted by these studies – the bottom of L1 and the top
|
||
of L2/3 – has recently been shown to be a “hotspot” of inhibition in the column (Meyer et
|
||
al., 2011). Indeed, a study of rat barrel cortex – that stimulated (and inactivated) L1 –
|
||
showed that it exerts a powerful inhibitory effect on whisker-evoked responses (Shlosberg et
|
||
al., 2006). These studies suggest that corticocortical feedback connections could deliver
|
||
strong inhibition, if they were to recruit the inhibitory potential of L1.
|
||
|
||
In terms of the excitatory and modulatory effect of feedback connections, predictive input
|
||
from higher cortical areas might have an important impact via the distal dendrites of
|
||
pyramidal neurons (Larkum et al., 2009). Furthermore, there is a specific type of
|
||
GABAergic neuron that appears to control distal dendritic excitability, gating top down
|
||
excitatory signals differentially during behavior (Gentet et al., 2012). Table 1 summarises
|
||
the studies we have discussed in relation to the role of feedback connections.
|
||
|
||
Feedforward and transthalamic connections
|
||
|
||
While the evidence for an inhibitory effect of feedback connections has to be evaluated
|
||
carefully, the evidence for an excitatory effect of feedforward connections is unequivocal.
|
||
For example, in the monkey, V1 projects monosynaptically to V2, V3, V3a, V4, and V5/MT
|
||
(Zeki, 1978; Zeki and Shipp, 1988). In all cases – when V1 is reversibly inactivated through
|
||
cooling – single-cell activity in target areas is strongly suppressed (Girard and Bullier, 1989;
|
||
Girard et al., 1991a, 1991b, 1992). In the cases of V2 and V3, the result of cooling area V1
|
||
is a near-total silencing of single unit activity. These studies illustrate that activity in higher
|
||
cortical areas depends on driving inputs from earlier cortical areas that establish their
|
||
receptive field properties.
|
||
|
||
Bastos et al.
|
||
Page 7
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Finally, while many studies have focused on extrinsic connections that project directly from
|
||
one cortical area to the next, there is mounting evidence that feedforward driving
|
||
connections (and perhaps feedback) in the cortex could be mediated by transthalamic
|
||
pathways (Sherman and Guillery, 1998, 2011). The strongest evidence for this claim comes
|
||
from the somatosensory system, where it was shown recently that the posterior medial
|
||
nucleus of the thalamus (POm) – a higher-order thalamic nucleus that receives direct input
|
||
from cortex – can relay information between S1 and S2 (Theyel et al., 2009). In addition, the
|
||
thalamic reticular nucleus has been proposed to mediate the inhibition that might underlie
|
||
cross-modal attention or top-down predictions (Yamaguchi and Knight, 1990; Crick, 1984;
|
||
Wurtz et al., 2011). Furthermore, computational considerations and recent experimental
|
||
findings point to a potentially important role for higher-order thalamic nuclei in coordinating
|
||
and synchronizing cortical responses (Vicente et al., 2008; Saalmann et al., 2012). The
|
||
degree to which cortical areas are integrated directly via corticocortical or indirectly via
|
||
cortico-thalamo-cortical connections – and the extent to which transthalamic pathways
|
||
dissociate feedforward from feedback connections in the same way as we have proposed for
|
||
the cortico-cortical connections – are open questions.
|
||
|
||
The canonical microcircuit
|
||
|
||
Central to the idea of a canonical microcircuit is the notion that a cortical column contains
|
||
the circuitry necessary to perform requisite computations, and that these circuits can be
|
||
replicated with minor variations throughout the cortex. One of the clearest examples of how
|
||
cortical circuits process simple inputs – to generate complex outputs – is the emergence of
|
||
orientation tuning in V1. Orientation tuning is a distinctly cortical phenomenon because
|
||
geniculocortical relay cells show no orientation preferences. A further elaboration of cortical
|
||
responses can be found in the distinction between simple and complex cells – while simple
|
||
cells possess spatially confined receptive fields, complex cells are orientation-tuned but
|
||
show less preference for the location of an oriented bar. Hubel and Wiesel proposed a model
|
||
for how intrinsic and extrinsic connectivity could establish a circuit explaining these
|
||
receptive field properties. They proposed that orientation tuning in simple cells could be
|
||
generated by a single cortical cell receiving input from several ON centre – OFF surround
|
||
geniculate cells arranged along a particular orientation, thereby endowing it with a
|
||
preference for bars oriented in a particular direction (Hubel and Wiesel, 1962). Complex
|
||
cells were hypothesized to receive inputs from several simple cells – with the same
|
||
orientation preference and slightly varying receptive field locations. Thus, complex cells
|
||
were thought not to receive direct LGN input but to be higher order cells in cortex.
|
||
Subsequent findings supported these predictions, showing that input layer 4Cα and 4Cβ
|
||
contained the largest proportion of cells receiving monosynaptic geniculate input, while
|
||
superficial and deep layer cells contain a larger number of cells receiving disynaptic or
|
||
polysynaptic input (Bullier and Henry, 1980). Furthermore, simple cells project
|
||
monosynaptically onto complex cells, where they exert a strong feedforward influence
|
||
(Alonso and Martinez, 1998; Alonso, 2002). These models suggest that intrinsic cortical
|
||
circuitry allows processing to proceed along discrete steps that are capable of producing
|
||
response properties in outputs that are not present in inputs.
|
||
|
||
Segregation of processing streams
|
||
|
||
A key property of canonical circuits is the segregation of parallel streams of processing. For
|
||
example, in primates, parvocellular input enters the cortex primarily in layer 4Cβ, whereas
|
||
magnocellular inputs enter in 4Cα. The corticogeniculate feedback pathway from L6
|
||
maintains this segregation, as upper L6 cells preferentially synapse onto parvocellular cells
|
||
in the LGN, while lower L6 cells target the magnocellular LGN layers (Fitzpatrick et al.,
|
||
1994; Briggs and Usrey, 2009). Further examples of stream segregation are also present in
|
||
|
||
Bastos et al.
|
||
Page 8
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
the dorsal “where” and the ventral “what” pathways, and in the projection from V1 to the
|
||
thick, thin, and inter-stripe regions of V2 (Zeki and Shipp, 1988; Sincich and Horton, 2005).
|
||
|
||
Superficial and deep layers are anatomically interconnected, but mounting evidence suggests
|
||
that they constitute functionally distinct processing streams: in an elegant experiment,
|
||
Roopun et al. (2006) showed that L2/3 of rat somatomotor cortex shows prominent gamma
|
||
oscillations that are co-expressed with beta oscillations in L5. Both rhythms persisted, when
|
||
superficial and deep layers were disconnected at the level of L4. Maier and colleagues
|
||
(2010) used multilaminar recordings to show strong LFP coherence amongst sites within the
|
||
superficial layers (the superficial compartment), as well as strong coherence amongst sites in
|
||
deep layers (the deep compartment), but weak inter-compartment coherence. These studies
|
||
indicate a segregation of – potentially autonomous – supragranular and infragranular
|
||
dynamics. Maier et al., found that supragranular sites had higher broadband gamma power
|
||
than infragranular sites. This pattern was reversed in the alpha and beta range; with greater
|
||
power in the infragranular and granular layers. Finally, the spiking activity of neurons in the
|
||
superficial layers of visual cortex are more coherent with gamma frequency oscillations in
|
||
the local field potential, while neurons in deep layers are more coherent with alpha
|
||
frequency oscillations (Buffalo et al., 2011). This finding is consistent with an earlier study
|
||
by Livingstone (1996) showing that 50% of cells in L2/3 of squirrel monkey V1 expressed
|
||
gamma oscillations, compared to less than 20% of cells in L4C and infragranular layers. The
|
||
different spectral behaviour of superficial and deep layers has led to the interesting proposal
|
||
that feedforward and feedback signalling may be mediated by distinct (high and low)
|
||
frequencies (reviewed in Wang, 2010, see also Buschman and Miller, 2007 in the context of
|
||
attention), a proposal that has recently received experimental support - at least for the
|
||
feedforward connections (Bosman et al., 2012; see also Gregoriou et al., 2009).
|
||
|
||
Integration and segregation within canonical circuits—Given this functional and
|
||
anatomical segregation into parallel streams, the question naturally arises, how are these
|
||
streams integrated? It has been previously suggested that integration occurs through the
|
||
synchronized firing of multiple neurons that form a neural ensemble (Gray et al., 1989;
|
||
Singer, 1999), while others have emphasized inter-areal phase-synchronization or coherence
|
||
(Varela et al., 2001; Fries, 2005; Fujisawa and Buzsáki, 2011). While a full treatment of this
|
||
question is beyond the scope of the current review, we propose that the canonical
|
||
microcircuit contains a clue for how the dialectic between segregation and integration might
|
||
be resolved. While top-down and bottom-up inputs and outputs may be segregated in layers,
|
||
streams, and frequency bands, the canonical microcircuit specifies the circuitry for how the
|
||
basic units of cortex are interconnected and therefore how the intrinsic activity of the
|
||
cortical column is entrained by extrinsic inputs. This intrinsic connectivity specifies how the
|
||
cells of origin and termination of extrinsic projections are interconnected, and thus
|
||
determine how top-down and bottom-up streams are integrated within each cortical column.
|
||
|
||
Spatial segregation and cortical columns
|
||
|
||
The notion of a canonical microcircuit implicitly assumes that each circuit is distinct from
|
||
its neighbours; that could presumably carry out computations in parallel. Therefore, the
|
||
canonical microcircuit specifies the spatial scale over which processing is integrated. The
|
||
most likely candidate for this spatial scale is the cortical column – that can vary over three
|
||
orders of magnitude between minicolumns, columns, and hypercolumns. Minicolumns are
|
||
only a few cells wide, estimated to be about 50-60 micrometers in diameter by Mountcastle
|
||
(1997) and are seen in Nissl sections of cortex as slight variations in cell density.
|
||
Minicolumns were originally proposed as elementary units of cortex by Lorente de No
|
||
(1949) and appear to reflect the migration of cells from the ventricular zone to the cortical
|
||
sheet during foetal development (reviewed in Horton and Adams, 2005). Hubel and Wiesel
|
||
|
||
Bastos et al.
|
||
Page 9
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
estimated that orientation columns were on this order of magnitude, about 25-50
|
||
micrometers wide, although they failed to establish a correspondence between orientation
|
||
columns observed physiologically and the minicolumns seen in Nissl sections (Hubel and
|
||
Wiesel, 1974). A cortical column was classically defined as a vertical alignment of cells
|
||
containing neurons with similar receptive field properties, such as orientation preference and
|
||
ocular dominance in V1; or touch in somatosensory cortex (Mountcastle, 1957; Hubel and
|
||
Wiesel, 1972). These columns were suggested by Mountcastle to encompass a number of
|
||
minicolumns, with a width of 300-400 micrometers (Mountcastle, 1997). Finally, Hubel and
|
||
Wiesel defined a hypercolumn to be the unit of cortex necessary to traverse all possible
|
||
values of a particular receptive field property, such as orientation or eye dominance –
|
||
estimated to be between 0.5 to 1 mm wide (Hubel and Wiesel, 1974).
|
||
|
||
Columns, connections and computations—So is the cortical column the basic unit
|
||
of cortical computation? Some authors emphasize that even within a dendrite, there are all
|
||
the necessary biophysical mechanisms for performing surprisingly advanced computations,
|
||
such as direction selectivity, coincidence detection, or temporal integration (Häusser and
|
||
Mel, 2003; London and Häusser, 2005). Others argue that single neurons can process their
|
||
inputs at the dendrite, soma, and initial segment, such that the output spike trains of just two
|
||
interconnected cells could mediate computations like independent components analysis
|
||
(Klampfl et al., 2009). Others posit that cortical columns form the basic computational unit
|
||
(Mountcastle, 1997; Hubel and Wiesel, 1972) but see (Horton and Adams, 2005). Donald
|
||
Hebb proposed that neurons distributed over several cortical areas could form a functional
|
||
computational unit called a neural assembly (Hebb, 1949). This view has re-emerged in
|
||
recent years, with the development of the requisite recording and analytic techniques for
|
||
evaluating this proposal (Buzsáki, 2010; Canolty et al., 2010; Singer et al., 1997; Lopes-dos-
|
||
Santos et al., 2011).
|
||
|
||
Computational modelling studies indicate that cortical columns with structured connectivity
|
||
are computationally more efficient than a network containing the same number of neurons
|
||
but with random connectivity (Haeusler and Maass, 2007). Others suggest that this circuitry
|
||
allows the cortex to organize and integrate bottom-up, lateral, and top-down information
|
||
(Ullman, 1995; Raizada and Grossberg, 2003). Douglas and Martin suggest that the rich
|
||
anatomical connectivity of L2/3 pyramidal cells allows them to collect information from
|
||
top-down, lateral, and bottom-up inputs, and – through processing in the dendritic tree –
|
||
select the most likely interpretation of its inputs. More recently, George and Hawkins have
|
||
suggested that the canonical microcircuit implements a form of Bayesian processing
|
||
(George and Hawkins, 2009). In the following section, we pursue similar ideas, but ground
|
||
them in the framework of predictive coding, and propose a cortical circuit that could
|
||
implement predictive coding through canonical interconnections. In particular, we find that
|
||
the proposed circuitry agrees remarkably well with quantitative characterisations of the
|
||
canonical microcircuit (Haeusler and Maass, 2007).
|
||
|
||
A canonical microcircuit for predictive coding
|
||
|
||
This section considers the computational role of cortical microcircuitry in more detail. We
|
||
try to show that the computations performed by canonical microcircuits can be specified
|
||
more precisely than one might imagine and that these computations can be understood
|
||
within the framework of predictive coding. In brief, we will show that (hierarchical
|
||
Bayesian) inference about the causes of sensory input can be cast as predictive coding. This
|
||
is important because it provides formal constraints on the dynamics one would expect to
|
||
find in neuronal circuits. Having established these constraints, we then attempt to match
|
||
them with the neurobiological constraints afforded by the canonical microcircuit. The
|
||
endpoint of this exercise is a canonical microcircuit for predictive coding.
|
||
|
||
Bastos et al.
|
||
Page 10
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Predictive coding and the free energy principle
|
||
|
||
It might be thought impossible to specify the computations performed by the brain.
|
||
However, there are some fairly fundamental constraints on the basic form of neuronal
|
||
dynamics. The argument goes as follows – and can be regarded as a brief summary of the
|
||
free energy principle (see Friston, 2010 for details):
|
||
|
||
•
|
||
Biological systems are homoeostatic (or allostatic), which means that they
|
||
minimise the dispersion (entropy) of their interoceptive and exteroceptive states.
|
||
|
||
•
|
||
Entropy is the average of surprise over time, which means biological systems
|
||
minimise the surprise associated with their sensory states at each point in time.
|
||
|
||
•
|
||
In statistics, surprise is the negative logarithm of Bayesian model evidence, which
|
||
means biological systems – like the brain – must continually maximise the
|
||
Bayesian evidence for their (generative) model of sensory inputs.
|
||
|
||
•
|
||
Maximising Bayesian model evidence corresponds to Bayesian filtering of sensory
|
||
inputs. This is also known as predictive coding.
|
||
|
||
These arguments mean that by minimising surprise, through selecting appropriate
|
||
sensations, the brain is implicitly maximising the evidence for its own existence – this is
|
||
known as active inference. In other words, to maintain a homoeostasis the brain must predict
|
||
its sensory states on the basis of a model. Fulfilling those predictions corresponds to
|
||
accumulating evidence for that model – and the brain that embodies it. The implicit
|
||
maximisation of Bayesian model evidence provides an important link to the Bayesian brain
|
||
hypothesis (Hinton and van Camp, 1993; Dayan et al., 1995; Knill and Pouget, 2004) and
|
||
many other compelling proposals about perceptual synthesis – including analysis by
|
||
synthesis (Neisser, 1967; Yuille and Kersten, 2006) epistemological automata (MacKay,
|
||
1956), the principle of minimum redundancy (Attneave, 1954; Barlow, H.B., 1961; Dan et
|
||
al., 1996), the Infomax principle (Linsker, 1990; Atick, 1992; Kay and Phillips, 2011), and
|
||
perception as hypothesis testing (Gregory, 1968; 1980).
|
||
|
||
The most popular scheme – for Bayesian filtering in neuronal circuits – is predictive coding
|
||
(Srinivasan et al., 1982; Buchsbaum and Gottschalk, 1983; Rao and Ballard, 1999). In this
|
||
context, surprise corresponds (roughly) to prediction error. In predictive coding, top-down
|
||
predictions are compared with bottom-up sensory information to form a prediction error.
|
||
This prediction error is used to update higher-level representations – upon which top-down
|
||
predictions are based. These optimised predictions then reduce prediction error at lower
|
||
levels.
|
||
|
||
To predict sensations, the brain must be equipped with a generative model of how its
|
||
sensations are caused (Helmholtz, 1860). Indeed, this led Geoffrey Hinton and colleagues to
|
||
propose that the brain is an inference (Helmholtz) machine (Hinton and Zemel, 1994; Dayan
|
||
et al., 1995). A generative model describes how variables or causes in the environment
|
||
conspire to produce sensory input. Generative models map from (hidden) causes to (sensory)
|
||
consequences. Perception then corresponds to the inverse mapping from sensations to their
|
||
causes, while action can be thought of as the selective sampling of sensations. Crucially, the
|
||
form of the generative model dictates the form of the inversion – for example, predictive
|
||
coding. Figure 3 depicts a general model as a probabilistic graphical model. A special case
|
||
of these models are hierarchical dynamic models (see Figure 4), which grandfather most
|
||
parametric models in statistics and machine learning (see Friston, 2008). These models
|
||
explain sensory data in terms of hidden causes and states. Hidden causes and states are both
|
||
hidden variables that cause sensations but they play slightly different roles: hidden causes
|
||
link different levels of the model and mediate conditional dependencies among hidden states
|
||
at each level. Conversely, hidden states model conditional dependencies over time (i.e.,
|
||
|
||
Bastos et al.
|
||
Page 11
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
memory) by modelling dynamics in the world. In short, hidden causes and states mediate
|
||
structural and dynamic dependencies respectively.
|
||
|
||
The details of the graph in Figure 3 are not important; it just provides a way of describing
|
||
conditional dependencies among hidden states and causes responsible for generating sensory
|
||
input. These dependencies mean that we can interpret neuronal activity as message passing
|
||
among the nodes of a generative model, where each canonical microcircuit contains
|
||
representations or expectations about hidden states and causes. In other words, the form of
|
||
the underlying generative model defines the form of the predictive coding architecture used
|
||
to invert the model. This is illustrated in Figure 4, where each node has a single parent. We
|
||
will deal with this simple sort of model because it lends itself to an unambiguous description
|
||
in terms of bottom-up (feedforward) and top-down (feedback) message passing. We now
|
||
look at how perception or model inversion – recovering the hidden states and causes of this
|
||
model given sensory data – might be implemented at the level of a microcircuit:
|
||
|
||
Predictive coding and message passing
|
||
|
||
In predictive coding, representations (or conditional expectations) generate top-down
|
||
predictions to produce prediction errors. These prediction errors are then passed up the
|
||
hierarchy in the reverse direction, to update conditional expectations. This ensures an
|
||
accurate prediction of sensory input and all its intermediate representations. This hierarchal
|
||
message passing can be expressed mathematically as a gradient descent on the (sum of
|
||
squared) prediction errors
|
||
, where the prediction errors are weighted by their
|
||
precision (inverse variance).
|
||
|
||
(1)
|
||
|
||
The first pair of equalities just says that conditional expectations about hidden causes and
|
||
|
||
states
|
||
are updated based upon the way we would predict them to change – the first
|
||
term – and subsequent terms that minimise prediction error. The second pair of equations
|
||
|
||
simply expresses prediction error
|
||
as the difference between conditional
|
||
expectations about hidden causes and (the changes in) hidden states and their predicted
|
||
|
||
values, weighed by their precisions
|
||
. These predictions are nonlinear functions of
|
||
conditional expectations (g(i), f(i)) at each level of the hierarchy and the level above.
|
||
|
||
It is difficult to overstate the generality and importance of Equation (1) – it grandfathers
|
||
nearly every known statistical estimation scheme, under parametric assumptions about
|
||
additive noise. These range from ordinary least squares to advanced Bayesian filtering
|
||
schemes (see Friston 2008). In this general setting, Equation (1) minimises variational free
|
||
energy and corresponds to generalised predictive coding. Under linear models, it reduces to
|
||
linear predictive coding, also known as Kalman-Bucy filtering (see Friston et al, 2010 for
|
||
details).
|
||
|
||
In neuronal network terms, Equation (1) says that prediction error units receive messages
|
||
from the same level and the level above. This is because the hierarchical form of the model
|
||
only requires conditional expectations from neighbouring levels to form prediction errors, as
|
||
can be seen schematically in Figure 4. Conversely, expectations are driven by prediction
|
||
|
||
Bastos et al.
|
||
Page 12
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
error from the same level and the level below – updating expectations about hidden states
|
||
and causes respectively. These constitute the bottom-up and lateral messages that drive
|
||
conditional expectations to provide better predictions – or representations – that suppress
|
||
prediction error. This updating corresponds to an accumulation of prediction errors – in that
|
||
the rate of change of conditional expectations is proportional to prediction error.
|
||
Electrophysiologically, this means that one would expect to see a transient prediction error
|
||
response to bottom-up afferents (in neuronal populations encoding prediction error) that is
|
||
suppressed to baseline firing rates by sustained responses (in neuronal populations encoding
|
||
predictions). This is the essence of recurrent message passing between hierarchical levels to
|
||
suppress prediction error (see Friston 2008 for a more detailed discussion).
|
||
|
||
The nature of this message passing is remarkably consistent with the anatomical and
|
||
physiological features of cortical hierarchies. An important prediction is that the nonlinear
|
||
functions of the generative model – modelling context sensitive dependencies among hidden
|
||
variables – appear only in the top-down and lateral predictions. This means,
|
||
neurobiologically, we would predict feedback connections to possess nonlinear or
|
||
neuromodulatory characteristics; in contrast to feedforward connections that mediate a linear
|
||
mixture of prediction errors. This functional asymmetry is exactly consistent with the
|
||
empirical evidence reviewed above. Another key feature of Equation (1) is that the top-
|
||
down predictions produce prediction errors through subtraction. In other words, feedback
|
||
connections should exert inhibitory effects, of the sort seen empirically. Table 2 summarises
|
||
the features of extrinsic connectivity (reviewed in the previous section) that are explained by
|
||
predictive coding. In the remainder of this review, we focus on intrinsic connections and
|
||
cortical microcircuits.
|
||
|
||
The cortical microcircuit and predictive coding
|
||
|
||
We now try to associate the variables in Equation (1) with specific populations in the
|
||
canonical microcircuit. Figure 5 illustrates a remarkable correspondence between the form
|
||
of Equation (1) and the connectivity of the canonical microcircuit. Furthermore, the
|
||
resulting scheme corresponds almost exactly to the computational architecture proposed by
|
||
Mumford (1992). This correspondence rests upon the following intuitive steps:
|
||
|
||
•
|
||
First, we divide the excitatory cells in the superficial and deep layers into principal
|
||
(pyramidal) cells and excitatory interneurons. This accommodates the fact that (in
|
||
macaque V1) a significant percentage of superficial L2/3 cells (about half) and
|
||
deep L5 excitatory cells (about 80%) do not project outside the cortical column
|
||
(Callaway and Wiser, 1996; Briggs and Callaway, 2005).
|
||
|
||
•
|
||
Second, we know that the superficial and deep pyramidal cells provide feedforward
|
||
and feedback connections respectively. This means that superficial pyramidal cells
|
||
|
||
must encode and broadcast prediction errors on hidden causes
|
||
, while deep
|
||
|
||
pyramidal cells must encode conditional expectations
|
||
so that they can
|
||
elaborate feedback predictions.
|
||
|
||
•
|
||
Third, we know that the (spiny stellate) excitatory cells in the granular layer receive
|
||
|
||
feedforward connections encoding prediction errors
|
||
on the hidden causes of the
|
||
level below.
|
||
|
||
•
|
||
This leaves the inhibitory interneurons in the granular layer, which, for symmetry,
|
||
we associate with prediction errors on the hidden states.
|
||
|
||
•
|
||
The remaining populations are the excitatory and inhibitory interneurons in the
|
||
supragranular layer, to which we assign expectations about hidden causes and
|
||
states respectively. These are mapped through descending (intrinsic) feedforward
|
||
|
||
Bastos et al.
|
||
Page 13
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
connections to cells in the deep layers that generate predictions. We do not suppose
|
||
that this is a simple one-to-one mapping – rather it mediates the nonlinear
|
||
transformation of expectations to predictions required by the earlier cortical level.
|
||
|
||
This arrangement accommodates the fact that the dependencies among hidden states are
|
||
confined to each node (by the nature of graphical models), which means that their
|
||
expectations and prediction errors should be encoded by interneurons. Furthermore, the
|
||
splitting of excitatory cells in the upper layers into two populations (encoding expectations
|
||
and prediction errors on hidden causes) is sensible, because there is a one-to-one mapping
|
||
between the expectations on hidden causes and their prediction errors.
|
||
|
||
The ensuing architecture bears a striking correspondence to the microcircuit in (Haeusler
|
||
and Maass, 2007) in the left panel of Figure 5 – in the sense that nearly every connection
|
||
required by the predictive coding scheme appears to be present in terms of quantitative
|
||
measures of intrinsic connectivity. However, there are two exceptions that both involve
|
||
connections to the inhibitory cells in the granular layer (shown as dotted lines in Figure 5).
|
||
Predictive coding requires that these cells (that encode prediction errors on hidden states)
|
||
compare the expected changes in hidden states with the actual changes. This suggests that
|
||
there should be interlaminar projections from supragranular (inhibitory) and infragranular
|
||
(excitatory) cells. In terms of their synaptic characteristics, one would predict that these
|
||
intrinsic connections would be of a feedback sort – in the sense that they convey predictions.
|
||
Although not considered in this Haeusler and Maass scheme, feedback connections from
|
||
infragranular layers are an established component of the canonical microcircuit (see Figure
|
||
2).
|
||
|
||
Functional asymmetries in the microcircuit
|
||
|
||
The circuitry in Figure 5 appears consistent with the broad scheme of ascending
|
||
(feedforward) and descending (feedback) intrinsic connections: feedforward prediction
|
||
errors from a lower cortical level arrive at granular layers and are passed forward to
|
||
excitatory and inhibitory interneurons in supragranular layers, encoding expectations. Strong
|
||
and reciprocal intralaminar connections couple superficial excitatory interneurons and
|
||
pyramidal cells. Excitatory and inhibitory interneurons in supragranular layers then send
|
||
strong feedforward connections to the infragranular layer. These connections enable deep
|
||
pyramidal cells and excitatory interneurons to produce (feedback) predictions, which ascend
|
||
back to L4 or descend to a lower hierarchical level. This arrangement recapitulates the
|
||
functional asymmetries between extrinsic feedforward and feedback connections and is
|
||
consistent with the empirical characteristics of intrinsic connections.
|
||
|
||
If we focus on the superficial and deep pyramidal cells, the form of the recognition
|
||
dynamics in Equation (1) tells us something quite fundamental: We would anticipate higher
|
||
frequencies in the superficial pyramidal cells, relative to the deep pyramidal cells. One can
|
||
see this easily by taking the Fourier transform of the first equality in Equation (1)
|
||
|
||
(2)
|
||
|
||
This equation says that the contribution of any (angular) frequency ω in the prediction errors
|
||
(encoded by superficial pyramidal cells) to the expectations (encoded by the deep pyramidal
|
||
cells) is suppressed in proportion to that frequency (Friston 2008). In other words, high
|
||
frequencies should be attenuated when passing from superficial to deep pyramidal cells.
|
||
There is nothing mysterious about this attenuation – it is a simple consequence of the fact
|
||
that conditional expectations accumulate prediction errors, thereby suppressing high-
|
||
frequency fluctuations to produce smooth estimates of hidden causes. This smoothing –
|
||
|
||
Bastos et al.
|
||
Page 14
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
inherent in Bayesian filtering – leads to an asymmetry in frequency content of superficial
|
||
and deep cells: for example, superficial cells should express more gamma relative to beta,
|
||
and deep cells should express more beta relative to gamma (Roopun et al., 2006, 2008;
|
||
Maier et al., 2010).
|
||
|
||
Figure 6 provides a schematic illustration of the spectral asymmetry predicted by Equation
|
||
2. Note that predictions about the relative amplitudes of high and low frequencies in
|
||
superficial and deep layers pertain to all frequencies – there is nothing in predictive coding
|
||
per se to suggest characteristic frequencies in the gamma and beta ranges. However, one
|
||
might speculate the characteristic frequencies of canonical microcircuits have evolved to
|
||
model and – through active inference – create the sensorium (Friston, 2010; Berkes et al.,
|
||
2011; Engbert et al., 2011). Indeed, there is empirical evidence to support this notion; in the
|
||
visual (Lakatos et al., 2008; Meirovithz et al., 2012; Bosman et al., 2009) and motor domain
|
||
(Gwin and Ferris, 2012).
|
||
|
||
In summary, predictions are formed by a linear accumulation of prediction errors.
|
||
Conversely, prediction errors are nonlinear functions of predictions. This means that the
|
||
conversion of prediction errors into predictions (Bayesian filtering) necessarily entails a loss
|
||
of high frequencies. However, the nonlinearity in the mapping from predictions to prediction
|
||
errors means that high frequencies can be created (consider the effect of squaring a sine
|
||
wave, which would convert beta into gamma). In short, prediction errors should express
|
||
higher frequencies than the predictions that accumulate them. This is another example of a
|
||
potentially important functional asymmetry between feedforward and feedback message
|
||
passing that emerges under predictive coding. It is particularly interesting given recent
|
||
evidence that feedforward connections may use higher frequencies than feedback
|
||
connections (Bosman et al., 2012).
|
||
|
||
Conclusion
|
||
|
||
In conclusion, there is a remarkable correspondence between the anatomy and physiology of
|
||
the canonical microcircuit and the formal constraints implied by generalised predictive
|
||
coding. Having said this, there are many variations on the mapping between computational
|
||
and neuronal architectures: Even if predictive coding is an appropriate implementation of
|
||
Bayesian filtering, there are many variations on the arrangement shown in Figure 5. For
|
||
example, feedback connections could arise directly from cells encoding conditional
|
||
expectations in supragranular layers. Indeed, there is emerging evidence that feedback
|
||
connections between proximate hierarchical levels originate from both deep and superficial
|
||
layers (Markov et al 2011). Note that this putative splitting of extrinsic streams is only
|
||
predicted in the light of empirical constraints on intrinsic connectivity.
|
||
|
||
One of our motivations – for considering formal constraints on connectivity – was to
|
||
produce dynamic causal models of canonical microcircuits. Dynamic causal modelling
|
||
enables one to compare different connectivity models, using empirical electrophysiological
|
||
responses (David et al, 2006; Moran et al, 2008, 2011). This form of modelling rests upon
|
||
Bayesian model comparison and allows one to assess the evidence for one microcircuit
|
||
relative to another. In principle, this provides a way to evaluate different microcircuit
|
||
models, in terms of their ability to explain observed activity. One might imagine that the
|
||
particular circuits for predictive coding presented in this paper will be nuanced as more
|
||
anatomical and physiological information becomes available. The ability to compare
|
||
competing models or microcircuits – using optogenetics, local field potentials and
|
||
electroencephalography – may be important for refining neurobiologically informed
|
||
microcircuits. In short, many of the predictions and assumptions we have made about the
|
||
specific form of the microcircuit for predictive coding may be testable in the near future.
|
||
|
||
Bastos et al.
|
||
Page 15
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Acknowledgments
|
||
|
||
This work was supported by the Wellcome Trust and the NSF Graduate Research Fellowship under Grant No.
|
||
2009090358 to A.M.B. Support was also provided by NIH grants MH055714 (G.R.M.) and EY013588 (W.M.U.),
|
||
and NSF grant 1228535 (G.R.M and W.M.U). The authors would like to thank Julien Vezoli, Will Penny, Dimitris
|
||
Pinotsis, Stewart Shipp, Vladimir Litvak, Conrado Bosman, Laurent Perrinet and Henry Kennedy for helpful
|
||
discussions. We would also like to thank our reviewers for helpful comments and guidance.
|
||
|
||
References
|
||
|
||
Alink A, Schwiedrzik CM, Kohler A, Singer W, Muckli L. Stimulus predictability reduces responses
|
||
in primary visual cortex. J. Neurosci. 2010; 30:2960–2966. [PubMed: 20181593]
|
||
Alitto HJ, Usrey WM. Origin and dynamics of extraclassical suppression in the lateral geniculate
|
||
nucleus of the macaque monkey. Neuron. 2008; 57:135–146. [PubMed: 18184570]
|
||
Alonso JM. Book Review: Neural Connections and Receptive Field Properties in the Primary Visual
|
||
Cortex. The Neuroscientist. 2002; 8:443. [PubMed: 12374429]
|
||
Alonso JM, Martinez LM. Functional connectivity between simple cells and complex cells in cat
|
||
striate cortex. Nat. Neurosci. 1998; 1:395–403. [PubMed: 10196530]
|
||
Anderson JC, Kennedy H, Martin KA. Pathways of Attention: Synaptic Relationships of Frontal Eye
|
||
Field to V4, Lateral Intraparietal Cortex, and Area 46 in Macaque Monkey. The Journal of
|
||
Neuroscience. 2011; 31:10872. [PubMed: 21795539]
|
||
Anderson JC, Martin KAC. Synaptic connection from cortical area V4 to V2 in macaque monkey. The
|
||
Journal of Comparative Neurology. 2006; 495:709–721. [PubMed: 16506191]
|
||
Armstrong, KM.; Schafer, RJ.; Chang, MH.; Moore, T. Attention and action in the frontal eye field. In
|
||
The Neuroscience of Attention: Attentional Control and Selection. Mangun, GR., editor. Oxford
|
||
University Press; New York: 2012. p. 151-166.
|
||
Arnal LH, Wyart V, Giraud A-L. Transitions in neural oscillations reflect prediction errors generated
|
||
in audiovisual speech. Nature Neuroscience. 2011; 14:797–801.
|
||
Atick JJ. Could information theory provide an ecological theory of sensory processing? Network:
|
||
Computation in Neural Systems. 1992; 3:213–251.
|
||
Attneave F. Some informational aspects of visual perception. Psychol Rev. 1954; 61:183–193.
|
||
[PubMed: 13167245]
|
||
Bair W, Cavanaugh JR, Movshon JA. Time course and time-distance relationships for surround
|
||
suppression in macaque V1 neurons. The Journal of Neuroscience. 2003; 23:7690. [PubMed:
|
||
12930809]
|
||
Barceló F, Suwazono S, Knight RT. Prefrontal modulation of visual processing in humans. Nat.
|
||
Neurosci. 2000; 3:399–403. [PubMed: 10725931]
|
||
Barlow, HB. Possible principles underlying the transformations of sensory messages.. In: Rosenblith,
|
||
WA., editor. Sensory Communication. MIT Press; Cambridge, MA: 1961. p. 217-234.
|
||
Barone P, Batardiere A, Knoblauch K, Kennedy H. Laminar distribution of neurons in extrastriate
|
||
areas projecting to visual areas V1 and V4 correlates with the hierarchical rank and indicates the
|
||
operation of a distance rule. The Journal of Neuroscience. 2000; 20:3263. [PubMed: 10777791]
|
||
Berkes P, Orban G, Lengyel M, Fiser J. Spontaneous Cortical Activity Reveals Hallmarks of an
|
||
Optimal Internal Model of the Environment. Science. 2011; 331:83–87. [PubMed: 21212356]
|
||
Bollimunta A, Mo J, Schroeder CE, Ding M. Neuronal Mechanisms and Attentional Modulation of
|
||
Corticothalamic Alpha Oscillations. The Journal of Neuroscience. 2011; 31:4935. [PubMed:
|
||
21451032]
|
||
Bosman CA, Schoffelen J-M, Brunet N, Oostenveld R, Bastos AM, Womelsdorf T, Rubehn B,
|
||
Stieglitz T, De Weerd P, Fries P. Attentional Stimulus Selection through Selective
|
||
Synchronization between Monkey Visual Areas. Neuron. 2012; 75:875–888. [PubMed: 22958827]
|
||
Briggs F, Callaway EM. Laminar patterns of local excitatory input to layer 5 neurons in macaque
|
||
primary visual cortex. Cerebral Cortex. 2005; 15:479. [PubMed: 15319309]
|
||
Briggs F, Usrey WM. Parallel processing in the corticogeniculate pathway of the macaque monkey.
|
||
Neuron. 2009; 62:135–146. [PubMed: 19376073]
|
||
|
||
Bastos et al.
|
||
Page 16
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Buchsbaum G, Gottschalk A. Trichromacy, opponent colours coding and optimum colour information
|
||
transmission in the retina. Proc. R. Soc. Lond., B, Biol. Sci. 1983; 220:89–113. [PubMed:
|
||
6140684]
|
||
Buffalo EA, Fries P, Landman R, Buschman TJ, Desimone R. Laminar differences in gamma and
|
||
alpha coherence in the ventral stream. Proceedings of the National Academy of Sciences. 2011;
|
||
108:11262.
|
||
Bullier J, Henry GH. Ordinal position and afferent input of neurons in monkey striate cortex. J. Comp.
|
||
Neurol. 1980; 193:913–935. [PubMed: 6253535]
|
||
Bullier J, Hupé JM, James A, Girard P. Functional interactions between areas V1 and V2 in the
|
||
monkey. Journal of Physiology-Paris. 1996; 90:217–220.
|
||
Buschman TJ, Miller EK. Top-Down Versus Bottom-Up Control of Attention in the Prefrontal and
|
||
Posterior Parietal Cortices. Science. 2007; 315:1860–1862. [PubMed: 17395832]
|
||
Buzsáki G. Neural Syntax: Cell Assemblies, Synapsembles, and Readers. Neuron. 2010; 68:362–385.
|
||
[PubMed: 21040841]
|
||
Callaway EM. Local circuits in primary visual cortex of the macaque monkey. Annual Review of
|
||
Neuroscience. 1998; 21:47–74.
|
||
Callaway EM, Wiser AK. Contributions of individual layer 2-5 spiny neurons to local circuits in
|
||
macaque primary visual cortex. Vis. Neurosci. 1996; 13:907–922. [PubMed: 8903033]
|
||
Canolty RT, Ganguly K, Kennerley SW, Cadieu CF, Koepsell K, Wallis JD, Carmena JM. Oscillatory
|
||
phase coupling coordinates anatomically dispersed functional cell assemblies. Proceedings of the
|
||
National Academy of Sciences. 2010; 107:17356.
|
||
Chu Z, Galarreta M, Hestrin S. Synaptic interactions of late-spiking neocortical neurons in layer 1. The
|
||
Journal of Neuroscience. 2003; 23:96. [PubMed: 12514205]
|
||
Covic EN, Sherman SM. Synaptic properties of connections between the primary and secondary
|
||
auditory cortices in mice. Cereb. Cortex. 2011; 21:2425–2441. [PubMed: 21385835]
|
||
Crick F. Function of the thalamic reticular complex: the searchlight hypothesis. Proceedings of the
|
||
National Academy of Sciences of the United States of America. 1984; 81:4586. [PubMed:
|
||
6589612]
|
||
Crick F, Koch C. Constraints on cortical and thalamic projections: the no-strong-loops hypothesis.
|
||
Nature. 1998; 391:245–250. [PubMed: 9440687]
|
||
Dan Y, Atick JJ, Reid RC. Efficient coding of natural scenes in the lateral geniculate nucleus:
|
||
experimental test of a computational theory. The Journal of Neuroscience. 1996; 16:3351–3362.
|
||
[PubMed: 8627371]
|
||
David O, Kiebel SJ, Harrison LM, Mattout J, Kilner JM, Friston KJ. Dynamic causal modeling of
|
||
evoked responses in EEG and MEG. NeuroImage. 2006; 30:1255–1272. [PubMed: 16473023]
|
||
Dayan P, Hinton GE, Neal RM, Zemel RS. The Helmholtz machine. Neural Comput. 1995; 7:889–
|
||
904. [PubMed: 7584891]
|
||
Desimone R. Neural mechanisms for visual memory and their role in attention. Proc. Natl. Acad. Sci.
|
||
U.S.A. 1996; 93:13494–13499. [PubMed: 8942962]
|
||
Douglas RJ, Martin K. A functional microcircuit for cat visual cortex. The Journal of Physiology.
|
||
1991; 440:735. [PubMed: 1666655]
|
||
Douglas RJ, Martin KA, Whitteridge D. A canonical microcircuit for neocortex. Neural Computation.
|
||
1989; 1:480–488.
|
||
Douglas RJ, Martin KAC. Neuronal Circuits of the Neocortex. Annu. Rev. Neurosci. 2004; 27:419–
|
||
451. [PubMed: 15217339]
|
||
Engbert R, Mergenthaler K, Sinn P, Pikovsky A. PNAS Plus: An integrated model of fixational eye
|
||
movements and microsaccades. Proceedings of the National Academy of Sciences. 2011;
|
||
108:E765–E770.
|
||
Engel, AK.; Friston, KJ.; Kelso, JA.; König, P.; Kovács, I.; MacDonald, A.; Miller, EK.; Phillips,
|
||
WA.; Silverstein, SM.; Tallon-Baudry, C., et al. Coordination in Behavior and Cognition.. In: von
|
||
der Malsburg, C.; Phillips, WA.; Singer, W., editors. Dynamic Coordination in the Brain: From
|
||
Neurons to Mind. MIT Press; Cambridge, MA: 2010. p. 267-299.
|
||
|
||
Bastos et al.
|
||
Page 17
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Feldman H, Friston KJ. Attention, uncertainty, and free-energy. Front Hum Neurosci. 2010; 4:215.
|
||
[PubMed: 21160551]
|
||
Felleman DJ, Van Essen DC. Distributed hierarchical processing in the primate cerebral cortex. Cereb.
|
||
Cortex. 1991; 1:1–47. [PubMed: 1822724]
|
||
Fitzpatrick D, Usrey WM, Schofield BR, Einstein G. The sublaminar organization of corticogeniculate
|
||
neurons in layer 6 of macaque striate cortex. Vis. Neurosci. 1994; 11:307–315. [PubMed:
|
||
7516176]
|
||
Fries P. A mechanism for cognitive dynamics: neuronal communication through neuronal coherence.
|
||
Trends in Cognitive Sciences. 2005; 9:474–480. [PubMed: 16150631]
|
||
Fries P, Reynolds JH, Rorie AE, Desimone R. Modulation of oscillatory neuronal synchronization by
|
||
selective visual attention. Science. 2001; 291:1560–1563. [PubMed: 11222864]
|
||
Friston K. Hierarchical Models in the Brain. PLoS Comput Biol. 2008; 4:e1000211. [PubMed:
|
||
18989391]
|
||
Friston K. The free-energy principle: a unified brain theory? Nature Reviews Neuroscience. 2010;
|
||
11:127–138.
|
||
Fujisawa S, Buzsáki G. A 4 Hz Oscillation Adaptively Synchronizes Prefrontal, VTA, and
|
||
Hippocampal Activities. Neuron. 2011; 72:153–165. [PubMed: 21982376]
|
||
Garrido MI, Kilner JM, Kiebel SJ, Friston KJ. Evoked brain responses are generated by feedback
|
||
loops. Proc. Natl. Acad. Sci. U.S.A. 2007; 104:20961–20966. [PubMed: 18087046]
|
||
Garrido MI, Kilner JM, Stephan KE, Friston KJ. The mismatch negativity: a review of underlying
|
||
mechanisms. Clinical Neurophysiology. 2009; 120:453–463. [PubMed: 19181570]
|
||
Gentet LJ, Kremer Y, Taniguchi H, Huang ZJ, Staiger JF, Petersen CCH. Unique functional properties
|
||
of somatostatin-expressing GABAergic neurons in mouse barrel cortex. Nature Neuroscience.
|
||
2012; 15:607–612.
|
||
George D, Hawkins J. Towards a mathematical theory of cortical micro-circuits. PLoS Computational
|
||
Biology. 2009; 5:e1000532. [PubMed: 19816557]
|
||
Gilbert CD, Wiesel TN. Functional organization of the visual cortex. Prog. Brain Res. 1983; 58:209–
|
||
218. [PubMed: 6138809]
|
||
Girard P, Bullier J. Visual activity in area V2 during reversible inactivation of area 17 in the macaque
|
||
monkey. Journal of Neurophysiology. 1989; 62:1287. [PubMed: 2600626]
|
||
Girard P, Salin PA, Bullier J. Visual activity in areas V3a and V3 during reversible inactivation of area
|
||
V1 in the macaque monkey. Journal of Neurophysiology. 1991a; 66:1493. [PubMed: 1765790]
|
||
Girard P, Salin PA, Bullier J. Visual activity in macaque area V4 depends on area 17 input.
|
||
Neuroreport. 1991b; 2:81–84. [PubMed: 1883988]
|
||
Girard P, Salin PA, Bullier J. Response selectivity of neurons in area MT of the macaque monkey
|
||
during reversible inactivation of area V1. Journal of Neurophysiology. 1992; 67:1437. [PubMed:
|
||
1629756]
|
||
Gray CM, König P, Engel AK, Singer W. Oscillatory responses in cat visual cortex exhibit inter-
|
||
columnar synchronization which reflects global stimulus properties. Nature. 1989; 338:334–337.
|
||
[PubMed: 2922061]
|
||
Gregoriou GG, Gotts SJ, Zhou H, Desimone R. High-Frequency, Long-Range Coupling Between
|
||
Prefrontal and Visual Cortex During Attention. Science. 2009; 324:1207–1210. [PubMed:
|
||
19478185]
|
||
Gregory RL. Perceptual illusions and brain models. Proc. R. Soc. Lond., B. Biol. Sci. 1968; 171:279–
|
||
296. [PubMed: 4387405]
|
||
Gregory RL. Perceptions as hypotheses. Philos. Trans. R. Soc. Lond., B. Biol. Sci. 1980; 290:181–197.
|
||
[PubMed: 6106237]
|
||
Gwin JT, Ferris DP. Beta- and gamma-range human lower limb corticomuscular coherence. Front
|
||
Hum Neurosci. 2012; 6:258. [PubMed: 22973219]
|
||
Haeusler S, Maass W. A statistical analysis of information-processing properties of lamina-specific
|
||
cortical microcircuit models. Cerebral Cortex. 2007; 17:149. [PubMed: 16481565]
|
||
Harrison LM, Stephan KE, Rees G, Friston KJ. Extra-classical receptive field effects measured in
|
||
striate cortex with fMRI. Neuroimage. 2007; 34:1199–1208. [PubMed: 17169579]
|
||
|
||
Bastos et al.
|
||
Page 18
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Häusser M, Mel B. Dendrites: bug or feature? Current Opinion in Neurobiology. 2003; 13:372–383.
|
||
[PubMed: 12850223]
|
||
Hebb, DO. The Organization of Behavior: A Neuropsychological Theory. Wiley; New York: 1949.
|
||
Helmholtz, H. English translation. Dover; New York: 1860. Handbuch der Physiologischen Optik..
|
||
Hinton G, van Camp D. Keeping neural networks simple by minimizing the description length of
|
||
weights. Proceedings of COLT-93. 1993:5–13.
|
||
Hinton, GE.; Zemel, RS. Autoencoders, Minimum Description Length, and Helmholtz Free Energy..
|
||
In: Cowan, JD.; Tesauro, G.; Alspector, J., editors. Advances in Neural Information Processing
|
||
Systems 6. Morgan Kaufmann; San Mateo, CA: 1994.
|
||
Hopfinger JB, Buonocore MH, Mangun GR. The neural mechanisms of top- down attentional control.
|
||
Nature Neuroscience. 2000; 3:284–291.
|
||
Horton JC, Adams DL. The cortical column: a structure without a function. Philosophical Transactions
|
||
of the Royal Society B: Biological Sciences. 2005; 360:837–862.
|
||
Hubel DH, Wiesel TN. Receptive fields, binocular interaction and functional architecture in the cat's
|
||
visual cortex. The Journal of Physiology. 1962; 160:106. [PubMed: 14449617]
|
||
Hubel DH, Wiesel TN. Laminar and columnar distribution of geniculo-cortical fibers in the macaque
|
||
monkey. The Journal of Comparative Neurology. 1972; 146:421–450. [PubMed: 4117368]
|
||
Hubel DH, Wiesel TN. Sequence regularity and geometry of orientation columns in the monkey striate
|
||
cortex. J. Comp. Neurol. 1974; 158:267–293. [PubMed: 4436456]
|
||
Hupé JM, James AC, Payne BR, Lomber SG, Girard P, Bullier J. Cortical feedback improves
|
||
discrimination between figure and background by V1, V2 and V3 neurons. Nature. 1998;
|
||
394:784–787. [PubMed: 9723617]
|
||
Johnson RR, Burkhalter A. Microcircuitry of forward and feedback connections within rat visual
|
||
cortex. J. Comp. Neurol. 1996; 368:383–398. [PubMed: 8725346]
|
||
Johnson RR, Burkhalter A. A polysynaptic feedback circuit in rat visual cortex. The Journal of
|
||
Neuroscience. 1997; 17:7129. [PubMed: 9278547]
|
||
Kätzel D, Zemelman BV, Buetfering C, Wölfel M, Miesenböck G. The columnar and laminar
|
||
organization of inhibitory connections to neocortical excitatory cells. Nature Neuroscience. 2010;
|
||
14:100–107.
|
||
Kay JW, Phillips WA. Coherent Infomax as a computational goal for neural systems. Bull. Math. Biol.
|
||
2011; 73:344–372. [PubMed: 20821064]
|
||
Klampfl S, Legenstein R, Maass W. Spiking neurons can learn to solve information bottleneck
|
||
problems and extract independent components. Neural Computation. 2009; 21:911–959. [PubMed:
|
||
19018708]
|
||
Knight RT. Decreased response to novel stimuli after prefrontal lesions in man. Electroencephalogr
|
||
Clin Neurophysiol. 1984; 59:9–20. [PubMed: 6198170]
|
||
Knight RT, Scabini D, Woods DL. Prefrontal cortex gating of auditory transmission in humans. Brain
|
||
Res. 1989; 504:338–342. [PubMed: 2598034]
|
||
Knill DC, Pouget A. The Bayesian brain: the role of uncertainty in neural coding and computation.
|
||
Trends in Neurosciences. 2004; 27:712–719. [PubMed: 15541511]
|
||
de Kock CPJ, Bruno RM, Spors H, Sakmann B. Layer- and cell-type-specific suprathreshold stimulus
|
||
representation in rat primary somatosensory cortex. J. Physiol. (Lond.). 2007; 581:139–154.
|
||
[PubMed: 17317752]
|
||
Kok P, Rahnev D, Jehee JFM, Lau HC, de Lange FP. Attention Reverses the Effect of Prediction in
|
||
Silencing Sensory Signals. Cerebral Cortex. 2011; 22:2197–2206. [PubMed: 22047964]
|
||
Lakatos P, Karmos G, Mehta AD, Ulbert I, Schroeder CE. Entrainment of neuronal oscillations as a
|
||
mechanism of attentional selection. Science. 2008; 320:110–113. [PubMed: 18388295]
|
||
Larkum ME, Nevian T, Sandler M, Polsky A, Schiller J. Synaptic Integration in Tuft Dendrites of
|
||
Layer 5 Pyramidal Neurons: A New Unifying Principle. Science. 2009; 325:756–760. [PubMed:
|
||
19661433]
|
||
Linsker R. Perceptual neural organization: some approaches based on network models and information
|
||
theory. Annual Review of Neuroscience. 1990; 13:257–281.
|
||
|
||
Bastos et al.
|
||
Page 19
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Livingstone MS. Oscillatory firing and interneuronal correlations in squirrel monkey striate cortex.
|
||
Journal of Neurophysiology. 1996; 75:2467–2485. [PubMed: 8793757]
|
||
London M, Häusser M. Dendritic computation. Annu. Rev. Neurosci. 2005; 28:503–532. [PubMed:
|
||
16033324]
|
||
Lopes-dos-Santos V, Conde-Ocazionez S, Nicolelis MAL, Ribeiro ST, Tort ABL. Neuronal Assembly
|
||
Detection and Cell Membership Specification by Principal Component Analysis. PLoS ONE.
|
||
2011; 6:e20996. [PubMed: 21698248]
|
||
MacKay, DM. Automata Studies. Shannon, CE.; McCarthy, J., editors. Princeton Univ. Press;
|
||
Princeton, NJ: 1956. p. 235-251.
|
||
Maier A, Adams GK, Aura C, Leopold DA. Distinct superficial and deep laminar domains of activity
|
||
in the visual cortex during rest and stimulation. Frontiers in Systems Neuroscience. 2010; 4
|
||
Markov NT, Misery P, Falchier A, Lamy C, Vezoli J, Quilodran R, Gariel MA, Giroud P, Ercsey-
|
||
Ravasz M, Pilaz LJ, et al. Weight consistency specifies regularities of macaque cortical networks.
|
||
Cerebral Cortex. 2011; 21:1254. [PubMed: 21045004]
|
||
Meirovithz E, Ayzenshtat I, Werner-Reiss U, Shamir I, Slovin H. Spatiotemporal Effects of
|
||
Microsaccades on Population Activity in the Visual Cortex of Monkeys during Fixation. Cerebral
|
||
Cortex. 2011; 22:294–307. [PubMed: 21653284]
|
||
Melzer S, Michael M, Caputi A, Eliava M, Fuchs EC, Whittington MA, Monyer H. Long-Range-
|
||
Projecting GABAergic Neurons Modulate Inhibition in Hippocampus and Entorhinal Cortex.
|
||
Science. 2012; 335:1506–1510. [PubMed: 22442486]
|
||
Meyer HS, Schwarz D, Wimmer VC, Schmitt AC, Kerr JND, Sakmann B, Helmstaedter M. Inhibitory
|
||
interneurons in a cortical column form hot zones of inhibition in layers 2 and 5A. Proceedings of
|
||
the National Academy of Sciences. 2011; 108:16807–16812.
|
||
Meyer T, Olson CR. Statistical learning of visual transitions in monkey inferotemporal cortex.
|
||
Proceedings of the National Academy of Sciences. 2011; 108:19401–19406.
|
||
Mignard M, Malpeli JG. Paths of information flow through visual cortex. Science. 1991; 251:1249.
|
||
[PubMed: 1848727]
|
||
Moran RJ, Stephan KE, Kiebel SJ, Rombach N, O'Connor WT, Murphy KJ, Reilly RB, Friston KJ.
|
||
Bayesian estimation of synaptic physiology from the spectral responses of neural masses.
|
||
Neuroimage. 2008; 42:272–284. [PubMed: 18515149]
|
||
Moran RJ, Symmonds M, Stephan KE, Friston KJ, Dolan RJ. An in vivo assay of synaptic function
|
||
mediating human cognition. Curr. Biol. 2011; 21:1320–1325. [PubMed: 21802302]
|
||
Mountcastle VB. Modality and topographic properties of single neurons of cat's somatic sensory
|
||
cortex. Journal of Neurophysiology. 1957; 20:408. [PubMed: 13439410]
|
||
Mountcastle VB. The columnar organization of the neocortex. Brain. 1997; 120:701. [PubMed:
|
||
9153131]
|
||
Mumford D. On the computational architecture of the neocortex. II. The role of cortico-cortical loops.
|
||
Biol Cybern. 1992; 66:241–251. [PubMed: 1540675]
|
||
Murphy PC, Sillito AM. Corticofugal feedback influences the generation of length tuning in the visual
|
||
pathway. Nature. 1987; 329:727–729. [PubMed: 3670375]
|
||
Murray SO. Spatially Specific fMRI Repetition Effects in Human Visual Cortex. Journal of
|
||
Neurophysiology. 2005; 95:2439–2445. [PubMed: 16394067]
|
||
Murray SO, Kersten D, Olshausen BA, Schrater P, Woods DL. Shape perception reduces activity in
|
||
human primary visual cortex. Proc. Natl. Acad. Sci. U.S.A. 2002; 99:15164–15169. [PubMed:
|
||
12417754]
|
||
Neisser, U. Cognitive psychology. Appleton-Century-Crofts; New York: 1967.
|
||
de No, LR. Cerebral cortex: architecture, intracortical connections, motor projections.. In: Fulton, JF.,
|
||
editor. Physiology of the Nervous System. Oxford University Press; Oxford: 1949. p. 288-330.
|
||
Olsen SR, Bortone DS, Adesnik H, Scanziani M. Gain control by layer six in cortical circuits of vision.
|
||
Nature. 2012
|
||
De Pasquale R, Sherman SM. Synaptic properties of corticocortical connections between the primary
|
||
and secondary visual cortical areas in the mouse. J. Neurosci. 2011; 31:16494–16506. [PubMed:
|
||
22090476]
|
||
|
||
Bastos et al.
|
||
Page 20
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Raizada RDS, Grossberg S. Towards a theory of the laminar architecture of cerebral cortex:
|
||
Computational clues from the visual system. Cerebral Cortex. 2003; 13:100–113. [PubMed:
|
||
12466221]
|
||
Rao RP, Ballard DH. Predictive coding in the visual cortex: a functional interpretation of some extra-
|
||
classical receptive-field effects. Nature Neuroscience. 1999; 2:79–87.
|
||
Roopun AK, Kramer MA, Carracedo LM, Kaiser M, Davies CH, Traub RD, Kopell NJ, Whittington
|
||
MA. Period concatenation underlies interactions between gamma and beta rhythms in neocortex.
|
||
Front Cell Neurosci. 2008; 2:1. [PubMed: 18946516]
|
||
Roopun AK, Middleton SJ, Cunningham MO, LeBeau FE, Bibbig A, Whittington MA, Traub RD. A
|
||
beta2-frequency (20–30 Hz) oscillation in nonsynaptic networks of somatosensory cortex.
|
||
Proceedings of the National Academy of Sciences. 2006; 103:15646.
|
||
Saalmann YB, Pigarev IN, Vidyasagar TR. Neural Mechanisms of Visual Attention: How Top-Down
|
||
Feedback Highlights Relevant Locations. Science. 2007; 316:1612–1615. [PubMed: 17569863]
|
||
Saalmann YB, Pinsk MA, Wang L, Li X, Kastner S. The Pulvinar Regulates Information Transmission
|
||
Between Cortical Areas Based on Attention Demands. Science. 2012; 337:753–756. [PubMed:
|
||
22879517]
|
||
Sakata S, Harris KD. Laminar Structure of Spontaneous and Sensory-Evoked Population Activity in
|
||
Auditory Cortex. Neuron. 2009; 64:404–418. [PubMed: 19914188]
|
||
Salin PA, Bullier J. Corticocortical connections in the visual system: structure and function. Physiol.
|
||
Rev. 1995; 75:107–154. [PubMed: 7831395]
|
||
Sandell JH, Schiller PH. Effect of cooling area 18 on striate cortex cells in the squirrel monkey.
|
||
Journal of Neurophysiology. 1982; 48:38. [PubMed: 6288886]
|
||
Sherman SM, Guillery R. On the actions that one nerve cell can have on another: distinguishing
|
||
“drivers” from “modulators.”. Proceedings of the National Academy of Sciences of the United
|
||
States of America. 1998; 95:7121. [PubMed: 9618549]
|
||
Sherman SM, Guillery RW. Distinct functions for direct and transthalamic corticocortical connections.
|
||
Journal of Neurophysiology. 2011; 106:1068–1077. [PubMed: 21676936]
|
||
Shipp S. Structure and function of the cerebral cortex. Current Biology. 2007; 17:443–449.
|
||
Shlosberg D, Amitai Y, Azouz R. Time-Dependent, Layer-Specific Modulation of Sensory Responses
|
||
Mediated by Neocortical Layer 1. Journal of Neurophysiology. 2006; 96:3170–3182. [PubMed:
|
||
17110738]
|
||
Sillito AM, Cudeiro J, Murphy PC. Orientation sensitive elements in the corticofugal influence on
|
||
centre-surround interactions in the dorsal lateral geniculate nucleus. Exp Brain Res. 1993; 93:6–
|
||
16. [PubMed: 8467892]
|
||
Sincich LC, Horton JC. The circuitry of V1 and V2: integration of color, form, and motion. Annu.
|
||
Rev. Neurosci. 2005; 28:303–326. [PubMed: 16022598]
|
||
Singer W. Neuronal synchrony: a versatile code for the definition of relations? Neuron. 1999; 24:49–
|
||
65. 111–125. [PubMed: 10677026]
|
||
Singer W, Engel AK, Kreiter AK, Munk MH, Neuenschwander S, Roelfsema PR. Neuronal
|
||
assemblies: necessity, signature and detectability. Trends Cogn. Sci. (Regul. Ed.). 1997; 1:252–
|
||
261. [PubMed: 21223920]
|
||
Spratling MW. Reconciling predictive coding and biased competition models of cortical function.
|
||
Front Comput Neurosci. 2008; 2:4. [PubMed: 18978957]
|
||
Srinivasan MV, Laughlin SB, Dubs A. Predictive coding: a fresh view of inhibition in the retina. Proc.
|
||
R. Soc. Lond., B, Biol. Sci. 1982; 216:427–459. [PubMed: 6129637]
|
||
Summerfield C, Trittschuh EH, Monti JM, Mesulam M-M, Egner T. Neural repetition suppression
|
||
reflects fulfilled perceptual expectations. Nature Neuroscience. 2008; 11:1004–1006.
|
||
Summerfield C, Wyart V, Johnen VM, de Gardelle V. Human Scalp Electroencephalography Reveals
|
||
that Repetition Suppression Varies with Expectation. Front Hum Neurosci. 2011; 5:67. [PubMed:
|
||
21847378]
|
||
Theyel BB, Llano DA, Sherman SM. The corticothalamocortical circuit drives higher-order cortex in
|
||
the mouse. Nature Neuroscience. 2009; 13:84–88.
|
||
|
||
Bastos et al.
|
||
Page 21
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Thomson AM, Bannister AP. Interlaminar connections in the neocortex. Cerebral Cortex. 2003; 13:5–
|
||
14. [PubMed: 12466210]
|
||
Todorovic A, van Ede F, Maris E, de Lange FP. Prior Expectation Mediates Neural Adaptation to
|
||
Repeated Sounds in the Auditory Cortex: An MEG Study. Journal of Neuroscience. 2011;
|
||
31:9118–9123. [PubMed: 21697363]
|
||
Ullman S. Sequence seeking and counter streams: a computational model for bidirectional information
|
||
flow in the visual cortex. Cereb. Cortex. 1995; 5:1–11. [PubMed: 7719126]
|
||
Usrey WM, Fitzpatrick D. Specificity in the axonal connections of layer VI neurons in tree shrew
|
||
striate cortex: evidence for distinct granular and supragranular systems. The Journal of
|
||
Neuroscience. 1996; 16:1203. [PubMed: 8558249]
|
||
Varela F, Lachaux JP, Rodriguez E, Martinerie J. The brainweb: phase synchronization and large-scale
|
||
integration. Nature Reviews Neuroscience. 2001; 2:229–239.
|
||
Vezoli J. Quantitative Analysis of Connectivity in the Visual Cortex: Extracting Function from
|
||
Structure. The Neuroscientist. 2004; 10:476–482. [PubMed: 15359013]
|
||
Vicente R, Gollo LL, Mirasso CR, Fischer I, Pipa G. Dynamical relaying can yield zero time lag
|
||
neuronal synchrony despite long conduction delays. Proceedings of the National Academy of
|
||
Sciences. 2008; 105:17157.
|
||
Wacongne C, Labyt E, van Wassenhove V, Bekinschtein T, Naccache L, Dehaene S. Evidence for a
|
||
hierarchy of predictions and prediction errors in human cortex. Proceedings of the National
|
||
Academy of Sciences. 2011; 108:20754–20759.
|
||
Wang X-J. Neurophysiological and computational principles of cortical rhythms in cognition. Physiol.
|
||
Rev. 2010; 90:1195–1268. [PubMed: 20664082]
|
||
Weiler N, Wood L, Yu J, Solla SA, Shepherd GMG. Top-down laminar organization of the excitatory
|
||
network in motor cortex. Nat Neurosci. 2008; 11:360–366. [PubMed: 18246064]
|
||
Wozny C, Williams SR. Specificity of Synaptic Connectivity between Layer 1 Inhibitory Interneurons
|
||
and Layer⅔ Pyramidal Neurons in the Rat Neocortex. Cerebral Cortex. 2011
|
||
Wurtz RH, McAlonan K, Cavanaugh J, Berman RA. Thalamic pathways for active vision. Trends
|
||
Cogn. Sci. (Regul. Ed.). 2011; 15:177–184. [PubMed: 21414835]
|
||
Wyart V, Nobre AC, Summerfield C. Dissociable prior influences of signal probability and relevance
|
||
on visual contrast sensitivity. Proceedings of the National Academy of Sciences. 2012;
|
||
109:3593–3598.
|
||
Yamaguchi S, Knight RT. Gating of somatosensory input by human prefrontal cortex. Brain Res.
|
||
1990; 521:281–288. [PubMed: 2207666]
|
||
Yoshimura Y, Callaway EM. Fine-scale specificity of cortical networks depends on inhibitory cell
|
||
type and connectivity. Nat. Neurosci. 2005; 8:1552–1559. [PubMed: 16222228]
|
||
Yuille A, Kersten D. Vision as Bayesian inference: analysis by synthesis? Trends Cogn. Sci. (Regul.
|
||
Ed.). 2006; 10:301–308. [PubMed: 16784882]
|
||
Zeki S, Shipp S. The functional logic of cortical connections. Nature. 1988; 335:311–317. [PubMed:
|
||
3047584]
|
||
Zeki SM. The cortical projections of foveal striate cortex in the rhesus monkey. J. Physiol. (Lond.).
|
||
1978; 277:227–244. [PubMed: 418174]
|
||
|
||
Bastos et al.
|
||
Page 22
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Figure 1.
|
||
This is a schematic of the classical microcircuit adapted from Douglas and Martin (1991).
|
||
This minimal circuitry comprises superficial (layers 2 and 3) and deep (layers, 5 and 6)
|
||
pyramidal cells and a population of smooth inhibitory cells. Feedforward inputs – from the
|
||
thalamus – target all cell populations, but with an emphasis on inhibitory interneurons and
|
||
superficial and granular layers. Note the symmetrical deployment of inhibitory and
|
||
excitatory intrinsic connections that maintain a balance of excitation and inhibition.
|
||
|
||
Bastos et al.
|
||
Page 23
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Figure 2.
|
||
This is a simplified schematic of the key intrinsic connections among excitatory (E) and
|
||
inhibitory (I) populations in granular (L4), supragranular (L1/2/3) and infragranular (L5/6)
|
||
layers. The excitatory interlaminar connections are based largely on Gilbert and Wiesel
|
||
(1983). Forward connections denote feedforward extrinsic corticocortical or thalamocortical
|
||
afferents that are reciprocated by backward or feedback connections. Anatomical and
|
||
functional data suggest that afferent input enters primarily into L4 and is conveyed to
|
||
superficial layers L2/3 that are rich in pyramidal cells, which project forward to the next
|
||
cortical area, forming a disynaptic route between thalamus and secondary cortical areas
|
||
(Callaway, 1998). Information from L2/3 is then sent to L5 and L6, which sends (intrinsic)
|
||
feedback projections back to L4 (Usrey and Fitzpatrick, 1996). L5 cells originate feedback
|
||
connections to earlier cortical areas as well as to the pulvinar, superior colliculus, and brain
|
||
stem. In summary, forward input is segregated by intrinsic connections into a superficial
|
||
forward stream and a deep backward stream. In this schematic, we have juxtaposed densely
|
||
interconnected excitatory and inhibitory populations within each layer.
|
||
|
||
Bastos et al.
|
||
Page 24
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Figure 3.
|
||
This schematic shows an example of a generative model. Generative models describe how
|
||
(sensory) data are caused. In this figure, sensory states (blue circles on the periphery) are
|
||
generated by hidden variables (in the centre). The left panel shows the model as a
|
||
probabilistic graphical model, where unknown variables (hidden causes and states) are
|
||
associated with the nodes of a dependency graph and conditional dependencies are indicated
|
||
by arrows. Hidden states confer memory on the model by virtue of having dynamics, while
|
||
hidden causes connect nodes. A graphical model describes the conditional dependencies
|
||
among hidden variables generating data. These dependencies are typically modelled as
|
||
(differential) equations with nonlinear mappings and random fluctuations
|
||
with precision
|
||
(inverse variance) Π(i) (see the equations in the insert on the left). This allows one to specify
|
||
the precise form of the probabilistic generative model and leads to a simple and efficient
|
||
inversion scheme (predictive coding; see next figure). Here
|
||
denotes the set of hidden
|
||
causes that constitute the parents of sensory s̃
|
||
(i) or hidden x̃
|
||
(i) states. The ~ indicates states in
|
||
generalised coordinates of motion: x̃
|
||
= (x, x′, x″,...). An intuitive version of the model is
|
||
shown on the right: here, we imagine that a singing bird is the cause of sensations, which –
|
||
through a cascade of dynamical hidden states – produces modality-specific consequences
|
||
(e.g., the auditory object of a bird song and the visual object of a song bird). These
|
||
intermediate causes are themselves (hierarchically) unpacked to generate sensory signals.
|
||
The generative model therefore maps from causes (e.g., concepts) to consequences (e.g.,
|
||
sensations), while its inversion corresponds to mapping from sensations to concepts or
|
||
representations. This inversion corresponds to perceptual synthesis – in which the generative
|
||
model is used to generate predictions. Note that this inversion implicitly resolves the binding
|
||
problem - by explaining multisensory cues with a single cause.
|
||
|
||
Bastos et al.
|
||
Page 25
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Figure 4.
|
||
This figure describes the predictive coding scheme associated with a simple hierarchical
|
||
model shown on the left. In this model each node has a single parent. The ensuing inversion
|
||
or generalised predictive coding scheme is shown on the right. The key quantities in this
|
||
scheme are (conditional) expectations of the hidden states and causes and their associated
|
||
prediction errors. The basic architecture – implied by the inversion of the graphical
|
||
(hierarchical) model – suggests that prediction errors (caused by unpredicted fluctuations in
|
||
hidden variables) are passed up the hierarchy to update conditional expectations. These
|
||
conditional expectations now provide predictions that are passed down the hierarchy to form
|
||
prediction errors. We presume that the forward and backward message passing between
|
||
hierarchical levels is mediated by extrinsic (feedforward and feedback) connections.
|
||
Neuronal populations encoding conditional expectations and prediction errors now have to
|
||
be deployed in a canonical microcircuit to understand the computational logic of intrinsic
|
||
connections – within each level of the hierarchy – as shown in the next figure.
|
||
|
||
Bastos et al.
|
||
Page 26
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Figure 5.
|
||
The left hand panel is the canonical microcircuit based on Haeusler and Maass (2007),
|
||
where we have removed inhibitory cells from the deep layers – because they have very little
|
||
interlaminar connectivity. The numbers denote connection strengths (mean amplitude of
|
||
PSPs measured at soma in mV) and connection probabilities (in parentheses) according to
|
||
Thomson et al. (2002). The right panel shows the proposed cortical microcircuit for
|
||
predictive coding, where the quantities of the previous figure have been associated with
|
||
various cell types. Here, prediction error populations are highlighted in pink. Inhibitory
|
||
connections are shown in red, while excitatory connections are in black. The dotted lines
|
||
refer to connections that are not present in the microcircuit on the left (but see Figure 2). In
|
||
this scheme, expectations (about causes and states) are assigned to (excitatory and
|
||
inhibitory) interneurons in the supragranular layers, which are passed to infragranular layers.
|
||
The corresponding prediction errors occupy granular layers, while superficial pyramidal
|
||
cells encode prediction errors that are sent forward to the next hierarchical level. Conditional
|
||
expectations and prediction errors on hidden causes are associated with excitatory cell types,
|
||
while the corresponding quantities for hidden states are assigned to inhibitory cells. Dark
|
||
circles indicate pyramidal cells. Finally, we have placed the precision of the feedforward
|
||
prediction errors against the superficial pyramidal cells. This quantity controls the
|
||
postsynaptic sensitivity or gain to (intrinsic and top-down) pre-synaptic inputs. We have
|
||
previously discussed this in terms of attentional modulation, which may be intimately linked
|
||
to the synchronisation of pre-synaptic inputs and ensuing postsynaptic responses (Fries et al
|
||
2001; Feldman and Friston, 2010).
|
||
|
||
Bastos et al.
|
||
Page 27
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
Figure 6.
|
||
This schematic illustrates the functional asymmetry between the spectral activity of
|
||
superficial and deep cells predicted theoretically. In this illustrative example, we have
|
||
ignored the effects of influences on the expectations of hidden causes (encoded by deep
|
||
pyramidal cells), other than the prediction error on causes (encoded by superficial pyramidal
|
||
cells). The lower panel shows the spectral density of deep pyramidal cell activity, given the
|
||
spectral density of superficial pyramidal cell activity in the upper panel. The equation
|
||
expresses the spectral density of the deep cells as a function of the spectral density of the
|
||
superficial cells; using Equation (2). This schematic is meant to illustrate how the relative
|
||
amounts of low (beta) and high (gamma) frequency activity in superficial and deep cells can
|
||
be explained by the evidence accumulation implicit in predictive coding.
|
||
|
||
Bastos et al.
|
||
Page 28
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
Bastos et al.
|
||
Page 29
|
||
|
||
Table 1
|
||
|
||
Electrophysiological and neuroimaging findings consistent with predictive coding.
|
||
|
||
Prediction violated
|
||
Area studied
|
||
Neuronal expression of Prediction-
|
||
error
|
||
|
||
Study
|
||
|
||
Learned visual object pairings
|
||
Monkey inferotemporal cortex
|
||
(IT)
|
||
|
||
Enhanced firing rate
|
||
Meyer and Olson, 2011
|
||
|
||
Natural image statistics
|
||
Monkey V1, V2, V3
|
||
Enhanced firing rate
|
||
Hupé et al., 1998;
|
||
Bullier et al., 1996; Bair
|
||
et al., 2003
|
||
|
||
Repetitive auditory stream
|
||
Early human auditory cortex
|
||
Enhanced Event Related Potentials
|
||
(ERPs), enhanced gamma-band power
|
||
|
||
Garrido et al., 2007,
|
||
2009; Todorovic et al.,
|
||
2011
|
||
|
||
Coherence of visual form and
|
||
motion
|
||
|
||
Human V1, V2, V3, V4, V5/
|
||
MT
|
||
|
||
Enhanced BOLD response
|
||
Murray et al 2002;
|
||
Murray et al 2005;
|
||
Harrison et al., 2007
|
||
|
||
Audio-visual congruence of speech
|
||
Visual and auditory cortex
|
||
Gamma-band oscillatory activity
|
||
Arnal et al., 2011
|
||
|
||
Predictability of visual stimuli as a
|
||
function of attention
|
||
|
||
Human V1, V2, V3
|
||
Enhanced BOLD response when
|
||
unattended, reduced BOLD when
|
||
attended
|
||
|
||
Kok et al., 2011
|
||
|
||
Hierarchical expectations in
|
||
auditory sequences
|
||
|
||
Human temporal cortex
|
||
Enhanced Event Related Potentials
|
||
(ERPs)
|
||
|
||
Wacongne et al., 2011
|
||
|
||
Expected repetition (or
|
||
alternation) of face stimuli
|
||
|
||
FFA in fMRI, parietal and
|
||
central electrodes of EEG
|
||
|
||
Enhanced BOLD response, diminished
|
||
repetition suppression of ERP
|
||
|
||
Summerfield et al.,
|
||
2008, 2011
|
||
|
||
Apparent motion of visual
|
||
stimulus
|
||
|
||
V1
|
||
Enhanced BOLD response
|
||
Alink et al., 2010
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
NIH-PA Author Manuscript
|
||
|
||
Bastos et al.
|
||
Page 30
|
||
|
||
Table 2
|
||
|
||
The functional (computational) correlates of the anatomy and physiology of cortical hierarchies and their
|
||
extrinsic connections.
|
||
|
||
Anatomy and physiology
|
||
Functional correlates
|
||
|
||
Hierarchical organisation of cortical areas (Zeki and Shipp 1988;
|
||
Felleman and Van Essen, 1991; Barone et al., 2000; Vezoli, 2004)
|
||
|
||
Encoding of conditional dependencies in terms of a graphical
|
||
model (Mumford, 1992; Rao and Ballard, 1999; Friston 2008).
|
||
|
||
Distinct (laminar-specific) neuronal responses (Douglas et al., 1989;
|
||
Douglas and Martin, 1991)
|
||
|
||
Encoding expected states of the world (superficial pyramidal cells)
|
||
and prediction errors (deep pyramidal cells) (Mumford, 1992;
|
||
Friston 2008).
|
||
|
||
Distinct (laminar-specific) extrinsic connections (Zeki and Shipp
|
||
1988; Felleman and Van Essen, 1991; Barone et al., 2000; Vezoli, 2004;
|
||
Markov et al., 2011).
|
||
|
||
Forward connections convey prediction error (from superficial
|
||
pyramidal cells) and backward connections convey predictions
|
||
(from deep pyramidal cells) (Mumford, 1992; Friston 2008).
|
||
|
||
Reciprocal extrinsic connectivity (Zeki and Shipp 1988; Felleman and
|
||
Van Essen, 1991; Barone et al., 2000; Vezoli, 2004; Markov et al., 2011)
|
||
|
||
Recurrent dynamics are intrinsically stable because they are trying
|
||
to suppress prediction error (Crick and Koch 1998;; Friston 2008).
|
||
|
||
Feedback extrinsic connections are (driving and) modulatory
|
||
(Mignard and Malpeli 1991; Bullier et al., 1996; Sherman and Guillery
|
||
1998; Covic and Sherman, 2011; De Pasquale and Sherman, 2011).
|
||
|
||
Forwards (driving) and backwards (driving and modulatory)
|
||
connections mediate the (linear) influence of prediction errors and
|
||
the (linear and non-linear) construction of predictions (Friston
|
||
2008; 2010).
|
||
|
||
Feedback extrinsic connections are inhibitory (Murphy and Sillito,
|
||
1987; Sillito et al., 1993; Chu et al., 2003; Olsen et al. 2012; Meyer et al.,
|
||
2011; Wozny and Williams, 2011).
|
||
|
||
Top-down predictions suppress or counter prediction errors
|
||
produced by bottom up inputs (Mumford, 1992; Rao and Ballard,
|
||
1999; Friston 2008).
|
||
|
||
Differences in neuronal dynamics of superficial and deep layers (de
|
||
Kock et al., 2007; Sakata and Harris, 2009; Maier et al., 2010;
|
||
Bollimunta et al., 2011; Buffalo et al., 2011).
|
||
|
||
Principal cells elaborating predictions (deep pyramidal cells) may
|
||
show distinct (low-pass) dynamics, relative to those encoding error
|
||
(superficial pyramidal cells) (Friston 2008).
|
||
|
||
Dense intrinsic and horizontal connectivity (Thomson and Bannister,
|
||
2003; Katzel et al., 2010).
|
||
|
||
Lateral predictions and prediction errors mediating winnerless
|
||
competition and competitive lateral dependencies (Desimone,
|
||
1996; Friston 2010).
|
||
|
||
Predominance of nonlinear synaptic (dendritic and
|
||
neuromodulatory) infrastructure in superficial layers (Häusser and
|
||
Mel, 2003; London and Häusser, 2005; Gentet et al., 2012).
|
||
|
||
Required to scale prediction errors, in proportion to their precision,
|
||
affording a form of cortical bias or gain control that encodes
|
||
uncertainty (Feldman and Friston 2010; Spratling, 2008)
|
||
|
||
Neuron. Author manuscript; available in PMC 2013 September 19.
|
||
|
||
|