Files
intellecton/papers/references/Albantakis2023.txt
T

3205 lines
156 KiB
Plaintext
Raw Normal View History

2026-06-01 23:11:00 +00:00
RESEARCH ARTICLE
Integrated information theory (IIT) 4.0:
Formulating the properties of phenomenal
existence in physical terms
Larissa AlbantakisID1☯, Leonardo Barbosa1,2☯, Graham Findlay1,3☯, Matteo GrassoID1☯,
Andrew M. Haun1☯, William MarshallID1,4☯, William G. P. Mayner1,3☯,
Alireza Zaeemzadeh1☯, Melanie Boly1,5, Bjørn E. Juel1,6, Shuntaro Sasai1,7, Keiko Fujii1,
Isaac David1, Jeremiah Hendren1,8, Jonathan P. LangID1, Giulio TononiID1*
1 Department of Psychiatry, University of Wisconsin, Madison, Wisconsin, United States of America, 2 Fralin
Biomedical Research Institute at VTC, Virginia Tech, Roanoke, Virginia, United States of America,
3 Neuroscience Training Program, University of Wisconsin, Madison, Wisconsin, United States of America,
4 Department of Mathematics and Statistics, Brock University, St. Catharines, Ontario, Canada,
5 Department of Neurology, University of Wisconsin, Madison, Wisconsin, United States of America,
6 Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway, 7 Araya Inc., Tokyo, Japan,
8 Graduate School Language & Literature, Ludwig Maximilian University of Munich, Munich, Germany
☯ These authors contributed equally to this work.
* gtononi@wisc.edu
Abstract
This paper presents Integrated Information Theory (IIT) 4.0. IIT aims to account for the prop-
erties of experience in physical (operational) terms. It identifies the essential properties of
experience (axioms), infers the necessary and sufficient properties that its substrate must
satisfy (postulates), and expresses them in mathematical terms. In principle, the postulates
can be applied to any system of units in a state to determine whether it is conscious, to what
degree, and in what way. IIT offers a parsimonious explanation of empirical evidence,
makes testable predictions concerning both the presence and the quality of experience, and
permits inferences and extrapolations. IIT 4.0 incorporates several developments of the
past ten years, including a more accurate formulation of the axioms as postulates and math-
ematical expressions, the introduction of a unique measure of intrinsic information that is
consistent with the postulates, and an explicit assessment of causal relations. By fully
unfolding a systems irreducible causeeffect power, the distinctions and relations specified
by a substrate can account for the quality of experience.
Author summary
As a theory of consciousness, IIT aims to answer two questions: 1) Why is experience
present vs. absent? and 2) Why do specific experiences feel the way they do? The theorys
starting point is the existence of experience. IIT then aims to account for phenomenal
existence and its essential properties in physical terms. It concludes that a substrate—a set
of interacting units—can support consciousness if it can take and make a difference for
itself (intrinsicality), select a specific cause and effect as an irreducible whole with a
PLOS COMPUTATIONAL BIOLOGY
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
1 / 45
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPEN ACCESS
Citation: Albantakis L, Barbosa L, Findlay G,
Grasso M, Haun AM, Marshall W, et al. (2023)
Integrated information theory (IIT) 4.0: Formulating
the properties of phenomenal existence in physical
terms. PLoS Comput Biol 19(10): e1011465.
https://doi.org/10.1371/journal.pcbi.1011465
Editor: Lyle J. Graham, Universite´ Paris Descartes,
Centre National de la Recherche Scientifique,
FRANCE
Received: January 11, 2023
Accepted: August 26, 2023
Published: October 17, 2023
Copyright: © 2023 Albantakis et al. This is an open
access article distributed under the terms of the
Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: There are no primary
data in the paper; the code used to produce the
results and analyses presented in this manuscript
is available at https://github.com/wmayner/pyphi/
tree/feature/iit-4.0/pyphi.
Funding: This project was made possible through
the support of a grant from Templeton World
Charity Foundation (TWCF0216, G.T.). In addition,
this research was supported by the David P White
Chair in Sleep Medicine at the University of
definite border and grain, and specify a structure of causes and effects through subsets of
its units. To that end, IIT provides a mathematical formalism that can be employed to
“unfold’’ the substrates causeeffect structure. This allows IIT to answer the two questions
above: 1) Experience is present for any substrate that fulfills the essential properties of
existence, and 2) specific experiences feel the way they do because of the specific cause-
effect structure specified by their substrates. The theory is consistent with neurological
data, and some of its core principles have been successfully tested empirically.
Introduction
A scientific theory of consciousness should account for experience, which is subjective, in
objective terms [1]. Being conscious—having an experience—is understood to mean that
“there is something it is like to be” [2]: something it is like to see a blue sky, hear the ocean
roar, dream of a friends face, imagine a melody flow, contemplate a choice, or reflect on the
experience one is having.
IIT aims to account for phenomenal properties—the properties of experience—in physical
terms. IITs starting point is experience itself rather than its behavioral, functional, or neural
correlates [1]. Furthermore, in IIT “physical” is meant in a strictly operational sense—in terms
of what can be observed and manipulated.
The starting point of IIT is the existence of an experience, which is immediate and irrefut-
able [3]. From this “zeroth” axiom, IIT sets out to identify the essential properties of conscious-
ness—those that are immediate and irrefutably true of every conceivable experience. These are
IITs five axioms of phenomenal existence: every experience is for the experiencer (intrinsical-
ity), specific (information), unitary (integration), definite (exclusion), and structured
(composition).
Unlike phenomenal existence, which is immediate and irrefutable (an axiom), physical exis-
tence is an explanatory construct (a postulate), and it is assessed operationally (from within
consciousness): in physical terms, to be is to have causeeffect power. In other words, some-
thing can be said to exist physically if it can “take and make a difference”—bear a cause and
produce an effect—as judged by a conscious observer/manipulator.
The next step of IIT is to formulate the essential phenomenal properties (the axioms) in
terms of corresponding physical properties (the postulates). This formulation is an “inference
to a good explanation” and rests on basic assumptions such as realism, physicalism, and atom-
ism (see Box 1: Methodological guidelines of IIT). If IIT is correct, the substrate of conscious-
ness (see (1) in S1 Notes), beyond having causeeffect power (existence), must satisfy all five
essential phenomenal properties in physical terms: its causeeffect power must be for itself
(intrinsicality), specific (information), unitary (integration), definite (exclusion), and struc-
tured (composition).
On this basis, IIT proposes a fundamental explanatory identity: an experience is identical to
the causeeffect structure unfolded from a maximal substrate (defined below). Accordingly, all
the specific phenomenal properties of any experience must have a good explanation in terms
of the specific physical properties of the corresponding causeeffect structure, with no addi-
tional ingredients.
Based again on “inferences to a good explanation” (see Box 1), IIT formulates the postulates
in a mathematical framework that is in principle applicable to general models of interacting
units (but see (2) in S1 Notes). A mathematical framework is needed (a) to evaluate whether
the theory is self-consistent and compatible with our overall knowledge about the world, (b) to
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
2 / 45
Wisconsin-Madison, by the Tiny Blue Dot
Foundation (UW 133AAG3451; G.T.), and by the
Natural Science and Engineering Research Council
of Canada (NSERC; RGPIN-2019-05418; W.M.). L.
A. also acknowledges the support of a grant from
the Templeton World Charity Foundation (TWCF-
2020-20526, L.A.). The funders had no role in
study design, data collection and analysis, decision
to publish, or preparation of the manuscript.
Competing interests: I have read the journals
policy and the authors of this manuscript have the
following competing interests: G.T. holds an
executive position and has a financial interest in
Intrinsic Powers, Inc., a company whose purpose
is to develop a device that can be used in the clinic
to assess the presence and absence of
consciousness in patients. This does not pose any
conflict of interest with regard to the work
undertaken for this publication.
make specific predictions regarding the quality and quantity of our experiences and their sub-
strate within the brain, and (c) to extrapolate from our own consciousness to infer the presence
(or absence) and nature of consciousness in beings different from ourselves.
Ultimately, the theory should account for why our consciousness depends on certain por-
tions of the world and their state, such as certain regions of the brain and not others, and for
why it fades during dreamless sleep, even though the brain remains active. It should also
account for why an experience feels the way it does—why the sky feels extended, why a melody
feels flowing in time, and so on. Moreover, the theory makes several predictions concerning
both the presence and the quality of experience, some of which have been and are being tested
empirically [4].
While the main tenets of the theory have remained the same, its formal framework has
been progressively refined and extended [58]. Compared to IIT 1.0 [5, 6], 2.0 [7, 9], and 3.0
[8], IIT 4.0 presents a more complete, self-consistent formulation and incorporates several
recent advances [1013]. Chief among them are a more accurate formulation of the axioms as
postulates and mathematical expressions, the introduction of an Intrinsic Difference (ID) mea-
sure [12, 14] that is uniquely consistent with IITs postulates, and the explicit assessment of
causal relations [11].
In what follows, after introducing IITs axioms and postulates, we provide its updated
mathematical formalism. In the “Results and discussion” section, we apply the mathematical
framework of IIT to representative examples and discuss some of their implications. The arti-
cle is meant as a reference for the theorys mathematical formalism, a concise demonstration
of its internal consistency, and an illustration of how a substrates causeeffect structure is
unfolded computationally. A discussion of the theorys motivation, its axioms and postulates,
and its assumptions and implications can be found in a forthcoming book (see (3) in S1 Notes)
and wiki [15] as well as in several publications [1, 1621]. A survey of the explanatory power
and experimental predictions of IIT can be found in [4]. The way IITs analysis of causeeffect
power can be applied to actual causation, or “what caused what,” is presented in [10].
From phenomenal axioms to physical postulates
Axioms of phenomenal existence
That experience exists—that “there is something it is like to be”—is immediate and irrefutable,
as everybody can confirm, say, upon awakening from dreamless sleep. Phenomenal existence
is immediate in the sense that my experience is simply there, directly rather than indirectly: I
do not need to infer its existence from something else. It is irrefutable because the very doubt-
ing that my experience exists is itself an experience that exists—the experience of doubting [1,
3]. Thus, to claim that my experience does not exist is self-contradictory or absurd. The exis-
tence of experience is IITs zeroth axiom.
Existence Experience exists: there is something.
Traditionally, an axiom is a statement that is assumed to be true, cannot be inferred from
any other statement, and can serve as a starting point for inferences. The existence of experi-
ence is the ultimate axiom—the starting point for everything, including logic and physics.
On this basis, IIT proceeds by considering whether experience—phenomenal existence—
has some axiomatic or essential properties, properties that are immediate and irrefutably true
of every conceivable experience. Drawing on introspection and reason, IIT identifies the fol-
lowing five:
Intrinsicality Experience is intrinsic: it exists for itself.
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
3 / 45
Information Experience is specific: it is this one.
Integration Experience is unitary: it is a whole, irreducible to separate experiences.
Exclusion Experience is definite: it is this whole.
Composition Experience is structured: it is composed of distinctions and the relations that
bind them together, yielding a phenomenal structure that feels the way it feels.
To exemplify, if I awaken from dreamless sleep and experience the white wall of my room,
my bed, and my body, the experience not only exists, immediately and irrefutably, but 1) it
exists for me, not for something else, 2) it is specific (this one experience, not a generic one), 3)
it is unitary (the left side is not experienced separately from the right side, and vice versa), 4) it
is definite (it includes the visual scene in front of me—neither less, say, its left side only, nor
more, say, the wall behind my head), 5) it is structured by distinctions (the wall, the bed, the
body) and relations (the body is on the bed, the bed in the room), which make it feel the way it
does and not some other way.
The axioms are not only immediately given, but they are irrefutably true of every conceiv-
able experience. For example, once properly understood, the unity of experience cannot be
refuted. Trying to conceive of an experience that were not unitary leads to conceiving of two
separate experiences, each of which is unitary, which reaffirms the validity of the axiom. Even
though each of the axioms spells out an essential property in its own right, the axioms must be
considered together to properly characterize phenomenal existence.
IIT takes the above set of axioms to be complete: there are no further properties of experi-
ence that are essential. Other properties that might be considered as candidates for axiomatic
status include space (experience typically takes place in some spatial frame), time (an experi-
ence usually feels like it flows from a past to a future), change (an experience usually transitions
or flows into another), subjectobject distinction (an experience seems to involve both a sub-
ject and an object), intentionality (experiences usually refer to something in the world, or at
least to something other than the subject), a sense of self (many experiences include a reference
to ones body or even to ones narrative self), figureground segregation (an experience usually
includes some object and some background), situatedness (an experience is often bound to a
time and a place), will (experience offers the opportunity for action), and affect (experience is
often colored by some mood), among others. However, experiences lacking each of these can-
didate properties are conceivable—that is, conceiving of them does not lead to self-contradic-
tion or absurdity. They are also achievable, as revealed by altered states of consciousness
reached through dreaming, meditative practices, or drugs.
Postulates of physical existence
To account for the many regularities of experience (Box 1), it is a good inference to assume
the existence of a world that persists independently of ones experience (realism). From
within consciousness, we can probe the physical existence of things outside of our experience
operationally—through observations and manipulations. To be granted physical existence,
something should have the power to “take a difference” (be affected) and “make a difference”
(produce effects) in a reliable way (physicalism). IIT also assumes “operational reduction-
ism,” which means that, ideally, to establish what exists in physical terms, one would start
from the smallest units that can take and make a difference, so that nothing is left out
(atomism).
By characterizing physical existence operationally as causeeffect power, IIT can proceed to
formulate the axioms of phenomenal existence as postulates of physical existence. This
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
4 / 45
establishes the requirements for the substrate of consciousness, where “substrate” is meant
operationally as a set of units that can be observed and manipulated.
Existence The substrate of consciousness can be characterized operationally by causeeffect
power: its units must take and make a difference.
Building from this “zeroth” postulate, IIT formulates the five axioms in terms of postulates
of physical existence that must be satisfied by the substrate of consciousness:
Intrinsicality Its causeeffect power must be intrinsic: it must take and make a difference
within itself.
Information Its causeeffect power must be specific: it must be in this state and select this
causeeffect state.
This state is the one with maximal intrinsic information (ii), a measure of the difference a
system takes or makes over itself for a given cause state and effect state.
Integration Its causeeffect power must be unitary: it must specify its causeeffect state as a
whole set of units, irreducible to separate subsets of units.
Irreducibility is measured by integrated information (φ) over the substrates minimum
partition.
Exclusion Its causeeffect power must be definite: it must specify its causeeffect state as this
whole set of units.
This is the set of units that is maximally irreducible, as measured by maximum φ (φ*). This
set is called a maximal substrate, also known as a complex [8, 13].
Composition Its causeeffect power must be structured: subsets of its units must specify
causeeffect states over subsets of units (distinctions) that can overlap with one another
(relations), yielding a causeeffect structure or Φ-structure (“Phi-structure”) that is the way
it is.
Distinctions and relations, in turn, must also satisfy the postulates of physical existence:
they must have causeeffect power, within the substrate of consciousness, in a specific, unitary,
and definite way (they do not have components, being components themselves). They thus
have an associated φ value. The Φ-structure unfolded from a complex corresponds to the qual-
ity of consciousness. The sum total of the φ values of the distinctions and relations that com-
pose the Φ-structure measures its structure integrated information Φ (“big Phi,” “structure
Phi”) and corresponds to the quantity of consciousness.
According to IIT, the physical properties characterized by the postulates are necessary and
sufficient for an entity to be conscious. They are necessary because they are needed to account
for the properties of experience that are essential, in the sense that it is inconceivable for an
experience to lack any one of them. They are also sufficient because no additional property of
experience is essential, in the sense that it is conceivable for an experience to lack that property.
Thus, no additional physical property is a necessary requirement for being a substrate of
consciousness.
The postulates of IIT have been and are being applied to account for the location of the sub-
strate of consciousness in the brain [4] and for its loss and recovery in physiological and patho-
logical conditions [22, 23].
The explanatory identity between experiences and Φ-structures
Having determined the necessary and sufficient conditions for a substrate to support con-
sciousness, IIT proposes an explanatory identity: every property of an experience is accounted
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
5 / 45
for in full by the physical properties of the Φ-structure unfolded from a maximal substrate (a
complex) in its current state, with no further or “ad hoc” ingredients. That is, there must be a
one-to-one correspondence between the way the experience feels and the way distinctions and
relations are structured. Importantly, the identity is not meant as a correspondence between
the properties of two separate things. Instead, the identity should be understood in an explana-
tory sense: the intrinsic (subjective) feeling of the experience can be explained extrinsically
(objectively, i.e., operationally or physically) in terms of causeeffect power (see (4) in S1
Notes).
The explanatory identity has been applied to account for how space feels (spatial extended-
ness) and which neural substrates may account for it [11]. Ongoing work is applying the iden-
tity to provide a basic account of the feeling of temporal flow [24] and that of objects [25].
Box 1. Methodological guidelines of IIT
Inference to a good explanation
We should generally assume that an explanation is good if it can account for a broad set
of facts (scope), does so in a unified manner (synthesis), can explain facts precisely (speci-
ficity), is internally coherent (self-consistency), is coherent with our overall understand-
ing of things (system consistency), is simpler than alternatives (simplicity), and can make
testable predictions (scientific validation). For example, IIT 4.0 aims at expressing the
postulates of intrinsicality, information, integration, and exclusion in a self-consistent
manner when applied to systems, causal distinctions, and relations (see formulas).
Realism
We should assume that something exists (and persists) independently of our own experi-
ence. This is a much better hypothesis than solipsism, which explains nothing and pre-
dicts nothing. Although IIT starts from our own phenomenology, it aims to account for
the many regularities of experience in a way that is fully consistent with realism.
Operational physicalism
To assess what exists independently of our own experience, we should employ an opera-
tional criterion: we should systematically observe and manipulate a substrates units and
determine that they can indeed take and make a difference in a way that is reliable.
Doing so demonstrates a substrates causeeffect power—the signature of physical exis-
tence. Ideally, causeeffect power is fully captured by a substrates transition probability
matrix (TPM) (1). This assumption is embedded in IITs zeroth postulate.
Operational reductionism (“atomism”)
Ideally, we should account for what exists physically in terms of the smallest units we
can observe and manipulate, as captured by unit TPMs. Doing so would leave nothing
unaccounted for. IIT assumes that, in principle, it should be possible to account for
everything purely in terms of causeeffect power—causeeffect power “all the way
down” to conditional probabilities between atomic units (see (5) in S1 Notes). Eventu-
ally, this would leave neither room nor need to assume intrinsic properties or laws.
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
6 / 45
Overview of IITs framework
IIT 4.0 aims at providing a formal framework to characterize the causeeffect structure of a
substrate in a given state by expressing IITs postulates in mathematical terms. In line with
operational physicalism (Box 1), we characterize a substrate by the transition probability func-
tion of its constituting units.
On this basis, the IIT formalism first identifies sets of units that fulfill all required properties
of a substrate of consciousness according to the postulates of physical existence. First, for a
candidate system, we determine a maximal causeeffect state based on the intrinsic informa-
tion (ii) that the system in its current state specifies over its possible cause states and effect
states. We then determine the maximal substrate based on the integrated information (φs, “sys-
tem phi”) of the maximal causeeffect state. To qualify as a substrate of consciousness, a candi-
date system must specify a maximum of integrated information (φ∗
s) compared to all
competing candidate systems with overlapping units.
The second part of the IIT formalism unfolds the causeeffect structure specified by a maxi-
mal substrate in its current state, its Φ-structure. To that end, we determine the distinctions
and relations specified by the substrates subsets according to the postulates of physical exis-
tence. Distinctions are causeeffect states specified over subsets of substrate units (purviews)
by subsets of substrate units (mechanisms). Relations are congruent overlaps among distinc-
tions cause and/or effect states. Distinctions and relations are also characterized by their inte-
grated information (φd, φr). The Φ-structure they compose corresponds to the quality of the
experience specified by the substrate; the sum of their φd/r values corresponds to its quantity
(Φ).
While IIT must still be considered as work in progress, having undergone successive refine-
ments, IIT 4.0 is the first formulation of IIT that strives to characterize Φ-structures completely
and to do so based on measures that satisfy the postulates uniquely. For a comparison of the
updated framework with IIT 1.0, 2.0, and 3.0, see S2 Text.
Intrinsic perspective
When accounting for experience itself in physical terms, existence should be evaluated
from the intrinsic perspective of an entity—what exists for the entity itself—not from the
perspective of an external observer. This assumption is embedded in IITs postulate of
intrinsicality and has several consequences. One is that, from the intrinsic perspective,
the quality and quantity of existence must be observer-independent and cannot be arbi-
trary. For instance, information in IIT must be relative to the specific state the entity is
in, rather than an average of states as assessed by an external observer. Similarly, it
should be evaluated based on the uniform distribution of possible states, as captured by
the entitys TPM (1), rather than on an observed probability distribution. By the same
token, units outside the entity should be treated as background conditions that do not
contribute directly to what the system is. The intrinsic perspective also imposes a tension
between expansion and dilution (see below and [12, 14]): from the intrinsic perspective
of a system (or a mechanism within the system), having more units may increase its
informativeness (causeeffect power measured as deviation from chance), while at the
same time diluting its selectivity (ability to concentrate causeeffect power over a specific
state).
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
7 / 45
Substrates, transition probabilities, and causeeffect power
IIT takes physical existence as synonymous with having causeeffect power, the ability to take
and make a difference. Consequently, a substrate U with state space OU is operationally
defined by its potential interactions, assessed in terms of conditional probabilities (physical-
ism, Box 1). We denote the complete transition probability function of a substrate U over a sys-
tem update u ! u as
T U u j uÞ;
u; u 2 OU:
ð1Þ
A substrate in IIT can be described as a stochastic system U = {U1, U2, . . ., Un} of n interacting
units with state space OU ¼ Q
i OUi and current state u 2 OU. We define units in state u as a set
of tuples, where each tuple contains the unit and the state of the unit, i.e., u = {(Ui, state(Ui)) :
Ui 2 U}. This allows us to define set operations over u that consider both the units and their
states. OU is the set of all possible such tuple sets, corresponding to all the possible states of U.
We assume that the system updates in discrete steps, that the state space OU is finite, and that
the individual random variables Ui 2 U are conditionally independent from each other given
the preceding state of U:
u j uÞ ¼
Y
n
i¼1
ui j uÞ:
ð2Þ
Finally, we assume a complete description of the substrate, which means that we can determine
the conditional probabilities in (2) for every system state, with pðu j uÞ ¼ pðu j doðuÞÞ [10,
2628], where the “do-operator” do(u) indicates that u is imposed by intervention. This
implies that U must correspond to a causal network [10], and T U is a transition probability
matrix (TPM) of size |OU| (see (6) in S1 Notes).
The TPM T U, which forms the starting point of IITs analysis, serves as an overall descrip-
tion of a systems causal evolution under all possible interventions: what is the probability that
the system will transition into each of its possible states upon being initialized into every possi-
ble state (Fig 1)? (Notably, there is no additional role for intrinsic physical properties or laws of
nature.) In practice, a causal model will be neither complete nor atomic (capturing the smallest
units that can be observed and manipulated), but will capture the relevant features of what we
are trying to explain and predict (see (7) in S1 Notes).
In the “Results and discussion” section, the IIT formalism will be applied to extremely sim-
ple, simulated networks, rather than causal models of actual substrates. The causeeffect struc-
tures derived from these simple networks only serve as convenient illustrations of how a
hypothetical substrates causeeffect power can be unfolded.
Implementing the postulates
In what follows, our goal is to evaluate whether a hypothetical substrate (also called “system”)
satisfies all the postulates of IIT. To that end, we must verify whether the system has cause
effect power that is intrinsic, specific, integrated, definite, and structured.
Existence.
According to IIT, existence understood as causeeffect power requires the
capacity to both take and make a difference (see Box 2, Principle of being). On the basis of a
complete description of the system in terms of interventional conditional probabilities (T U)
(1), causeeffect power can be quantified as causal informativeness. Cause informativeness
measures how much a potential cause increases the probability of the current state, and effect
informativeness how much the current state increases the probability of a potential effect (as
compared to chance).
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
8 / 45
Intrinsicality.
Building upon the existence postulate, the intrinsicality postulate further
requires that a system exerts causeeffect power within itself. In general, the systems we want
to evaluate are open systems S U that are part of a larger “universe” U. From the intrinsic
perspective of a system S (see Box 1), the set of the remaining units W = U\S merely act as
background conditions that do not contribute directly to causeeffect power. To enforce this,
we causally marginalize the background units, conditional on the current state of the universe,
rendering them causally inert (see “Identifying substrates of consciousness” for details).
Information.
The information postulate requires that a systems causeeffect power be
specific: the system in its current state must select a specific causeeffect state for its units.
Based on the principle of maximal existence (Box 2), this is the state for which intrinsic infor-
mation is maximal—the maximal causeeffect state. Intrinsic information (ii) measures the dif-
ference a system takes or makes over itself for a given cause and effect state as the product of
Fig 1. Identifying substrates of consciousness through the postulates of existence, intrinsicality, information, integration, and exclusion. (A) The substrate S =
aBC in state (1, 1, 1) (lowercase letters for units indicated state “−1,” uppercase letters state “+1”) is the starting point for applying the postulates. The substrate
updates its state according to the depicted transition probability matrix (TPM) (gray shading indicates probability value from white (p = 0) to black (p = 1); each unit
follows a logistic equation (see “Results” for definition) with k = 4.0 and connection weights as indicated in the causal model). Existence requires that the substrate
must have causeeffect power, meaning that the TPM among substrate states must differ from chance. (B) Intrinsicality requires that a candidate substrate, for
example, units aB, has causeeffect power over itself. Units outside the candidate substrate (in this case, unit C) are treated as background conditions. The
corresponding cause and effect TPMs (Tc and Te) of system aB are depicted on the right. (C) Information requires that the candidate substrate aB selects a specific
causeeffect state (s0). This is the cause state (red) and effect state (green) for which intrinsic information (ii) is maximal. Bar plots on the right indicate the three
probability terms relevant for computing iic (7) and iie (5): the selectivity (light colored bar), as well as the constrained (dark colored bar) and unconstrained (gray bar)
effect probabilities in the informativeness term. (D) Integration requires that the substrate specifies its causeeffect state irreducibly (“as one”). This is established by
identifying the minimum partition (MIP; θ0) and measuring the integrated information of the system (φs)—the minimum between cause integrated information (φc)
and effect integrated information (φe). Here, gray bars represent the partitioned probability required for computing φc (20) and φe (19). (E) Exclusion requires that the
substrate of consciousness is definite, including some units and excluding others. This is established by identifying the candidate substrate with the maximum value of
system integrated information (φ∗
s )—the maximal substrate, or complex. In this case, aB is a complex since its system integrated information (φs = 0.17) is higher than
that of all other overlapping systems (for example, subset a with φs = 0.04 and superset aBC with φs = 0.13).
https://doi.org/10.1371/journal.pcbi.1011465.g001
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
9 / 45
informativeness and selectivity. As we have seen (existence), informativeness quantifies the
causal power of a system in its current state as a reduction of uncertainty with respect to
chance. Selectivity measures how much causeeffect power is concentrated over that specific
cause or effect state. Selectivity is reduced by uncertainty in the cause or effect state with
respect to other potential cause and effect states.
From the intrinsic perspective of the system, the product of informativeness and selectivity
leads to a tension between expansion and dilution, whereby a system comprising more units
may show increased deviation from chance but decreased concentration of causeeffect power
over a specific state [12, 14].
Integration.
By the integration postulate, it is not sufficient for a system to have cause
effect power within itself and select a specific causeeffect state: it must also specify its maximal
causeeffect state in a way that is irreducible. This can be assessed by partitioning the set of
units that constitute the system into separate parts. The system integrated information (φs)
then quantifies how much the intrinsic information specified by the maximal state is reduced
due to the partition (see (8) in S1 Notes). Integrated information is evaluated over the partition
that makes the least difference, the minimum partition (MIP), in accordance with the principle
of minimal existence (see Box 2).
Integrated information is highly sensitive to the presence of fault lines—partitions that sep-
arate parts of a system that interact weakly or directionally [13].
Exclusion.
Many overlapping sets of units may have a positive value of integrated infor-
mation (φs). However, the exclusion postulate requires that the substrate of consciousness
must be constituted of a definite set of units, neither less nor more. Moreover, units, updates,
and states must have a definite grain. Operationally, the exclusion postulate is enforced by
selecting the set of units that maximizes integrated information over itself (φ∗
s), based again on
the principle of maximal existence (see Box 2). That set of units is called a maximal substrate,
or complex. Over a universal substrate, sets of units for which integrated information is maxi-
mal compared to all competing candidate systems with overlapping units can be assessed
recursively (by identifying the first complex, then the second complex, and so on).
Composition.
Once a complex has been identified, composition requires that we charac-
terize its causeeffect structure by considering all its subsets and fully unfolding its causeeffect
power.
Usually, causal models are conceived in holistic terms, as state transitions of the system as a
whole (1), or in reductionist terms, as a description of the individual units of the system and
their interactions (2) [29]. However, to account for the structure of experience, considering only
the causeeffect power of the individual units or of the system as a whole would be insufficient
[17, 29]. Instead, by the composition postulate, we have to evaluate the systems causeeffect
structure by considering the causeeffect power of its subsets as well as their causal relations.
To contribute to the causeeffect structure of a complex, a system subset must both take and
make a difference (as required by existence) within the system (as required by intrinsicality). A
subset M S in state m 2 OM is called a mechanism if it links a cause and effect state over sub-
sets of units Zc/e S, called purviews. A mechanism together with the cause and effect state it
specifies is called a causal distinction. Distinctions are evaluated based on whether they satisfy
all the postulates of IIT (except for composition). For every mechanism, the causeeffect state is
the one having maximal intrinsic information (ii), and the cause and effect purviews are those
yielding the maximum value of integrated information (φd) within the complex—that is, those
that are maximally irreducible. By the information postulate, the causeeffect power of a com-
plex must be specific, which means that it selects a specific causeeffect state at the system level.
Consequently, the distinctions that exist for the complex are only those whose causeeffect state
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
10 / 45
is congruent with the causeeffect state of the complex as a whole (incongruent distinctions are
not components of the complex and its specific causeeffect power because they would violate
the specificity postulate, according to which the experience can only be “this one”).
Distinctions whose cause or effect states overlap congruently within the system (over the
same subset of units in the same state) are bound together by causal relations. Relations also
have an associated value of integrated information (φr), corresponding to their irreducibility.
Together, these distinctions and relations compose the causeeffect structure of the complex
in its current state. The causeeffect structure specified by a complex is called a Φ-structure.
The sum of its distinction and relation integrated information amounts to the structure inte-
grated information (Φ) of the complex.
In the following, we will provide a formal account of the IIT analysis. The first part demon-
strates how to identify complexes. This requires that we (a) determine the causeeffect state of
a system in its current state, (b) evaluate the system integrated information (φs) over that
causeeffect state, and (c) search iteratively for maxima of integrated information (φ∗
s) within a
universe. The second part describes how the postulates of IIT are applied to unfold the cause
effect structure of a complex. This requires that we identify the causal distinctions specified by
subsets of units within the complex and the causal relations determined by the way distinctions
overlap, yielding the systems Φ-structure and its structure integrated information (Φ).
Box 2. Ontological principles of IIT
Principle of being
The principle of being states that to be is to have causeeffect power. In other words, in
physical, operational terms, to exist requires being able to take and make a difference.
The principle is closely related to the so-called Eleatic principle, as found in Platos Soph-
ist dialogue [30]: “I say that everything possessing any kind of power, either to do any-
thing to something else, or to be affected to the smallest extent by the slightest cause,
even on a single occasion, has real existence: for I claim that entities are nothing else but
power.” A similar principle can be found in the work of the Buddhist philosopher Dhar-
makīrti: “Whatever has causal powers, that really exists.” [31] Note that the Eleatic prin-
ciple is enunciated as a disjunction (either to do something. . . or to be affected. . .),
whereas IITs principle of being is presented as a conjunction (take and make a
difference).
Principle of maximal existence
The principle of maximal existence states that, when it comes to a requirement for exis-
tence, what exists is what exists the most. The principle is offered by IIT as a good expla-
nation for why the system state specified by the complex and the causeeffect states
specified by its mechanisms are what they are. It also provides a criterion for determin-
ing the set of units constituting a complex—the one with maximally irreducible cause
effect power—for determining the subsets of units constituting the distinctions and rela-
tions that compose its causeeffect structure, and for determining the units grain. To
exemplify, consider a set of candidate complexes overlapping over the same substrate.
By the postulates of integration and exclusion, a complex must be both unitary and defi-
nite. By the maximal existence principle, the complex should be the one that lays the
greatest claim to existence as one entity, as measured by system integrated information
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
11 / 45
Identifying substrates of consciousness
Our starting point is a substrate U in current state u with TPM T U (1). We consider any subset
s u as a possible complex and refer to a set of units S U as a candidate system. (Note that s
and u are sets of tuples containing both the units and their states.).
By the intrinsicality postulate, the units W = U\S are background conditions, and do not
contribute directly to the causeeffect power of the system. To discount the contribution of
background units, they are causally marginalized, conditional on the current state of the uni-
verse. This means that the background units are marginalized based on a uniform marginal
distribution, updated by conditioning on u. The process is repeated separately for each unit in
the system, and they are then combined using a product (in line with conditional indepen-
dence), which eliminates any residual correlations due to the background units. Accordingly,
we obtain two TPMs T e and T c (for evaluating effects and causes, respectively) for the candi-
date system S. For evaluating effects, the state of the background units is fully determined by
the current state of the universe. The corresponding TPM, T e, is used to identify the effect of
the current state:
T e ¼ T eðT U; u; wÞ peðs j sÞ ¼ pðs j s; wÞ;
s;s 2 OS;
ð3Þ
where w = u\s. For evaluating causes, knowledge of the current state is used to compute the
probability distribution over potential prior states of the background units, which is not neces-
sarily uniform or deterministic. The corresponding TPM, T c, is used to evaluate the cause of
the current state:
T c ¼ T cðT U; u; wÞ pcðs j sÞ ¼
Y
jSj
i¼1
X
w
pðsi j s;
P
^spðu j ^s;
P
^upðu j ^uÞ
;
s;s 2 OS:
ð4Þ
(φs). For the same reason, candidate complexes that overlap over the same substrate but
have a lower value of φs are excluded from existence. In other words, if having maximal
φs is the reason for assigning existence as a unitary complex to a set of units, it is also the
reason to exclude from existence any overlapping set not having maximal φs.
Principle of minimal existence
Another key principle of IIT is the principle of minimal existence, which complements
that of maximal existence. The principle states that, when it comes to a requirement for
existence, nothing exists more than the least it exists. The principle is offered by IIT as a
good explanation for why, given that a system can only exist as one system if it is irreduc-
ible, its degree of irreducibility should be assessed over the partition across which it is
least irreducible (the minimum partition). Similarly, a distinction within a system can
only exist as one distinction to the extent that it is irreducible, and its degree of irreduc-
ibility should be assessed over the partition across which it is least irreducible. Moreover,
a set of units can only exist as a system, or as a distinction within the system, if it specifies
both an irreducible cause and an irreducible effect, so its degree of irreducibility should
be the minimum between the irreducibility on the cause side and on the effect side (see
(9) in S1 Notes).
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
12 / 45
In both TPMs, the background units W are rendered causally inert, so that causes and effects
are evaluated from the intrinsic perspective of the system.
The intrinsic information iic/e is a measure of the intrinsic cause or effect power exerted by
a system S in its current state s over itself by selecting a specific cause or effect state s. The
causeeffect state for which intrinsic information (iic and iie) is maximal is called the maximal
causeeffect state s0 ¼ fs0
c; s0
eg. The integrated information φs is a measure of the irreducibility
of a causeeffect state, compared to the directional system partition θ0 that affects the maximal
causeeffect state the least (minimum partition, or MIP). Systems for which integrated infor-
mation is maximal (φ∗
s) compared to any competing candidate system with overlapping units
are called maximal substrates, or complexes.
The IIT 4.0 formalism to measure a systems integrated information φs and to identify max-
imal substrates was first presented in [13]. An example of how to identify complexes in a sim-
ple system is given in Fig 1, while a comparison with prior accounts (IIT 1.0, IIT 2.0, and IIT
3.0) can be found in S2 Text. An outline of the IIT algorithm is included in S1 Fig.
Existence, intrinsicality, and information: Determining the maximal
causeeffect state of a candidate system
Given a causal model with corresponding TPMs T e (3) and T c (4), we wish to identify the
maximal causeeffect state specified by a system in its current state over itself and to quantify
the causal power with which it does so. In this way, we quantify the causeeffect power of a sys-
tem from its intrinsic perspective, rather than from the perspective of an outside observer (see
Box 1).
System intrinsic information ii. Intrinsic information iiðs;sÞ measures the causal power
of a system S over itself, for its current state s, over a specific cause or effect state s. Intrinsic
information depends on interventional conditional probabilities and unconstrained probabili-
ties of cause or effect states and is the product of selectivity and informativeness.
On the effect side, intrinsic effect information iie of the current state s over a possible effect
state s is defined as:
iieðs;sÞ ¼ peðs j sÞ log
peðs j sÞ
peð
;
ð5Þ
where peðs j sÞ (3) is the interventional conditional probability that the current state s produces
the effect state s, as indicated by T e.
The interventional unconstrained probability peð
peðsÞ ¼ jOSj
1X
s2OS
peðs j sÞ;
ð6Þ
is defined as the marginal probability of s, averaged across all possible current states of S with
equal probability (where |OS| denotes the cardinality of the state space OS).
On the cause side, intrinsic cause information iic of the current state s over a possible cause
state s is defined as:
iicðs;sÞ ¼ p
c ðs j sÞ log
pcðs j
pcðsÞ
;
ð7Þ
where pcðs;sÞ (4) is the interventional conditional probability that the cause state s produces
the current state s, as indicated by T c, and the interventional unconstrained probability is
again defined as the marginal probability of s, averaged across all possible cause states of S with
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
13 / 45
equal probability,
pcðsÞ ¼ jOSj
1X
s2OS
pcðs j sÞ:
ð8Þ
Moreover, p
c ðs j sÞ (4) is the interventional conditional probability that the current state
s 2 OS was produced by s; it is derived from T c using Bayes rule, where we again assign a uni-
form prior to the possible cause states s,
p
c ðs j sÞ ¼ pcðs j jOSj
1
pcðsÞ
¼
pcðs j
X
^s2OS
pcðs j ^sÞ
:
ð9Þ
Informativeness (over chance).
In (5) and (7), the logarithmic term (in base 2 through-
out) is called informativeness. Note that informativeness is expressed in terms of forward
probabilities (probability of a subsequent state given the current state) for both iie (5) and iic
(7). However, iie (5) evaluates the increase in probability of the effect state due to the current
state based on T e, while iic (7) evaluates the increase in probability of the current state due to
the cause state based on T c.
In line with the existence postulate, a system S in state s has causeeffect power (it takes and
makes a difference) if it raises the probability of a possible effect state compared to chance,
which is to say compared to its unconstrained probability,
log
peðs j sÞ
peð
> 0;
ð10Þ
and if the probability of the current state is raised above chance by a possible cause state,
log
pcðs j
pcðsÞ
> 0:
ð11Þ
Informativeness is additive over the number of units: if a system specifies a cause or effect state
with probability p = 1, its causal power increases additively with the number of units whose
states it fully specifies (expansion), given that the chance probability of all states decreases
exponentially.
Selectivity (over states). From the intrinsic perspective of a system, causeeffect power
over a specific cause or effect state depends not only on the deviation from chance it produces,
but also on how its probability is concentrated on that state, rather than being diluted over
other states. This is measured by the selectivity term in front of the logarithmic term in (5) and
(7), corresponding to the conditional probability p
c ðs j sÞ or peðs j sÞ of that specific cause or
effect state. (Note that here, on the cause side, we use the backward probability (probability of
a prior state given the current state) obtained through Bayes rule, while we use the forward
probability of the effect state s given s on the effect side.) Selectivity means that if p < 1, the sys-
tems causal power becomes subadditive (dilution) (see [14] for details). For example, as
shown in [12], if an unconstrained unit is added to a fully specified unit, intrinsic information
does not just stay the same, but decreases exponentially. From the intrinsic perspective of the
system, the informativeness of a specific cause or effect state is diluted because it is spread over
multiple possible states, yet the system must select only one state.
Altogether, taking the product of informativeness and selectivity leads to a tension between
expansion and dilution: a larger system will tend to have higher informativeness than a smaller
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
14 / 45
system because it will deviate more from chance, but it will also tend to have lower selectivity
because it will have a larger repertoire of states to select from.
Because of the selectivity term, intrinsic information is reduced by indeterminism and
degeneracy. As shown in [13], indeterminism decreases the probability of the selected effect
state because it implies that the same state can lead to multiple states. In turn, degeneracy
decreases the probability of the selected cause state because it implies that multiple states can
lead to the same state, even in a deterministic system.
The intrinsic information ii is quantified in units of intrinsic bits, or ibits, to distinguish it
from standard information-theoretic measures (which are typically additive). Formally, the
ibit corresponds to a point-wise information value (measured in bits) weighted by a
probability.
The maximal causeeffect state. Taking the product of informativeness and selectivity
on the systems cause and effect sides captures the postulates of existence (taking and making a
difference) and intrinsicality (taking and making a difference over itself) for each possible
cause or effect state, as measured by intrinsic information. However, the information postulate
further requires that the system selects a specific cause or effect state. The selection is deter-
mined by the principle of maximal existence (Box 1): the cause or effect specified by the system
should be the one that maximizes intrinsic information. On the effect side (and similarly for
the cause side, see S1 Fig),
s0
eðT e; sÞ
¼ argmax
s2OS
iieðs;
¼ argmax
s2OS
peðs j sÞ log
peðs j sÞ
peð
:
ð12Þ
The systems intrinsic effect information is the value of iie (5) for its maximal effect state:
iieðT e; sÞ ≔ iieðs; s0
eÞ ¼ max
s2OS peðs j sÞ log
peðs j sÞ
peð
:
ð13Þ
We have made the dependency of s0 and iie on T e explicit in (12) and (13) to highlight that, for
intrinsic information to properly assess causeeffect power, all probabilities must be derived
from the systems interventional transition probability function, while imposing a uniform
prior distribution over all possible system states. If iieðT e; sÞ ¼ 0, the system S in state s has no
causal power. This is the case if and only if peðs j sÞ ¼ peðsÞ for every s [14] (and likewise, it
can be shown that iicðT c; sÞ ¼ 0 if and only if pcðs j sÞ ¼ pcðsÞ for every s.) It is worthwhile to
mention that when iieðT e; sÞ 6¼ 0, the system state s always increases the probability of the
intrinsic effect state compared to chance. Similarly, when iicðT c; sÞ 6¼ 0 the intrinsic cause
state increases the probability of the system state, satisfying (11). Note also that a systems
intrinsic causeeffect state does not necessarily correspond to the actual cause and effect states
(what actually happened before / will happen after) in the dynamical evolution of the system,
which typically also depends on extrinsic influences. (For an account of actual causation
according to the causal principles of IIT, see [10].).
Intrinsic difference.
Because consciousness is the way it is, the formulation of its proper-
ties in physical, operational terms should be unique and based on quantities that uniquely sat-
isfy the postulates [12, 32]. Intrinsic information is formulated as a product of selectivity and
informativeness based on the notion of intrinsic difference (ID) [14]. This is a measure of the
difference between two probability distributions which uniquely satisfies three properties (cau-
sality, intrinsicality, and specificity) that align with the postulates of IIT (but also have inde-
pendent justification):
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
15 / 45
causality (Existence): the measure is zero if and only if the system does not make a difference
intrinsicality (Intrinsicality): the measure increases if the system is expanded without noise
(expansion) and decreases if the system is expanded without signal (dilution)
specificity (Information): the measure reflects the causeeffect power of a specific state over a
specific cause and effect state.
The properties uniquely satisfied by the ID are described in a general mathematical context
in [14], as well as some additional discussion in S2 Text.
Note that, on the effect side, iie is formally equivalent to the ID between the constrained
effect repertoire peðs j sÞ and the unconstrained effect repertoire peðsÞ. On the cause side, the
application of Bayes rule to compute p
c ðs j sÞ as the selectivity term means that iic is not
strictly equivalent to the ID between two probability distributions. However, analogously to
the effect formulation, it is defined as the product of selectivity and informativeness of
causes.
Integration: Determining the irreducibility of a candidate system
Having identified the maximal causeeffect state s0 ¼ fs0
c; s0
eg of a candidate system S in its cur-
rent state s, the next step is to evaluate whether the system specifies the causeeffect state of its
units in a way that is irreducible, as required by the integration postulate: a candidate system
can only be a substrate of consciousness if it is one system—that is, if it cannot be subdivided
into subsets of units that exist separately from one another.
Directional system partitions.
To that end, we define a set of directional system partitions
Θ(S) that divide S into k 2 parts fSðiÞg
k
i¼1, such that
SðiÞ 6¼ ; SðiÞ \ SðjÞ ¼ ; and
[
k
i¼1
SðiÞ ¼ S:
ð14Þ
In words, each part S(i) must contain at least one unit, there must be no overlap between any
two parts S(i) and S(j), and every unit of the system must appear in exactly one part. For each
part S(i), the partition removes the causal connections of that part with the rest of the system
in a directional manner: either the parts inputs, outputs, or both are replaced by indepen-
dent “noise” (they are “cut” by the partition in the sense that their causal powers are substi-
tuted by chance). Directional partitions are necessary because, from the intrinsic perspective
of a system, a subset of units that cannot affect the rest of the system, or cannot be affected
by it, cannot truly be a part of the system. In other words, to be a part of a system, a subset of
units must be able to interact with the rest of the system in both directions (cause and
effect).
A partition θ 2 Θ(S) thus has the form
y ¼ fS
ð1Þ
d1 ; S
ð2Þ
d2 ; . . . ; S
ðkÞ
dk g;
ð15Þ
where δi 2 { , !, $} indicates whether the inputs ( ), outputs (!), or both ($) are cut for
a given part. For each part S(i), we can then identify a set of units X(i) S whose inputs to S(i)
have been cut by the partition, and the complementary set Y(i) = S\X(i) whose inputs to S(i) are
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
16 / 45
left intact. Specifically,
XðiÞ ¼
SnSðiÞ
if di 2 f ; $g
[
j 6¼ i :
dj 2 f!; $g
SðjÞ
if di 2 f!g:
8
>
>
<
>
>
:
ð16Þ
In the first case, if δi 2 { , $}, all inputs to S(i) from S\S(i) are cut. In the second case, if
δi 2 {!}, there may still be inputs to S(i) that are cut, which correspond to the outputs of all S(j)
with δj 2 {!, $}.
Given a partition θ 2 Θ(S), we define partitioned transition probability matrices T
y
e and T
y
c
in which all connections affected by the partition are “noised.” This is done by combining the
independent contributions of each unit Sj 2 S in line with the conditional independence
assumption (2). For the effect TPM (and analogously for the cause TPM)
T
y
e py
s j sÞ ¼
Y
n
j¼1
py
sj j sÞ; s; s 2 OS;
ð17Þ
where the partitioned probability of a unit Sj 2 S(i) is defined as
py
sj j sÞ ¼ jOXðiÞj
1 X
xðiÞ2OXðiÞ
peðsj j xðiÞ; yðiÞÞ;
ð18Þ
and y(i) = s\x(i). This means that all connections to unit Sj that are affected by the partition are
causally marginalized (replaced by independent noise).
System integrated information φs. The integrated effect information φe measures how
much the partition θ 2 ΘS reduces the probability with which a system S in state s 2 OS speci-
fies its effect state s0
e (12),
φeðT e; s; yÞ ¼ peðs0
e j sÞ
log
peðs0
e j sÞ
py
eðs0
e j sÞ
þ
:
ð19Þ
Note that φe has the same form as the intrinsic information iieðs;sÞ (5), with the partitioned
effect probability taking the place of the unconstrained (marginal) probability. Here, |.|+ repre-
sents the positive part operator, which sets the negative values to 0. This ensures that the sys-
tem as a whole raises the probability of the effect state compared to the partitioned probability.
Likewise, the integrated cause information φc is defined as
φcðT c; s; yÞ ¼ p
c ðs0
c j sÞ
log
pcðs j s0
py
cðs j s0
þ
:
ð20Þ
(By the principle of maximal existence, if two or more causeeffect states are tied for maximal
intrinsic information, the system specifies the one that maximizes φc/e.).
By the zeroth postulate, existence requires cause and effect power, and the integration pos-
tulate requires that its causeeffect power be irreducible. By the principle of minimal existence
(Box 2), then, system integrated information for a given partition is the minimum of its irre-
ducibility on the cause and effect sides:
φsðT e; T c; s; yÞ ¼ minfφcðT c; s; yÞ; φeðT e; s; yÞg:
ð21Þ
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
17 / 45
Moreover, again by the principle of minimal existence, the integrated information of a sys-
tem is given by its irreducibility over its minimum partition (MIP) θ0 2 ΘS, such that
φsðT e; T c; sÞ ≔ φsðT e; T c; s; y0Þ:
ð22Þ
The MIP is defined as the partition θ 2 ΘS that minimizes the systems integrated informa-
tion, relative to the maximum possible value it could take for arbitrary TPMs T
0
e; T
0
c over the
units of system S
y0 ¼ argmin
y2YðSÞ
φsðT e; T c; s; yÞ
max
T 0
e;T 0
c
φsðT
0
e; T
0
c; s; yÞ :
ð23Þ
Accordingly, the system is reducible if at least one partition θ 2 ΘS makes no difference to the
cause or effect probability. The normalization term in the denominator of (23) ensures that
φsðT e; T c; sÞ is evaluated fairly over a systems fault lines by assessing integration relative to its
maximum possible value over a given partition. Using the relative integrated information
quantifies the strength of the interactions between parts in a way that does not depend on the
number of parts and their size. As proven in [13], the maximal value of φsðT e; T c; s; yÞ for a
given partition θ is the normalization factor max
T 0
e;T 0
c
φsðT
0
e; T
0
c; s; yÞ ¼
X
k
i¼1
jSðiÞjjXðiÞj, which corre-
sponds to the maximal possible number of “connections” (pairwise interactions) affected by θ.
For example, as shown in [13], the MIP will correctly identify the fault line dividing a system
into two large subsets of units linked through a few interconnected units (a “bridge”), rather
than defaulting to partitions between individual units and the rest of the system. Once the
minimum partition has been identified, the integrated information across it is an absolute
quantity, quantifying the loss of intrinsic information due to cutting the minimum partition of
the system. (If two or more partitions θ 2 Θ(S) minimize Eq (23), we select the partition with
the largest unnormalized φs value as θ0, applying the principle of maximal existence.) Defining
θ0 as in (23), moreover, ensures that φsðT e; T c; sÞ ¼ 0 if the system is not strongly connected in
graph-theoretic terms (see (10) in S1 Notes).
In summary, the system integrated information (φsðT e; T c; sÞ, also called small phi, quan-
tifies the extent to which system S in state s has causeeffect power over itself as one system (i.
e., irreducibly). φsðT e; T c; sÞ is thus a quantifier of irreducible existence.
Exclusion: Determining maximal substrates (complexes)
In general, multiple candidate systems with overlapping units may have positive values of
φsðT e; T c; sÞ. By the exclusion postulate, the substrate of consciousness must be definite; that
is, it must comprise a definite set of units. But which one? Once again, we employ the principle
of maximal existence (Box 2): among candidate systems competing over the same substrate
with respect to an essential requirement for existence, in this case irreducibility, the one that
exists is the one that exists the most. Accordingly, the maximal substrate, or complex, is the
candidate substrate with the maximum value of system integrated information (φ∗
s), and over-
lapping substrates with lower φs are thus excluded from existence.
Determining maximal substrates recursively.
Within a universal substrate U0 in state u0,
subsets of units that specify maxima of irreducible causeeffect power (complexes) can be
identified iteratively: the substrate with maximum φ∗
s is identified as a complex, the corre-
sponding units are excluded from further consideration, the remaining units are searched for
the next maximal substrate. Formally, an iterative search is performed to find a sequence of
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
18 / 45
systems S
k Uk with
φ∗
sðT e; T c; ukÞ ¼ max
SUk φsðT e; T c; sÞ;
ð24Þ
such that
S
k ¼ argmax
SUk
φsðT e; T c; sÞ;
ð25Þ
and Ukþ1 ¼ UknS
k until Uk+1 = ; or Uk+1 = Uk (the units in U0\Uk+1 still serve as background
conditions, for details see [13]). If the maximal substrate S
k is not unique, and all tied systems
overlap, the next best system that is unique is chosen instead (see S1 Text).
For any complex S* in its corresponding state s* 2 OS*, overlapping substrates that specify
less integrated information (φs < φsðT e; T c; s∗Þ) are excluded. Consequently, specifying a
maximum of integrated information φ∗
s compared to all overlapping systems
S \ ~S 6¼ ; ) φsðsÞ > φsð~sÞ; 8S 6¼ ~S U
ð26Þ
is a sufficient requirement for a system S U to be a complex.
As described in [13], this recursive search for maximal substrates “condenses” the universe
U0 in state u0 2 OU0 into a disjoint (non-overlapping) and exhaustive set of complexes—the
first complex, second complex, and so on.
Determining maximal unit grains.
Above, we presented how to determine the borders of
a complex within a larger system U, assuming a particular grain for the units Ui 2 U. In princi-
ple, however, all possible grains should be considered [33, 34]. In the brain, for example, the
grain of units could be brain regions, groups of neurons, individual neurons, sub-cellular
structures, molecules, atoms, quarks, or anything finer, down to hypothetical atomic units of
causeeffect power [3, 4]. For any unit grain—neurons, for example—the grain of updates
could be minutes, seconds, milliseconds, micro-seconds, and so on. However, by the exclusion
postulate, the units that constitute a system S must also be definite, in the sense of having a def-
inite grain.
Once again, the grain is defined by the principle of maximal existence: across the possible
micro- and macroscopic levels, the “winning” grain is the one that ensures maximally irreduc-
ible existence (φ∗
s) for the entity to which the units belong [33, 34].
To evaluate integrated information across grains requires a mathematical framework for
defining coarser (macro) units from finer (micro) units. Such a framework has been developed
in previous work [3335], and is updated here to fully align with the postulates.
Supposing that U = u is a universe of micro units in a state, a macro unit J = j is a combina-
tion of a set of micro units ^S U, and a mapping g from the state ^S to the state of J,
j ¼ gð^sÞ;
where
g : O^S ! OJ:
As constituents of a complex upon which its causeeffect power rests, the units themselves
should comply with the postulates of IIT. Otherwise it would be possible to “make something
out of nothing.” Accordingly, units themselves must also be maximally irreducible, as mea-
sured by the integrated information of the units when they are treated as candidate systems
(φs); otherwise, they would not be units but “disintegrate” into their constituents. However, in
contrast to systems, units only need to be maximally irreducible within, because they do not
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
19 / 45
exist as complexes in their own right: a unit J with substrate ^S qualifies as a candidate unit of a
larger system S if its integrated information when treated as a candidate system (φs) is higher
than that of any system of units (including potential macro units) that can be defined using a
subset of ^S. Out of all possible sets of such candidate units, the set of (macro) units that define
a complex is the one that maximizes the existence of the complex to which the units belong,
rather than their own existence.
In practice, the search for the maximal grain should be an iterative process, starting from
micro units: identify potential substrates for macro units (^S) that are maximally irreducible
within, identify mappings g that maximize the integrated information of systems of macro
units, then consider additional potential substrates for macro units, and so on iteratively, until
a global maximum is found. The iterative approach is necessary for establishing that a substrate
is maximally irreducible within, as this criterion requires consideration not only of micro
units, but also of all finer grains (potential meso units defined from subsets of ^S).
Here we outlined an overall framework for identifying macro units consistent with the pos-
tulates. Additional details about the nature of the mapping g, and how to derive the transition
probabilities for a system of macro units are also informed by the postulates (see (11) in S1
Notes).
Unfolding the causeeffect structure of a complex
Once a maximal substrate and the associated maximal causeeffect state have been identified,
we must unfold its causeeffect power to reveal its causeeffect structure of distinctions and
relations, in line with the composition postulate. As components of the causeeffect structure,
distinctions and relations must also satisfy the postulates of IIT (save for composition).
Composition and causal distinctions
Causal distinctions capture how the causeeffect power of a substrate is structured by subsets
of units that specify irreducible causes and effects over subsets of its units. A candidate distinc-
tion d(m) consists of (1) a mechanism M S in state m 2 OM inherited from the system state s
2 OS; (2) a maximal causeeffect state z ¼ fz
c; z
eg over the cause and effect purviews (Zc, Ze
S) linked by the mechanism; and (3) an associated value of irreducibility (φd > 0). A distinc-
tion d(m) is thus represented by the tuple
dðmÞ ¼ ðm; z; φdÞ:
ð27Þ
For a given mechanism m, our goal is to identify its maximal cause Z
c in state z
c 2 OZc and
its maximal effect Z
e in state z
e 2 OZe within the system, where Z
c; Z
e S.
As above, in line with existence, intrinsicality, and information, we determine the maximal
cause or effect state specified by the mechanism over a candidate purview within the system
based on the value of intrinsic information ii(m, z). Next, in line with integration, we deter-
mine the value of integrated information φd(m, Z, θ) over the minimum partition θ0. In line
with exclusion, we determine the maximal causeeffect purviews for that mechanism over all
possible purviews Z S based on the associated value of irreducibility φd(m, Z, θ0). Finally, we
determine whether the maximal causeeffect state specified by the mechanism is congruent
with the systems overall causeeffect state (z
c s
c, z
e s
e), in which case we conclude that it
contributes a distinction to the overall causeeffect structure.
The updated formalism to identify causal distinctions within a system S in state s was first
presented in [12]. Here we provide a summary with minor adjustments on selecting z
c and z
e,
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
20 / 45
the cause integrated information φc(m, Z), and the requirement that causal distinctions must
be congruent with the systems maximal causeeffect state (see S2 Text).
Existence, intrinsicality, and information: Determining the cause and effect state speci-
fied by a mechanism over candidate purviews.
Like the system as a whole, its subsets must
comply with existence, intrinsicality, and information. As for the system, we begin by quantify-
ing, in probabilistic terms, the difference a subset of units M S in its current state m s
takes and makes from and to subsets of units Z S (cause and effect purview). As above, we
start by establishing the interventional conditional probabilities and unconstrained probabili-
ties from the TPMs T c and T e.
When dealing with a mechanism constituted by a subset of system units, it is important to
capture the constraints on a purview state z that are exclusively due to the mechanism in its
state (m), removing any potential contribution from other system units. This is done by caus-
ally marginalizing all variables in X = S\M, which corresponds to imposing a uniform distribu-
tion as p(X) [8, 10, 12] (see (12) in S1 Notes). The effect probability of a single unit Zi 2 Z
conditioned on the current state m is thus defined as
peðzi j mÞ ¼ jOXj
1X
x2OX
pðzi j m; xÞ;
zi 2 OZi:
ð28Þ
In addition, product probabilities π(zjm) are used instead of conditional probabilities pe(zjm)
to discount correlations from units in X = S\M with divergent outputs to multiple units in Z
S [8, 10, 36]. Otherwise, X might introduce correlations in Z that would be wrongly considered
as effects of M. Based on the appropriate TPM, the probability over a set Z of |Z| units is thus
defined as the product of the probabilities over individual units
peðz j mÞ ¼
Y
jZj
i¼1
peðzi j mÞ;
z 2 OZ;
ð29Þ
and
pcðm j zÞ ¼
Y
jMj
i¼1
pcðmi j zÞ;
m 2 OM:
ð30Þ
Note that for a single unit purview πe(zjm) = pe(zjm), and for a single unit mechanism πc(mjz)
= pc(mjz). By using product probabilities, causal marginalization maintains the conditional
independence between units (2) because independent noise is applied to individual connec-
tions. The assumption of conditional independence distinguishes IITs causal powers analysis
from standard information-theoretic analyses of information flow [10, 27] and corresponds to
an assumption that variables are “physical” units in the sense that they are irreducible within
and can be observed and manipulated independently.
From Eqs (29) and (30) we can also define unconstrained probabilities
peðz; MÞ ¼ jOMj
1 X
m2OM
peðz j mÞ;
z 2 OZ;
ð31Þ
and
pcðm; ZÞ ¼ jOZj
1X
z2OZ
pcðm j zÞ;
m 2 OM:
ð32Þ
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
21 / 45
Given the set Y = S\Z, the backward cause probability (selectivity) for a mechanism m with
|M| units is computed using Bayes rule over the product distributions
p
c ðz j mÞ ¼ pcðm j zÞ jOZj
1
pcðm; ZÞ
¼
Y
jMj
i¼1
pcðmi j zÞ
X
^z2OZ
Y
jMj
i¼1
pcðmi j ^zÞ
;
z 2 OZ;
ð33Þ
where pcðmi j zÞ ¼ jOYj
1 X
y2OY
pcðmi j z; yÞ in line with (28).
To correctly quantify intrinsic causal constraints, the marginal probability of possible cause
states (for computing p
c ðz j mÞ or πc(m; Z)) is again set to the uniform distribution. As above,
all probabilities are obtained from the TPMs T e (3) and T c (4) and thus correspond to inter-
ventional probabilities throughout.
Having defined cause and effect probabilities, we can now evaluate the intrinsic informa-
tion of a mechanism m over a purview state z 2 OZ analogously to the system intrinsic infor-
mation (5) and (7). The intrinsic effect information that a mechanism in a state m specifies
about a purview state z is
iieðm; zÞ ¼ peðz j mÞ log
peðz j mÞ
peðz; MÞ
:
ð34Þ
The intrinsic cause information that a mechanism in a state m specifies about a purview state z
is
iicðm; zÞ ¼ p
c ðz j mÞ log
pcðm j zÞ
pcðm; ZÞ
:
ð35Þ
As with system intrinsic information, the logarithmic term is the informativeness, which
captures how much causal power is exerted by the mechanism m on its potential effect z (how
much it increases the probability of that state above chance), or by the potential cause z on the
mechanism m. The term in front of the logarithm corresponds to the mechanisms selectivity,
which captures how much the causal power of the mechanism m is concentrated on a specific
state of its purview (as opposed to other states). In the following we will again focus on the
effect side, but an equivalent procedure applies on the cause side (see S1 Fig).
Based on the principle of maximal existence, the maximal effect state of m within the pur-
view Z is defined as
z0
eðm; ZÞ ¼ argmax
z2OZ
iieðm; zÞ;
ð36Þ
which corresponds to the specific effect of m on Z. Note that z0
e is not always unique (see S1
Text). The maximal intrinsic information of mechanism m over a purview Z is then
iieðm; ZÞ ≔ iieðm; z0
eÞ ¼ max
z2OZ iieðm; zÞ:
ð37Þ
Note that, by this definition, if iie(m, Z) 6¼ 0, mechanism m always raises the probability of
its maximal effect state compared to the unconstrained probability. This is because there is at
least one state z 2 OZ such that πe(zjm) > πe(z; M).
The intrinsic information of a candidate distinction, like that of the system as a whole, is
sensitive to indeterminism (the same state leading to multiple states) and degeneracy (multiple
states leading to the same state) because both factors decrease the probability of the selected
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
22 / 45
state. Moreover, the product of selectivity and informativeness leads to a tension between
expansion and dilution: larger purviews tend to increase informativeness because conditional
probabilities will deviate more from chance, but they also tend to decrease selectivity because
of the larger repertoire of states.
Integration: Determining the irreducibility of a candidate distinction.
To comply with
integration, we must next ask whether the specific effect of m on Z is irreducible. As for the
system, we do so by evaluating the integrated information φe(m, Z). To that end, we define a
set of “disintegrating” partitions Θ(M, Z) as
YðM; ZÞ ¼
fðMðiÞ; ZðiÞÞg
k
i¼1 : k 2 f2; 3; 4; . . .g; MðiÞ 2 PðMÞ; ZðiÞ 2 PðZÞ;
S MðiÞ ¼ M; S ZðiÞ ¼ Z; ZðiÞ \ ZðjÞ ¼ MðiÞ \ MðjÞ ¼ ; 8 i 6¼ j; MðiÞ ¼ M ) ZðiÞ ¼ ;
;
ð38Þ
where {M(i)} is a partition of M and {Z(i)} is a partition of Z, but the empty set may also be used
as a part (P denotes the power set). As introduced in [10, 12], a disintegrating partition θ 2 Θ
(M, Z) either “cuts” the mechanism into at least two independent parts if |M| > 1, or it severs
all connections between M and Z, which is always the case if |M| = 1 (we refer to [10, 12] for
details). Note that disintegrating partitions differ from system partitions (23), which divide the
system into two or more parts in a directed manner to evaluate whether and to what extent the
system is integrated in terms of its causeeffect power. Instead, disintegrating partitions apply
to mechanismpurview pairs within the system, which are already directed, to evaluate the
cause or effect power specified by the mechanism over its purview.
Given a partition θ 2 Θ(M, Z), we can define the partitioned effect probability
py
eðz0
e j mÞ ¼
Y
k
i¼1
peðz0ðiÞ
e
j mðiÞÞ;
ð39Þ
with pðjmðiÞÞ ¼ pðÞ ¼ 1. In the case of mðiÞ ¼ , peðz0ðiÞ
e jÞ corresponds to the fully parti-
tioned effect probability
peðz j Þ ¼
Y
jZj
i¼1
X
s2OS
peðzi j sÞjOSj
1:
ð40Þ
The integrated effect information of mechanism m over a purview Z S with effect state z0
e
for a particular partition θ 2 Θ(M, Z) is then defined as
φeðm; Z; yÞ ¼ peðz0
e j mÞ
log
peðz0
e j mÞ
py
eðz0
e j mÞ
þ
:
ð41Þ
The effect of m on z0
e is reducible if at least one partition θ 2 Θ(M, Z) makes no difference to
the effect probability or increases it compared to the unpartitioned probability. In line with the
principle of minimal existence, the total integrated effect information φe(m, Z) again has to be
evaluated over θ0, the minimum partition (MIP)
φeðm; ZÞ ≔ φeðm; Z; y0Þ;
ð42Þ
which requires a search over all possible partitions θ 2 Θ(M, Z):
y0 ¼ argmin
y2YðM;ZÞ
φðm; Z; yÞ
max
T 0 φðm; Z; yÞ :
ð43Þ
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
23 / 45
As in (23), the minimum partition is evaluated against its maximum possible value across all
possible systems TPMs T
0, which again corresponds to the number of possible pairwise inter-
actions affected by the partition.
The integrated cause information is defined analogously, as
φcðm; ZÞ ≔ φcðm; Z; y0Þ ¼ p
c ðz0
c j mÞ
log
pcðm j z0
py0
c ðm j z0
þ
;
ð44Þ
where the partitioned probability py
cðm j zÞ is again a product distribution over the parts in the
partition, as in (39).
Taken together, the intrinsic information (37) determines what cause or effect state the
mechanism m specifies. Its integrated information quantifies to what extent m specifies its
cause or effect in an irreducible manner. Again, φ(m, Z) is a quantifier of irreducible existence.
Exclusion: Determining causal distinctions.
Finally, to comply with exclusion, a mecha-
nism must select a definite effect purview, as well as a cause purview, out of a set of candidate
purviews. Resorting again to the principle of maximal existence, the mechanisms effect pur-
view and associated effect is the one having the maximum value of integrated information
across all possible purviews Z S in state z0
eðm; ZÞ (36)
z
eðmÞ ¼ argmax
ZS
φeðm; z0
eðm; ZÞÞ:
ð45Þ
The integrated effect information of a mechanism m within S is then
φeðmÞ ≔ φeðm; z
eðmÞÞ ¼ max
ZS φeðm; z0
eðm; ZÞÞ:
ð46Þ
The integrated cause information φc(m) and the maximally irreducible cause z
cðmÞ are
defined in the same way (see S1 Fig). Based again on the principle of minimal existence, the
irreducibility of the distinction specified by a mechanism is given by the minimum between its
integrated cause and effect information
φdðmÞ ¼ min ðφcðmÞ; φeðmÞÞ:
ð47Þ
Determining the set of causal distinctions that are congruent with the system cause
effect state. As required by composition, unfolding the full causeeffect structure of the sys-
tem S in state s requires assessing the irreducible causeeffect power of every subset of units
within S (Fig 2). Any m s with φd > 0 specifies a candidate distinction d(m) = (m, z*, φd)
(27) within the system S in state s. However, in order to contribute to the causeeffect structure
of a system, distinctions must also comply with intrinsicality and information at the system
level. Thus, the fact that the system must select a specific causeeffect state implies that the
causeeffect state they specify over subsets of the system (z ¼ fz
c; z
eg) must be congruent
with the causeeffect state specified over itself by the system as a whole s0.
We thus define the set of all causal distinctions within S in state s as
DðT e; T c; sÞ ¼ fdðmÞ : m s; φdðmÞ > 0; z
cðmÞ s0
c; z
eðmÞ s0
eg:
ð48Þ
Altogether, distinctions can be thought of as irreducible “handles” through which the sys-
tem can take and make a difference to itself by linking an intrinsic cause to an intrinsic effect
over subsets of itself. As components within the system, causal distinctions have no inherent
structure themselves. Whatever structure there may be between the units that make up a dis-
tinction is not a property of the distinction but due to the structure of the system, and thus cap-
tured already by its compositional set of distinctions. Similarly, from an extrinsic perspective,
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
24 / 45
one may uncover additional causes and effects, both within the system and across its borders,
at either macro or micro grains. However, from the intrinsic perspective of the system causes
and effects that are excluded from its causeeffect structure do not exist [17, 29].
For example, as shown in Fig 3(A), a system may have a mechanism through which it speci-
fies, in a maximally irreducible manner, the effect state of a triplet of units (e.g., z
e ¼ abc, a
third-order purview; again lowercase letters for units indicate state “−1,” uppercase letters state
“+1”). However, if the system lacks a mechanism through which it can specify the effect state
of single units, each taken individually (say, unit a, a first-order effect purview), then, from its
intrinsic perspective, that unit does not exist as a single unit. By the same token, if the system
can specify individually the state of unit a, b, and c, but lacks a way to specify irreducibly the
state of abc together, then, from its intrinsic perspective, the triplet abc does not exist as a trip-
let (see Fig 3(B)). Finally, even if the system can distinguish the single units a, b, and c, as well
as the triplet abc, if it lacks handles to distinguish pairs of units such as ab and bc, it cannot
order units in a sequence.
Composition and causal relations
Causal relations capture how the causes and/or effects of a set of distinctions within a complex
overlap with each other. Just as a distinction specifies which units/states constitute a cause pur-
view and the linked effect purview, a relation specifies which units/states correspond to which
units/states among the purviews of a set of distinctions. Relations thus reflect how the cause
effect power of its distinctions is “bound together” within a complex. The irreducibility due to
this binding of causeeffect power is measured by the relations irreducibility (φr > 0). Rela-
tions between distinctions were first described in [11] (for differences with the initial presenta-
tion see S2 Text).
A set of distinctions d D(s) is related if the causeeffect state of each distinction d 2 d
overlaps congruently over a set of shared units, which may be part of the cause, the effect, or
Fig 2. Composition and causal distinctions. Identifying the irreducible causal distinctions specified by a substrate in a state requires evaluating the specific
causes and effects of every system subset. The candidate substrate is constituted of two interacting units S = aB (see Fig 1) with TPMs T e and T c as shown.
In addition to the two first-order mechanisms a and B, the second-order mechanism aB specifies its own irreducible cause and effect, as indicated by
φd > 0.
https://doi.org/10.1371/journal.pcbi.1011465.g002
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
25 / 45
both the cause and the effect of each distinction. Below we will denote the cause of a distinction
d as z
cðdÞ and its effect as z
eðdÞ. For a given set of distinctions d D(s), there are potentially
many “relating” sets of causes and/or effects z such that
z : z \ fz
cðdÞ; z
eðdÞg 6¼ 8d 2 d;
\
z2z
z 6¼ ; jzj > 1
ð49Þ
with maximal overlap
o∗ðzÞ ¼
\
z2z
z 6¼ :
ð50Þ
Since z
cðmÞ s0
c and z
eðmÞ s0
e are sets of tuples containing both the units and their states,
the intersection operation considers both the units and the state of the units.
All possible sets z specify unique aspects about a relation r(d) and constitute the various
“faces” of the relation (Fig 4). The maximal overlap o*(z) (50) is also called the “face purview.”
The set of faces associated with a relation thus specifies which type of relation it is (e.g., a sin-
gle-faceted relation that only relates the causes of the set of distinctions, or a multi-faceted rela-
tion, which requires some of the distinctions to overlap on both the cause and effect side).
Note that (49) includes the case z ¼ fz
cðdÞ; z
eðdÞg, which indicates a “self-relation” over the
cause and effect of a single distinction d 2 D(s).
A relation r(d) thus consists of a set of distinctions d 2 D(s), with an associated set of faces
f(d) = {f(z)}d and irreducibility φr > 0,
rðdÞ ¼ ðd; f ðdÞ; φrÞ:
ð51Þ
A relation that binds together h = |d| distinctions is a h-degree relation. A relation face f(z) 2 f(d)
Fig 3. Composition of intrinsic effects. From the intrinsic perspective of the system, a specific cause or effect is only available to the system if it is selected
by a causal distinction d 2 D(s). In (A), only the top-order effect is specified. From the intrinsic perspective, the system cannot distinguish the individual
units. In (B), only first-order effects are specified. The system has no “handle” to select all three units together. (C) If both first- and third-order effects are
specified, but no second-order effects, the system can distinguish individual units and select them together, but has no way of ordering them sequentially.
(D) The system can distinguish individual units, select them altogether, as well as order them sequentially, in the sense that it has a handle for ab and bc, but
not ac. The ordering becomes apparent once the relations among the distinctions are considered (see below, Fig 5).
https://doi.org/10.1371/journal.pcbi.1011465.g003
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
26 / 45
consists of a set of causes and effects z (as in 49), with associated face purview o∗ðzÞ (50)
f ðzÞ ¼ ðz; o∗ðzÞÞ:
ð52Þ
A relation face over k = |z| purviews is a k-degree face. The set of faces includes all the ways in
which the set of distinctions d counts as related according to (49). Because z may include either
the cause, or the effect, or both the cause and effect of a distinction d 2 d, a relation r(d) with
|d| > 1 may comprise up to 3|d| faces. If a set of distinctions d 2 D(s) does not overlap con-
gruently, it is not related (in that case o∗ðzÞ ¼ for all possible f(z) 2 f(d)) (Fig 5).
Causal relations inherit existence from the causeeffect power of the distinctions that com-
pose them. They inherit intrinsicality because the causes and effects that compose their faces
are specified within the substrate. Moreover, relations are specific because the joint purviews
of their faces must be congruent for all causes and effects z* 2 z. Note that relation purviews
are necessarily congruent with the overall cause and effect state specified by the system as a
whole, because the causes and effects of the distinctions composing a relation must themselves
be congruent.
The irreducibility of a causal relation is measured by “unbinding” distinctions from their
joint purviews, taking into account all faces of the relation. Distinctions d 2 D(s) are already
established as maximally irreducible components, characterized by their value of integrated
information φd. To assess the irreducibility of a relation, we thus assume that the integrated
information φd of a distinction is distributed uniformly across unique cause and effect purview
units, such that
φd
jz
cðdÞ [ z
eðdÞj
ð53Þ
is the average irreducible information φd per unique purview unit for an individual distinction
d 2 d with causeeffect state z∗ðdÞ ¼ fz
cðdÞ; z
eðdÞg. Since the union operator takes the states
of the units into account, incongruent units are counted separately, while congruent units on
the cause and effect side count as one.
Since distinctions are related by specifying common units into common states, the effect of
“unbinding” a distinction must be proportional to the number of units jointly specified in the
Fig 4. Composition and causal relations. Relations between distinctions specify joint causes and/or effects. The two distinctions d(a) and d(aB) each
specify their own cause and effect. In this example, their cause and effect purviews overlap over the unit b and are congruent, which means that they all
specify b to be in state “-1.” The relation r({a, aB}) thus binds the two distinctions together over the same unit. Relation faces are indicated by the blue lines
and surfaces between the distinctions causes and/or effects (different shades are used to individuate the faces). Because all four purviews overlap over the
same unit, all nine possible faces exist. Note that the fact that the two distinctions overlap irreducibly can only be captured by a relation and not by a high-
order distinction.
https://doi.org/10.1371/journal.pcbi.1011465.g004
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
27 / 45
relation, i.e. the number of distinct units over the joint purviews of all faces in the relation:
[
f 2f ðdÞ
o
f
:
ð54Þ
This union of the face purviews o
f is also called the “relation purview” or the “joint purview”
of the relation. While any partition of one or more distinctions from the relation will “unbind”
the set of distinctions d, by the principle of minimal existence, a relation can only be as irre-
ducible as the minimal amount of integrated information specified by any one distinction in
the relation. Therefore, the relation integrated information φr(d) is defined as
φrðdÞ ¼ min
d2d
[
f 2f ðdÞ
o
f
φd
jz
cðdÞ [ z
eðdÞj :
ð55Þ
In words, for each distinction, we take the average integrated information per distinct purview
element (53), multiply it by the number of units across all faces of the relation (54), and then
find the distinction that contributes the least integrated information per overlap unit as the
minimum partition of the relation (with corresponding integrated information φr). Defining
φr in this way guarantees that the integrated information of a relation cannot exceed the inte-
grated information of its weakest distinction. For a given set of distinctions, the maximum
value of φr occurs for a relation in which the cause and effect of each distinction is fully over-
lapped by all other distinctions in the relation (in that case, φr = mind2d φd). Note also that a
relation satisfies exclusion (distinctions overlap on this whole set of units) in that its integrated
information is naturally maximized (per the principle of maximal existence) over the maximal
Fig 5. Structuring of intrinsic effects by relations. (A) A single undifferentiated effect has no relations. (B) Likewise, there are no relations among
multiple non-overlapping effects. (C) The set of three first-order effects and one third-order effect supports three relations, which bind the effects together.
(D) The set of first, second, and third-order effects supports a large number of relations (ten 2-relations (between two effects), six 3-relations, and one
4-relation), which bind the effects in a structure that is ordered sequentially.
https://doi.org/10.1371/journal.pcbi.1011465.g005
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
28 / 45
congruent overlap o
f for each relation face (50) (taking subsets of these overlaps could only
reduce the integrated information of the relation).
In summary, just as distinctions link a cause with an effect, relations bind various combina-
tions of causes and effects that are congruent over the same units (Fig 4). And just as a distinc-
tion captures the irreducibility of an individual causeeffect linked by a mechanism, a relation
captures the irreducibility of a set of distinctions bound by the joint purviews of their causes
and/or effects.
For a set of distinctions D, we define the set of all relations among them as
RðDÞ ¼ frðdÞ : φrðdÞ > 0g; 8d D:
ð56Þ
In practice, the total number of relations and their SR(D) φr can be determined analytically for
a given set of distinctions D, which greatly reduces the necessary computations (see S3 Text).
Together, a set of distinctions D and its associated set of relations R(D) compose a causeeffect
structure.
Causeeffect structures and Φ-structures
A causeeffect structure is defined as the union of the distinctions specified by a substrate and
the relations binding them together:
CðDÞ ¼ D [ RðDÞ:
ð57Þ
The causeeffect structure specified by a maximal substrate—a complex—is also called a Φ-
structure:
CðT e; T c; s∗Þ ¼
fdðmÞ ¼ fm; z; φdg 2 T e; T c; s∗Þg S frðdÞ ¼ fd; f ðdÞ; φrg 2 RðDðT e; T c; s∗ÞÞg
: ð58Þ
The sum of the values of integrated information of a substrates distinctions and relations,
called Φ (“big Phi,” “structure Phi”) corresponds to the structure integrated information of the
Φ-structure,
ΦðT e; T c; s∗Þ ¼
X
CðT e;T c;s∗Þ
φ:
ð59Þ
Note that Φ is not computed based on a partition (as system phi), but rather a sum of the
integrated information within the structure (where each term of the sum was computed by
partitioning). Within a Φ-structure, various types of meaningful sub-structures can be speci-
fied, which we term Φ-folds. A Φ-fold is composed of a subset of the distinctions and relations
that compose the overall causeeffect structure. A special case is the distinction Φ-fold, denoted
C({d}), a sub-structure composed of a single distinction and the relations bound to it, which
form its context [11] (see (13) in S1 Notes). A compound Φ-fold is a sub-structure composed of
the distinction Φ-folds specified by a subset of units. A compound Φ-fold is a relevant part of a
Φ-structure because it can be accessed or manipulated by changing the state, connections, or
functioning of a part of the substrate. Finally, a content Φ-fold, or simply content, is composed
of a subset of distinctions that are highly interrelated (regardless of the mechanisms and units
that specify them).
In conclusion, a maximal substrate or complex is a set of units S* = s* that satisfies all of
IITs postulates: its causeeffect power is intrinsic, specific, irreducible, definite, and struc-
tured. By IIT, a complex S* does not exist as such, but exists “unfolded” into its Φ-structure,
with all the causal distinctions and relations that compose it. In other words, a substrate is
what can be observed and manipulated “operationally” from the extrinsic perspective. From
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
29 / 45
the intrinsic perspective, what truly exists is a complex with all its causal powers unfolded—an
intrinsic entity that exists for itself, absolutely, rather than relative to an external observer.
According to the explanatory identity of IIT, an experience is identical to the Φ-structure of
an intrinsic entity: every property of the experience should be accounted for by a correspond-
ing property of the Φ-structure, with no additional ingredients. If a system S in state s is a com-
plex, then its Φ-structure corresponds to the quality of the experience of S in state s, while its Φ
value corresponds to its quantity—in other words, to the nature and amount of intrinsic
content.
Results and discussion
In this section, we apply the mathematical framework of IIT 4.0 to several example systems.
The goal is to illustrate three critical implications of IITs postulates:
1. Consciousness and connectivity: how the way units interact determines whether a sub-
strate can support a Φ-structure of high Φ.
2. Consciousness and activity: how changes in the state of a substrates units change Φ-
structures.
3. Consciousness and functional equivalence: how substrates that are functionally equivalent
may not be equivalent in terms of their Φ-structures, and thus in terms of consciousness.
The following examples will feature very simple networks constituted of binary units Ui 2
U with OUi ¼ f1; 1g for all Ui and a logistic (sigmoidal) activation function
pðUi;t ¼ 1 j ut1Þ ¼
1
1 þ exp ðk Pn
j¼1 wj;iuj;t1Þ ;
ð60Þ
where k > 0 and
X
n
j¼1
wj;i ¼ 1 8 i:
ð61Þ
In Eq (60), the parameter k defines the slope of the logistic function and allows one to adjust
the amount of noise or determinism in the activation function (higher values signify a
steeper slope and thus more determinism). The units Ui can thus be viewed as noisy linear
threshold units with weighted connections among them, where k determines the connection
strength.
As in Figs 1 and 2, units denoted by uppercase letters are in state 1 (ON, depicted in
black), units denoted by lowercase letters are in state ‘−1 (OFF, depicted in white). Cause
effect structures are illustrated as geometrical shapes projected into 3D space (Fig 6). Dis-
tinctions are depicted as mechanisms (black labels) tying a cause (red labels) and an effect
(green labels) through a link (orange edges, thickness indicating φd). Relation faces of sec-
ond- and third-degree relations are depicted as edges or triangular surfaces between the
causes and effects of the related distinctions. While edges always bind pairs of distinctions
(a second-degree relation), triangular surfaces may bind the causes and effects of two or
three distinctions (second- or third-degree relation). Relations of higher degrees are not
depicted.
All examples were computed using the “iit-4.0” feature branch of PyPhi [37]. This branch
will be available in the next official release of the software. An example notebook available here
recreates the analysis of Fig 1 (identifying complexes), Fig 2 (computing distinctions), and Fig
4 (computing relations).
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
30 / 45
Fig 6. Causal powers analysis of various network architectures. Each panel shows the networks causal model and
weights on the left. Blue regions indicate complexes with their respective φs values. In all networks, k = 4 and the state
is Abcdef. The Φ-structure(s) specified by the networks complexes are illustrated to the right (with only second- and
third-degree relation faces depicted) with a list of their distinctions for smaller systems and their ∑φ values for those
systems with many distinctions and relations. All integrated information values are in ibits. (A) A degenerate network
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
31 / 45
Consciousness and connectivity
The first set of examples highlights how the organization of connections among units impacts
the ability of a substrate to support a causeeffect structure with high structure integrated
information (high Φ). Fig 6 shows five systems, all in the same state s = Abcdef with the same
number of units, but with different connectivity among the units.
Degenerate systems, indeterminism, and specificity.
Fig 6A shows a network with
medium indeterminism (k = 4) and high degeneracy, due to the fact that unit A forms a “bot-
tleneck” with inputs and outputs to and from the remaining units. The network condenses
into one complex of two units Ab and four complexes corresponding to the individual units c,
d, e, and f (also called “monads”).
The causes and effects of the causal distinctions for the two types of complexes are shown in
the middle, and the corresponding causeeffect structures are illustrated on the right. In this
case, degeneracy (coupled with indeterminism) undermines the ability of the maximal sub-
strate to grow in size, which in turn limits the richness of the Φ-structure that can be sup-
ported. Because of the bottleneck architecture, the current state of candidate system Abcdef has
many possible causes and effects, leading to an exponential decrease in selectivity (the condi-
tional probabilities of cause and effect states). This dilutes the value of intrinsic information
(ii) for larger subsets of units, which in turn reduces their value of system integrated informa-
tion φs. Consequently, the maximal substrates are small, and their Φ values are necessarily low.
This example suggests that to grow and achieve high values of Φ, substrates must be consti-
tuted of units that are specialized (low degeneracy) and interact very effectively (low
indeterminism).
Notably, the organization of the cerebral cortex, widely considered as the likely substrate of
human consciousness, is characterized by extraordinary specialization of neural units at all lev-
els [3840]. Moreover, if the background conditions are well controlled, neurons are thought
to interact in a highly reliable, nearly deterministic manner [4143].
Modular systems, fault lines, and irreducibility.
Fig 6B shows a network comprising
three weakly interconnected modules, each having two strongly connected units (k = 4). In
this case, the weak inter-module connections are clear fault lines. Properly normalized, parti-
tions along these fault lines separating modules yield values of φs that are much smaller than
those yielded by partitions that cut across modules. As a consequence, the 6-unit system con-
denses into three complexes (Ab, cd, and ef), as determined by their maximal φs values. Again,
because the modules are small, their Φ values are low. Intriguingly, a brain region such as the
cerebellum, whose anatomical organization is highly modular, does not contribute to con-
sciousness [44, 45], even though it contains several times more neurons than the cerebral cor-
tex (and is indirectly connected to it).
Note that fault lines can be due not just to neuroanatomy but also to neurophysiological fac-
tors. For example, during early slow-wave sleep, the dense interconnections among neuronal
groups in cerebral cortical areas may break down, becoming causally ineffective due to the
in which unit A forms a bottleneck with redundant inputs from and outputs to the remaining units. The first-maximal
complex is Ab, which excludes all other subsets with φs > 0 except for the individual units c, d, e, and f. (B) The
modular network condenses into three complexes along its fault lines (which exclude all subsets and supersets), each
with a maximal φs value, but low Φ, as the modules each specify only two or three distinctions and at most five
relations. (C) A directed cycle of six units forms a six-unit complex with φs = 1.74 ibits, as no other subset is integrated.
However, the Φ-structure of the directed cycle is composed of only first-order distinctions and few relations. (D) A
specialized lattice also forms a complex (which excludes all subsets), but specifies 27 first- and high-order distinctions,
with many relations (>1.5 × 106) among them. Its Φ value is 11452 ibits. (E) A slightly modified version of the
specialized lattice in which the first-maximal complex is Abef. The full system is not maximally irreducible and is
excluded as a complex, despite its positive φs value (indicated in gray).
https://doi.org/10.1371/journal.pcbi.1011465.g006
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
32 / 45
bistability of neuronal excitability. This bistability, brought about by neuromodulatory changes
[46], is associated with the loss of consciousness [47].
Directed cycles, structural sparseness, and composition.
Fig 6C shows a directed cycle
in which six units are unidirectionally connected with weight w = 1.0 and k = 4. Each unit cop-
ies the state of the unit before it, and its state is copied by the unit after it, with some indeter-
minism. The copy cycle constitutes a 6-unit complex with a maximal φs = 1.74 ibits. However,
despite the “large” substrate, the Φ-structure it specifies has low structure integrated informa-
tion (Φ = 7.65). This is because the systems Φ-structure is composed exclusively of first-order
distinctions, and consequently of a small number of relations.
Highly deterministic directed cycles can easily be extended to constitute large complexes,
being more irreducible than any of their subsets. However, the lack of cross-connections
(“chords” in graph-theoretic terms) greatly limits the number of components of the Φ-struc-
tures specified by the complexes, and thus their structure integrated information (Φ). (Note
also that increasing the number of units that constitute the directed cycle would not change
the amount of φs specified by the network as a whole.).
The brain is rich in partially segregated, directed cycles, such as those originating in cortical
areas, sequentially reaching stations in the basal ganglia and thalamus, and cycling back to cor-
tex [48, 49]. These cycles are critical for carrying out many cognitive and other functions, but
they do not appear to contribute directly to experience [4].
Specialized lattices and Φ-structures with high structure integrated information.
Fig
6D shows a network consisting of six heterogeneously connected units—a “specialized” lattice,
again with k = 4. While many subsystems within the specialized network have positive values
of system integrated information φs, the full 6-unit system is the maximal substrate (excluding
all its subsets from being maximal substrates). Out of 63 possible distinctions, the Φ-structure
comprises 27 distinctions with causes and effects congruent with the systems maximal cause
effect state. Consequently, the full 6-unit system also specifies a much larger number of causal
relations compared to the copy cycle system.
Preliminary work indicates that lattices of specialized units, implementing different input
output functions, but partially overlapping in their inputs (receptive field) and outputs (projec-
tive fields), are particularly well suited to constituting large substrates that unfold into extraor-
dinarily rich Φ-structures. The number of distinctions specified by an optimally connected,
specialized system is bounded above by 2n1, and that of the relations among as many distinc-
tions is bounded by 2ð2n 1. The structure integrated information of such structures is cor-
respondingly large [50].
In the brain, a large part of the cerebral cortex, especially its posterior regions, is organized
as a dense, divergent-convergent hierarchical 3D lattice of specialized units, which makes it a
plausible candidate for the substrate of human consciousness [4, 11, 51, 52]. Note that directed
cycles originating and ending in such lattices typically remain excluded from the first-maximal
complex because minimal partitions across such cycles yield a much lower value of φs com-
pared to minimal partitions across large lattices.
Near-maximal substrates, extrinsic entities, and exclusion.
Finally, Fig 6E shows a net-
work of six units, four of which (Abef) constitute a specialized lattice that corresponds to the
first complex. Though integrated, the full set of 6 units happens to be slightly less irreducible
(φs = 0.15) than one of its 4-unit subsets (φs = 0.27). From the extrinsic perspective, the 6-unit
system undoubtedly behaves as a highly integrated whole (nearly as much as its 4-unit subset),
one that could produce complex inputoutput functions due to its rich internal structure.
From the intrinsic perspective of the system, however, only the 4-unit subset satisfies all the
postulates of existence, including maximal irreducibility (accounting for the definite nature of
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
33 / 45
experience). In this example, the remaining units form a second complex with low φs and
serve as background conditions for the first complex.
A similar situation may occur in the brain. The brain as a whole is undoubtedly integrated
(not to mention that it is integrated with the body as a whole), and neural “traffic” is heavy
throughout. However, its anatomical organization may be such that a subset of brain regions,
arranged in a dense 3D lattice primarily located in posterior cortex, may achieve a much
higher value of integrated information than any other subset. Those regions would then con-
stitute the first complex (the “main complex,” [4]), and the remaining regions might condense
into a large number of much smaller complexes.
Taken together, the examples in Fig 6 demonstrate that the connectivity among the units of
a system has a strong impact on what set of units can constitute a complex and thereby on the
structure integrated information it can specify. The examples also demonstrate the role played
by the various requirements that must be satisfied by a substrate of consciousness: existence
(causal power), intrinsicality, specificity, maximal irreducibility (integration and exclusion),
and composition (structure).
Consciousness and activity: Active, inactive, and inactivated units
A substrate exerts causeeffect power in its current state. For the same substrate, changing the
state of even one unit may have major consequences on the distinctions and relations that
compose its Φ-structure: many may be lost, or gained, and many may change their value of
irreducibility (φd and φr).
Fig 7 shows a network of five binary units that interact through excitatory and inhibitory
connections (weights indicated in the figure). The system is initially in state s = ABcdE (Fig
7A) and is a maximal substrate with φs = 1.1 ibits and a Φ-structure composed of 23 distinc-
tions and their 13740 relations.
If we change the state of unit E from ON to OFF (in neural terms, the unit becomes inac-
tive), the distinctions that the unit contributes to when ON, as well as the associated relations,
may change (Fig 7B). In the case illustrated by the Figure, what changes are the purviews and
irreducibility of several distinctions and associated relations, the number of distinctions stays
the same, φs changes only slightly, but the number of relations is lower, leading to a lower Φ
value. In other words, what a single unit contributes to intrinsic existence is not some small
“bit” of information. Instead, a unit contributes an entire sub-structure, composed of a very
large number of distinctions and relations. The set of distinctions to which a subset of units
contributes as a mechanism, either alone or in combination with other units, together with
their associated relations, forms a compound Φ-fold. With respect to the neural substrate of
consciousness in the brain, this means that even a change in the state of a single unit is typically
associated with a change in an entire Φ-fold within the overall Φ-structure, with a correspond-
ing change in the structure of the experience. (Note, however, that in larger systems such
changes will typically be less extreme, see also [11].).
In Fig 7C, we see what happens if unit E, instead of just turning inactive (OFF) is inactivated
(abolishing its causeeffect power because it no longer has any counterfactual states and thus
cannot be intervened upon). In this case, all the distinctions and relations to which that unit
contributes as a mechanism would cease to exist (its compound Φ-fold collapses). Moreover,
all the distinctions and relations to whose purviews that unit contributes—its purview Φ-fold
—would also collapse or change. In fact, the complex shrinks because it cannot include that
unit. With respect to the neural substrate of consciousness, this means that while an inactive
unit contributes to a different experience, an inactivated unit ceases to contribute to experience
altogether. The fundamental difference between inactive and inactivated units leads to the
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
34 / 45
Fig 7. Causal powers analysis of the same system with one of its units set to active, inactive, or inactivated. In all panels,
the same causal model and weights are shown on the left, but in different states. For all networks k = 4. The set of distinctions
D s), their causes and effects, and their φd values are shown in the middle. The Φ-structure specified by the networks
complex is illustrated on the right (again with only second- and third-degree relation faces depicted). All integrated
information values are in ibits. (A) The system in state ABcdE is a complex with 23 out of 31 distinctions and Φ = 22.26. (B)
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
35 / 45
following corollary of IIT: unlike a fully inactivated substrate which, as would be suspected,
cannot support any experience, an inactive substrate can. If a maximal substrate in an inactive
state is in working order and specifies a large Φ-structure, it will support a highly structured
experience, such as the experience of empty space [11] or the feeling of “pure presence” (see
(14) in S1 Notes).
Consciousness and functional equivalence: Being is not doing
By the intrinsicality postulate, the Φ-structure of a complex depends on the causal interactions
between system subsets, not on the systems interaction with its environment (except for the
role of the environment in triggering specific system states). In general, different physical sys-
tems with different internal causal structure may perform the same inputoutput functions.
Fig 8 shows three simple deterministic systems with binary units (here the “OFF” state is 0,
and “ON” is 1) that perform the same inputoutput function, treating the internal dynamics of
the system as a black box. The function could be thought of, for example, as an electronic toll-
booth “counting 8 valid coins” (8 times input I = 1) before opening the gate [53]. Each system
receives one binary input (I) and has one binary output (O). The output unit switches “ON”
on a count of eight positive inputs I = 1 (when the global state with label 0 is reached in the
cycle), upon which the system resets (Fig 8A).
In addition to being functionally equivalent in their outward behavior, the three systems
share the same internal global dynamics, as their internal states update according to the same
global state-transition diagram (Fig 8B). Given an input I = 1, the system updates its state,
cycling through all its 8 global states (labeled 07) over 8 updates. For an input of I = 0, the sys-
tem remains in its present state. Moreover, all three systems are constituted of three binary
units whose joint states map one-to-one onto the systems global state labels (07). However,
the mapping is different for different systems (Fig 8C, left). This is because the internal binary
update sequence depends on the interactions among the internal units [29, 53], which differ in
the three cases, as can easily be determined through manipulations and observations.
For consistency in the causal powers analysis, in all three cases, the global state “0” that acti-
vates the output unit if I = 1 is selected such that it corresponds to the binary state “all OFF”
(000), which is followed by 1 ≔ 100 and 2 ≔ 010. Also, the Φ-structure of each system is
unfolded in state 1 ≔ 100 in all three cases.
Despite their functional equivalence and equivalent global dynamics, the systems differ in
how they condense into complexes and in the causeeffect structures they specify.
As shown in Fig 8C, the first system forms a 3-unit complex with a relatively rich Φ-struc-
ture (Φ = 21.01 ibits). While the second system also forms a 3-unit complex with the same φs =
2 ibits, it specifies a completely different set of distinctions and has much lower structure inte-
grated information (Φ = 3.64 ibits).
Finally, the third system is reducible (φs = 0 ibits)—in this case, because there are only feed-
forward connections from unit A to units B and C—and it condenses into three complexes
with small Φ-structures.
These examples illustrate a simple scenario of functional equivalence of three systems char-
acterized by a different architecture. The equivalence is with respect to a simple inputoutput
The same system in state ABcde, where unit E is inactive (“OFF”) also forms a complex with the same number of distinctions,
but a somewhat lower Φ value due to a lower number of relations between distinctions. In addition, the systems Φ-structure
differs from that in (A), as the system now specifies a different set of compositional causes and effects. (C) If instead of being
inactive, unit E is inactivated (fixed into the “OFF” state), the inactivated unit cannot contribute to the complex or Φ-
structure anymore. The complex is now constituted of four units (ABcd), with only 14 distinctions and markedly reduced
structure integrated information (Φ = 3.35).
https://doi.org/10.1371/journal.pcbi.1011465.g007
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
36 / 45
function, in this case coin counting, which they multiply realize. The systems are also equiva-
lent in terms of their global system dynamics, in the sense that they go through a globally
equivalent sequence of internal states. However, because of their different substrates, the three
systems specify different causeeffect structures. Therefore, based on the postulates of IIT,
they are not phenomenally equivalent. In other words, they are equivalent in what they do
extrinsically, but not in what they are intrinsically.
This dissociation between phenomenal and functional equivalence has important implica-
tions. As we have seen, a purely feed-forward system necessarily has φs = 0. Therefore, it can-
not support a causeeffect structure and cannot be conscious, whereas systems with a
recurrent architecture can. On the other hand, the behavior (inputoutput function) of any
(discrete) recurrent system can also be implemented by a system with a feed-forward architec-
ture [54]. This implies that any behavior performed by a conscious system supported by a
recurrent architecture can also be performed by an unconscious system, no matter how com-
plex the behavior is. More generally, digital computers implementing programs capable of arti-
ficial general intelligence may in principle be able to emulate any function performed by
conscious humans and yet, because of the way they are physically organized, they would do so
without experiencing anything, or at least anything resembling, in quantity and quality, what
each of us experiences [20] (see also (15) in S1 Notes).
Fig 8. Functionally equivalent networks with different Φ-structures. (A) The inputoutput function realized by three different systems (shown in (C)): a
count of eight instances of input I = 1 leads to output O = 1. (B) The global state-transition diagram is also the same for the three systems: if I = 0, the
systems will remain in their current global state, labeled as 07; if I = 1, the systems will move one state forward, cycling through their global states, and
activate the output if S = 0. (C) Three systems constituted of three binary units but differing in how the units are connected and interact. As a consequence,
the one-to-one mapping between the 3-bit binary states and the global state labels differ. However, all three systems initially transition from 000 to 100 to
010. Analyzed in state 100, the first system (top) turns out to be a single complex that specifies a Φ-structure with six distinctions and many relations,
yielding a high value of Φ. The second system (middle) is also a complex, with the same φs value, but it specifies a Φ-structure with fewer distinctions and
relations, yielding a lower value of Φ. Finally, the third system (bottom) is reducible (φs = 0) and splits into three smaller complexes (entities) with minimal
Φ-structures and low Φ.
https://doi.org/10.1371/journal.pcbi.1011465.g008
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
37 / 45
The examples also show that the overall system dynamics, while often revealing relevant
aspects of a systems architecture, typically do not and cannot exhaust the richness of its cur-
rent causeeffect structure. For example, a system in a fixed point is dynamically “dead” (and
“does” nothing), but it may be phenomenally quite “alive,” for example, experiencing “pure
presence” (see (14) in S1 Notes). Of course, the systems causal powers can be fully unfolded,
and revealed dynamically, by extensive manipulations and observations of subsets of system
units because they are implicitly captured by the systems causal model and ultimately by its
transition probability matrix [29].
Conclusions
IIT attempts to account for the presence and quality of consciousness in physical terms. It
starts from the existence of experience, and proceeds by characterizing its essential properties
—those that are immediate and irrefutably true of every conceivable experience (axioms).
These are then formulated as essential properties of physical existence (postulates), the neces-
sary and sufficient conditions that a substrate must satisfy to support an experience—to consti-
tute a complex. Note that “substrate” is meant in purely operational terms—as a set of units
that a conscious observer can observe and manipulate. Likewise, “physical” is understood in
purely operational terms as causeeffect power—the power to take and make a difference.
The postulates can be assessed based purely on a substrates transition probability matrix,
as was illustrated by a few idealized causal models. Thus, a substrate of consciousness must
be able to take and make a difference upon itself (existence and intrinsicality), it must be able
to specify a cause and an effect state that are highly informative and selective (information),
and it must do so in a way that is both irreducible (integration) and definite (exclusion).
Finally, it must specify its cause and effect in a structured manner (composition), where the
causal powers of its subsets over its subsets compose a causeeffect structure of distinctions
and relations—a Φ-structure. Thus, a complex does not exist as such but only “unfolded” as
a Φ-structure—an intrinsic entity that exists for itself, absolutely, rather than relative to an
external observer.
As shown above, these requirements constrain what substrates can and cannot support con-
sciousness. Substrates that lack in specificity, due to indeterminism and/or degeneracy, cannot
grow to be large complexes. Substrates that are weakly integrated, due to architectural or func-
tional fault lines in their interactions, are less integrated than some of their subsets. Because
they are not maximally irreducible, they do not qualify as complexes. This is the case even
though they may “hang together” well enough from an extrinsic perspective (having a respect-
able value of φs). Furthermore, even substrates that are maximally integrated may support Φ-
structures that are extremely sparse, as in the case of directed cycles. Based on the postulates of
IIT, a universal substrate ultimately “condenses” into a set of disjoint (non-overlapping) com-
plexes, each constituted of a set of macro or micro units.
The physical account of consciousness provided by IIT should be understood as an explana-
tory identity: every property of an experience should ultimately be accounted for by a property
of the causeeffect structure specified by a substrate that satisfies its postulates, with no addi-
tional ingredients. The identity is not between two different substances or realms—the phe-
nomenal and the physical—but between intrinsic (subjective) existence and extrinsic
(objective) existence. Intrinsic existence is immediate and irrefutable, while extrinsic existence
is defined operationally as causeeffect power discovered through observation and manipula-
tion. The primacy of intrinsic existence (of experience) in IIT contrasts with standard attempts
at accounting for consciousness as something “generated by” or “emerging from” a substrate
constituted of matter and energy and following physical laws.
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
38 / 45
The physical correspondent of an experience is not the substrate as such but the Φ-structure
specified by the substrate in its current state. Therefore, minor changes in the substrate state
can correspond to major changes in the specified Φ-structure. For example, if the state of a sin-
gle unit changes, an entire Φ-fold within the Φ-structure will change, and if a single inactive
unit is inactivated, its associated Φ-fold will collapse, even though the current state of the sub-
strate appears the same (Fig 7).
Each experience corresponds to a Φ-structure, not a set of functions, processes, or computa-
tions. Said otherwise, consciousness is about being, not doing [1, 29, 55]. This means that systems
with different architectures may be functionally equivalent—both in terms of global inputoutput
functions and global intrinsic dynamics—but they will not be phenomenally equivalent. For
example, a feed-forward system can be functionally equivalent to a recurrent system that consti-
tutes a complex, but feed-forward systems cannot constitute complexes because they do not sat-
isfy maximal irreducibility. Accordingly, artificial systems powered by super-intelligent computer
programs, but implemented by feed-forward hardware or encompassing critical bottlenecks,
would experience nothing (or nearly nothing) because they have the wrong kind of physical
architecture, even though they may be behaviorally indistinguishable from human beings [20].
Even though the entire framework of IIT is based on just a few axioms and postulates, it is
not possible in practice to exhaustively apply the postulates to unfold the causeeffect power of
realistic systems [32, 56]. It is not feasible to perform all possible observations and manipula-
tions to fully characterize a universal TPM, or to perform all calculations on the TPM that
would be necessary to condense it exhaustively into complexes and unfold their causeeffect
power in full. The number of possible systems, of system partitions, of candidate distinctions
—each with their partitions and relations—is the result of multiple, nested combinatorial
explosions. Moreover, these observations, manipulations, and calculations would need to be
repeated at many different grains, with many rounds of maximizations. For these reasons, a
full analysis of complexes and their causeeffect structure can only be performed on idealized
systems of a few units [37].
On the other hand, we can simplify the computation considerably by using various assump-
tions and approximations, as with the “cut one” approximation described in [37]. Also, while
the number of relations vastly exceeds the number of units and of distinctions (its upper
bound for a system of n units is 2ð2n 1), it can be determined analytically, and so can ∑φr
for a given set of distinctions S3 Text. Developing tight approximations, as well as bounded
estimates of a systems integrated information (φs and Φ), is one of the main areas of ongoing
research related to IIT [50].
Despite the infeasibility of an exhaustive calculation of the relevant quantities and structures
for a realistic system, IIT already provides considerable explanatory and predictive power in
many real-world situations, making it eminently testable [4, 57, 58]. A fundamental prediction
is that Φ should be high in conscious states, such as wakefulness and dreaming, and low in
unconscious states, such as dreamless sleep and anesthesia. This prediction has already found
substantial support in human studies that have applied measures of complexity inspired by IIT
to successfully classify subjects as conscious vs. unconscious [4, 22, 23, 59]. IIT can also
account mechanistically for the loss of consciousness in deep sleep and anesthesia [4, 47]. Fur-
thermore, it can provide a principled account of why certain portions of the brain may consti-
tute an ideal substrate of consciousness and others may not, why the borders of the main
complex in the brain should be where they are, and why the units of the complex should have
a particular grain (the one that yields a maximum of φs). A stringent prediction is that the loca-
tion of the main complex, as determined by the overall maximum of φs within the brain,
should correspond to its location as determined through clinical and experimental evidence.
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
39 / 45
Another prediction that follows from first principles is that constituents of the main complex
can support conscious contents even if they are mostly inactive, but not if they are inactivated
[4, 11]. Yet another prediction is that the complete inactivation of constituents of the main
complex should lead to absolute agnosia (unawareness that anything is missing).
IIT further predicts that the quality of experience should be accounted for by the way the
Φ-structure is composed, which in turn depends on the architecture of the substrate specifying
it. This was demonstrated in a recent paper showing how the fundamental properties of spatial
experiences—those that make space feel “extended”—can be accounted for by those of Φ-
structures specified by 2D grids of units, such as those found in much of posterior cortex [11].
This prediction is in line with neurological evidence of their role in supporting the experience
of space [11]. Ongoing work aims at accounting for the quality of experienced time and that of
experienced objects (see (16) in S1 Notes). A related prediction is that changes in the strength
of connections within the neural substrate of consciousness should be associated with changes
in experience, even if neural activity does not change [60]. Also, similarities and dissimilarities
in the structure of experience should be accounted for by similarities and dissimilarities
among Φ-structures and Φ-folds specified by the neural substrate of consciousness.
While the listed predictions may appear largely qualitative in nature, many of them rest on
specific features of the accompanying quantitative analysis. This is the case for predictions
regarding the borders (and grain) of the main complex in the brain, which depend on the rela-
tive φs values of potential substrates of interest, and even more so for predictions regarding the
quality and richness of certain experiences and the predicted features of their underlying sub-
strates. IITs postulates, and the mathematical framework proposed to evaluate them, rest on
“inferences to a good explanation” (Box 1). While we have aimed for maximal consistency,
specificity, and simplicity at every junction in formulating IITs mathematical implementation,
some of the algorithmic choices remain open to further evaluation. These include, for example,
the proper treatment of background conditions and the resolution of ties given symmetries in
the TPMs of specific systems (see S1 Text). More generally, further validation of IIT will
depend on a systematic back-and-forth between phenomenology, theoretical inferences, and
neuroscientific evidence [1].
In addition to empirical work aimed at validating the theory, much remains to be done at
the theoretical level. According to IIT, the meaning of an experience is its feeling—whether
those of spatial extendedness, of temporal flow, or of objects, to name but a few (“the meaning
is the feeling”). This means that every meaning is identical to a sub-structure within a current
Φ-structure—a content of experience—whether it is triggered by extrinsic inputs or it occurs
spontaneously during a dream. Therefore, all meaning is ultimately intrinsic. Ongoing work
aims at providing a self-consistent explanation of how intrinsic meanings can capture relevant
features of causal processes in the environment (see (17) in S1 Notes). It will also be important
to explain how intersubjectively validated knowledge can be obtained despite the intrinsic and
partially idiosyncratic nature of meaning.
To the extent that the theory is validated through empirical evidence obtained from the
human brain, IIT can then offer a plausible inferential basis for addressing several questions
that depend on an explicit theory of consciousness. As indicated in the section on phenomenal
and functional equivalence, and argued in ongoing work [20], one consequence of IIT is that
typical computer architectures are not suitable for supporting consciousness, no matter
whether their behavior may resemble ours. By the same token, it can be inferred from IIT that
animal species that may look and behave quite differently from us may be highly conscious, as
long as their brains have a compatible architecture. Other inferences concern our own experi-
ence and whether it plays a causal role, or is simply “along for the ride” while our brain per-
forms its functions. As recently argued, IIT implies that we have true free will—that we have
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
40 / 45
true alternatives, make true decisions, and truly cause. Because only what truly exists (intrinsi-
cally, for itself) can truly cause, we, rather than our neurons, cause our willed actions and are
responsible for their consequences [18].
Finally, an ontology that is grounded in experience as intrinsic existence—an intrinsic
ontology—must not only provide an account of subjective existence in objective, operational
terms, but also offer a path toward a unified view of nature—of all that exists and happens.
One step in this direction is the application of the same postulates that define causal powers
(existence) to the evaluation of actual causes and effects (“what caused what” [10]). Another is
to unify classical accounts of information (as communication and storage of signals) with IITs
notion of information as derived from the properties of experience—that is, information as
causal, intrinsic, specific, maximally irreducible, and structured (meaningful) [8] (see also (18)
in S1 Notes). Yet another is the study of the evolution of a substrates causal powers as condi-
tional probabilities that update themselves [61].
Even so, there are many ways in which IIT may turn out to be inadequate or wrong. Are
some of its assumptions, including those of a discrete, finite set of “atomic” units of cause
effect power, incompatible with current physics [32, 62] (but see [6366])? Are its axiomatic
basis and the formulation of axioms as postulates sound and unique? And, most critically, can
IIT survive the results of empirical investigations assessing the relationship between the quan-
tity and quality of consciousness and its substrate in the brain?
Supporting information
S1 Text. Resolving ties in the IIT algorithm. Operational process for resolving ties due to
maxima / minima in the IIT algorithm.
(PDF)
S2 Text. Comparison to IIT 1.0—3.0 and subsequent publications. Summary of the changes
in IIT 4.0 relative to earlier versions of the theory.
(PDF)
S3 Text. Analytical results for the number and integrated information of relations. State-
ment and proof of theorems describing the number of relations and the sum of their integrated
information, ∑φr.
(PDF)
S1 Fig. IIT Algorithm. Visual summary of the algorithm for identifying complexes and
unfolding causeeffect structures.
(PDF)
S1 Notes. Footnotes.
(PDF)
Author Contributions
Conceptualization: Larissa Albantakis, Leonardo Barbosa, Graham Findlay, Matteo Grasso,
Andrew M. Haun, William Marshall, Alireza Zaeemzadeh, Melanie Boly, Bjørn E. Juel, Jer-
emiah Hendren, Jonathan P. Lang, Giulio Tononi.
Formal analysis: Larissa Albantakis, Leonardo Barbosa, Graham Findlay, Matteo Grasso,
Andrew M. Haun, William Marshall, William G. P. Mayner, Alireza Zaeemzadeh.
Funding acquisition: Larissa Albantakis, William Marshall, Giulio Tononi.
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
41 / 45
Investigation: Larissa Albantakis, Leonardo Barbosa, Graham Findlay, Matteo Grasso,
Andrew M. Haun, William Marshall, William G. P. Mayner, Alireza Zaeemzadeh, Bjørn E.
Juel, Shuntaro Sasai, Keiko Fujii, Isaac David.
Methodology: Larissa Albantakis, Leonardo Barbosa, Graham Findlay, Matteo Grasso,
Andrew M. Haun, William Marshall, William G. P. Mayner, Alireza Zaeemzadeh, Shuntaro
Sasai, Keiko Fujii, Giulio Tononi.
Project administration: Jonathan P. Lang, Giulio Tononi.
Software: William G. P. Mayner, Isaac David.
Supervision: Larissa Albantakis, Giulio Tononi.
Validation: Larissa Albantakis.
Visualization: Larissa Albantakis, Matteo Grasso.
Writing original draft: Larissa Albantakis, Giulio Tononi.
Writing review & editing: Leonardo Barbosa, Graham Findlay, Matteo Grasso, Andrew M.
Haun, William Marshall, William G. P. Mayner, Alireza Zaeemzadeh, Bjørn E. Juel, Isaac
David, Jeremiah Hendren, Jonathan P. Lang.
References
1.
Ellia F, Hendren J, Grasso M, Kozma C, Mindt G, P Lang J, M Haun A, Albantakis L, Boly M, and Tononi
G. Consciousness and the fallacy of misplaced objectivity. Neuroscience of Consciousness. 2021;
2021(2):112. https://doi.org/10.1093/nc/niab032 PMID: 34667639
2.
Nagel T. What is it like to be a bat? The philosophical review. 1974; 83(4):435450. https://doi.org/10.
2307/2183914
3.
Tononi G. Integrated information theory. Scholarpedia. 2015; 10(1):4164. https://doi.org/10.4249/
scholarpedia.4164
4.
Tononi G, Boly M, Massimini M, Koch C. Integrated information theory: from consciousness to its physi-
cal substrate. Nature Reviews Neuroscience. 2016; 17(7):450461. https://doi.org/10.1038/nrn.2016.
44 PMID: 27225071
5.
Tononi G, Sporns O. Measuring information integration. BMC neuroscience. 2003; 4(31):120. https://
doi.org/10.1186/1471-2202-4-31 PMID: 14641936
6.
Tononi G. An information integration theory of consciousness. BMC neuroscience. 2004; 5:42. https://
doi.org/10.1186/1471-2202-5-42 PMID: 15522121
7.
Balduzzi D, Tononi G. Integrated information in discrete dynamical systems: motivation and theoretical
framework. PLoS Comput Biol. 2008; 4(6):e1000091. https://doi.org/10.1371/journal.pcbi.1000091
PMID: 18551165
8.
Oizumi M, Albantakis L, Tononi G. From the Phenomenology to the Mechanisms of Consciousness:
Integrated Information Theory 3.0. PLoS Computational Biology. 2014; 10(5):e1003588. https://doi.org/
10.1371/journal.pcbi.1003588 PMID: 24811198
9.
Balduzzi D, Tononi G. Qualia: the geometry of integrated information. PLoS computational biology.
2009; 5(8):e1000462. https://doi.org/10.1371/journal.pcbi.1000462 PMID: 19680424
10.
Albantakis L, Marshall W, Hoel E, Tononi G. What caused what? A quantitative account of actual causa-
tion using dynamical causal networks. Entropy. 2019; 21(5):459. https://doi.org/10.3390/e21050459
PMID: 33267173
11.
Haun AM, Tononi G. Why Does Space Feel the Way it Does? Towards a Principled Account of Spatial
Experience. Entropy. 2019; 21(12):1160. https://doi.org/10.3390/e21121160
12.
Barbosa LS, Marshall W, Albantakis L, Tononi G. Mechanism Integrated Information. Entropy. 2021; 23
(3):362. https://doi.org/10.3390/e23030362 PMID: 33803765
13.
Marshall W, Grasso M, Mayner WG, Zaeemzadeh A, Barbosa LS, Chastain E, et al. System Integrated
Information. Entropy. 2023; 25. https://doi.org/10.3390/e25020334 PMID: 36832700
14.
Barbosa LS, Marshall W, Streipert S, Albantakis L, Tononi G. A measure for intrinsic information. Scien-
tific Reports. 2020; 10(1):18803. https://doi.org/10.1038/s41598-020-75943-4 PMID: 33139829
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
42 / 45
15.
Intrinsic Ontology Wiki;. Available from: https://centerforsleepandconsciousness.psychiatry.wisc.edu/
intrinsic-ontology-wiki/.
16.
Albantakis L. Integrated information theory. In: Overgaard M, Mogensen J, Kirkeby-Hinrup A, editors.
Beyond Neural Correlates of Consciousness. Routledge; 2020. p. 87103.
17.
Grasso M, Albantakis L, Lang JP, Tononi G. Causal reductionism and causal structures. Nature Neuro-
science. 2021; 24(10):13481355. https://doi.org/10.1038/s41593-021-00911-8 PMID: 34556868
18.
Tononi G, Albantakis L, Boly M, Cirelli C, Koch C. Only what exists can cause: An intrinsic view of free
will. 2022.
19.
Tononi G, Koch C. Consciousness: here, there and everywhere? Philosophical transactions of the
Royal Society of London Series B, Biological sciences. 2015; 370:20140167. https://doi.org/10.1098/
rstb.2014.0167 PMID: 25823865
20.
Findlay G, Marshall W, Albantakis L, Mayner WGP, Koch C, Tononi G. Dissociating Intelligence from
Consciousness in Artificial Systems Implications of Integrated Information Theory. In: Proceedings of
the 2019 Towards Conscious AI Systems Symposium, AAAI SSS19; 2019 and forthcoming.
21.
Albantakis L, Prentner R, Durham I. Measuring the integrated information of a quantum mechanism.
Entropy. 2023; 25.
22.
Massimini M, Ferrarelli F, Huber R, Esser SK, Singh H, Tononi G. Breakdown of cortical effective con-
nectivity during sleep. Science. 2005; 309(5744):22282232. https://doi.org/10.1126/science.1117256
PMID: 16195466
23.
Casarotto S, Comanducci A, Rosanova M, Sarasso S, Fecchio M, Napolitani M, et al. Stratification of
unresponsive patients by an independently validated index of brain complexity. Annals of Neurology.
2016; 80(5):718729. https://doi.org/10.1002/ana.24779 PMID: 27717082
24.
Comolatti, R et al. Why does time feel flowing?; in preparation.
25.
Grasso, M et al. How do phenomenal objects bind general concepts with particular features?; in
preparation.
26.
Janzing D, Balduzzi D, Grosse-Wentrup M, Scho¨lkopf B. Quantifying causal influences. The Annals of
Statistics. 2013; 41(5):23242358. https://doi.org/10.1214/13-AOS1145
27.
Ay N, Polani D. Information Flows in Causal Networks. Advances in Complex Systems. 2008; 11
(01):1741. https://doi.org/10.1142/S0219525908001465
28.
Pearl J. Causality: models, reasoning and inference. vol. 29. Cambridge Univ Press; 2000.
29.
Albantakis L, Tononi G. Causal Composition: Structural Differences among Dynamically Equivalent
Systems. Entropy 2019, Vol 21, Page 989. 2019; 21(10):989.
30.
Cooper J. Plato: Complete Works. Hackett; 1997.
31.
Tillemans T. Dharmak?rti. In: Zalta EN, editor. The Stanford Encyclopedia of Philosophy. Spring 2021
ed. Metaphysics Research Lab, Stanford University; 2021.
32.
Barrett AB, Mediano PAM. The phi measure of integrated information is not well-defined for general
physical systems. Journal of Consciousness Studies. 2019; 26(1-2):1120.
33.
Hoel EP, Albantakis L, Marshall W, Tononi G. Can the macro beat the micro? Integrated information
across spatiotemporal scales. Neuroscience of Consciousness. 2016; 2016(1). https://doi.org/10.1093/
nc/niw012 PMID: 30788150
34.
Marshall W, Albantakis L, Tononi G. Black-boxing and cause-effect power. PLOS Computational Biol-
ogy. 2018; 14(4):e1006114. https://doi.org/10.1371/journal.pcbi.1006114 PMID: 29684020
35.
Hoel EP, Albantakis L, Tononi G. Quantifying causal emergence shows that macro can beat micro.
PNAS. 2013; 110(49):1979019795. https://doi.org/10.1073/pnas.1314922110 PMID: 24248356
36.
Krohn S, Ostwald D. Computing integrated information. Neuroscience of Consciousness. 2017; 2017
(1). https://doi.org/10.1093/nc/nix017 PMID: 30042849
37.
Mayner WGP, Marshall W, Albantakis L, Findlay G, Marchman R, Tononi G. PyPhi: A toolbox for inte-
grated information theory. PLoS Computational Biology. 2018; 14(7):e1006343. https://doi.org/10.
1371/journal.pcbi.1006343 PMID: 30048445
38.
Kanwisher N. Functional specificity in the human brain: a window into the functional architecture of the
mind. Proceedings of the National Academy of Sciences. 2010; 107(25):1116311170. https://doi.org/
10.1073/pnas.1005062107
39.
Ponce CR, Xiao W, Schade PF, Hartmann TS, Kreiman G, Livingstone MS. Evolving images for visual
neurons using a deep generative network reveals coding principles and neuronal preferences. Cell.
2019; 177(4):9991009. https://doi.org/10.1016/j.cell.2019.04.005 PMID: 31051108
40.
Khosla M, Wehbe L. High-level visual areas act like domain-general filters with strong selectivity and
functional specialization. bioRxiv. 2022.
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
43 / 45
41.
Mainen ZF, Sejnowski TJ. Reliability of spike timing in neocortical neurons. Science. 1995; 268
(5216):15031506. https://doi.org/10.1126/science.7770778 PMID: 7770778
42.
Hires SA, Gutnisky DA, Yu J, OConnor DH, Svoboda K. Low-noise encoding of active touch by layer 4
in the somatosensory cortex. eLife. 2015; 4:e06619. https://doi.org/10.7554/eLife.06619 PMID:
26245232
43.
Nolte M, Reimann MW, King JG, Markram H, Muller EB. Cortical reliability amid noise and chaos.
Nature Communications. 2019; 10(1):115. https://doi.org/10.1038/s41467-019-11633-8 PMID:
31439838
44.
Lemon R, Edgley S. Life without a cerebellum. Brain. 2010; 133(3):652654. https://doi.org/10.1093/
brain/awq030 PMID: 20305277
45.
Yu F, Jiang Qj, Sun Xy, Zhang Rw. A new case of complete primary cerebellar agenesis: clinical and
imaging findings in a living patient. Brain. 2015; 138(6):e353e353. https://doi.org/10.1093/brain/
awu239 PMID: 25149410
46.
Steriade M, Nunez A, Amzica F. A novel slow (< 1 Hz) oscillation of neocortical neurons in vivo: depolar-
izing and hyperpolarizing components. Journal of Neuroscience. 1993; 13(8):32523265. https://doi.
org/10.1523/JNEUROSCI.13-08-03252.1993 PMID: 8340806
47.
Pigorini A, Sarasso S, Proserpio P, Szymanski C, Arnulfo G, Casarotto S, et al. Bistability breaks-off
deterministic responses to intracortical stimulation during non-REM sleep. Neuroimage. 2015;
112:105113. https://doi.org/10.1016/j.neuroimage.2015.02.056 PMID: 25747918
48.
Middleton FA, Strick PL. Basal ganglia and cerebellar loops: motor and cognitive circuits. Brain
Research Reviews. 2000; 31(2-3):236250. https://doi.org/10.1016/S0165-0173(99)00040-5 PMID:
10719151
49.
Foster NN, Barry J, Korobkova L, Garcia L, Gao L, Becerra M, et al. The mouse corticobasal ganglia
thalamic network. Nature. 2021; 598(7879):188194. https://doi.org/10.1038/s41586-021-03993-3
PMID: 34616074
50.
Zaeemzadeh, A et al. Upper Bounds for Integrated Information; in preparation.
51.
Boly M, Massimini M, Tsuchiya N, Postle BR, Koch C, Tononi G. Are the neural correlates of conscious-
ness in the front or in the back of the cerebral cortex? Clinical and neuroimaging evidence. Journal of
Neuroscience. 2017; 37(40):96039613. https://doi.org/10.1523/JNEUROSCI.3218-16.2017 PMID:
28978697
52.
Watakabe A, Skibbe H, Nakae K, Abe H, Ichinohe N, Rachmadi MF, et al. Local and long-distance orga-
nization of prefrontal cortex circuits in the marmoset brain. bioRxiv. 2022.
53.
Hanson JR, Walker SI. Formalizing falsification for theories of consciousness across computational
hierarchies. Neuroscience of Consciousness. 2021; 2021(2). https://doi.org/10.1093/nc/niab014 PMID:
34377534
54.
Krohn K, Rhodes J. Algebraic Theory of Machines. I. Prime Decomposition Theorem for Finite Semi-
groups and Machines. Transactions of the American Mathematical Society. 1965; 116:450. https://doi.
org/10.1090/S0002-9947-1965-0188316-1
55.
Albantakis L, Tononi G. The Intrinsic Cause-Effect Power of Discrete Dynamical SystemsFrom Ele-
mentary Cellular Automata to Adapting Animats. Entropy. 2015; 17(8):54725502. https://doi.org/10.
3390/e17085472
56.
Moyal R, Fekete T, Edelman S. Dynamical Emergence Theory (DET): A Computational Account of Phe-
nomenal Consciousness. Minds and Machines. 2020; 30(1):121. https://doi.org/10.1007/s11023-020-
09516-9
57.
Melloni L, Mudrik L, Pitts M, Koch C. Making the hard problem of consciousness easier. Science. 2021;
372(6545):911912. https://doi.org/10.1126/science.abj3259 PMID: 34045342
58.
Sarasso S, Casali AG, Casarotto S, Rosanova M, Sinigaglia C, Massimini M, et al. Consciousness and
complexity: a consilience of evidence. Neuroscience of Consciousness. 2021; 7(2):124.
59.
Sarasso S, DAmbrosio S, Fecchio M, Casarotto S, Viganò A, Landi C, et al. Local sleep-like cortical
reactivity in the awake brain after focal injury. Brain. 2020; 143(12):36723684. https://doi.org/10.1093/
brain/awaa338 PMID: 33188680
60.
Song C, Haun AM, Tononi G. Plasticity in the structure of visual space. Eneuro. 2017; 4(3). https://doi.
org/10.1523/ENEURO.0080-17.2017 PMID: 28660245
61.
Albantakis L, Hintze A, Koch C, Adami C, Tononi G. Evolution of Integrated Causal Structures in Ani-
mats Exposed to Environments of Increasing Complexity. PLoS computational biology. 2014; 10(12):
e1003966. https://doi.org/10.1371/journal.pcbi.1003966 PMID: 25521484
62.
Carroll S. Consciousness and the Laws of Physics. Journal of Consciousness Studies. 2021; 28(9):16
31. https://doi.org/10.53765/20512201.28.9.016
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
44 / 45
63.
Zanardi P, Tomka M, Venuti LC. Towards Quantum Integrated Information Theory. arXiv.
2018;1806.01421.
64.
Kleiner J, Tull S. The Mathematical Structure of Integrated Information Theory. Frontiers in Applied
Mathematics and Statistics. 2021; 6:74. https://doi.org/10.3389/fams.2020.602973
65.
Esteban FJ, Galadı´ JA, Langa JA, Portillo JR, Soler-Toscano F. Informational structures: A dynamical
system approach for integrated information. PLOS Computational Biology. 2018; 14(9):e1006154.
https://doi.org/10.1371/journal.pcbi.1006154 PMID: 30212467
66.
Kalita P, Langa JA, Soler-Toscano F. Informational Structures and Informational Fields as a Prototype
for the Description of Postulates of the Integrated Information Theory. Entropy. 2019; 21(5):493. https://
doi.org/10.3390/e21050493 PMID: 33267207
PLOS COMPUTATIONAL BIOLOGY
Integrated information theory (IIT) 4.0
PLOS Computational Biology | https://doi.org/10.1371/journal.pcbi.1011465
October 17, 2023
45 / 45