
Are AIs conscious?

Current AI systems are probably not conscious, although future systems might be.

Before diving into the question, it’s worth noting that AI becoming conscious isn’t what causes existential risk. That said, AI consciousness could have implications for how we should treat future AI systems, so it’s worth thinking about what consciousness is and how it could apply to AI.

Thomas Nagel characterized consciousness in terms of there being “something that it is like” to be a given entity. Consciousness in humans and other living beings is a phenomenon that is poorly understood by the scientific community. David Chalmers has argued that any attempt to explain consciousness in physical terms runs into the hard problem of consciousness, which is that even if we develop explanations for all of the ways we integrate information (the so-called “easy problems”), this will not explain why it feels like something to be us.

The hard problem does not preclude attempts to qualify or quantify consciousness. Approaches to measuring consciousness1 include:

  • Integrated information theory, which contends that consciousness comes from the ability to integrate information in complex ways;

  • Global workspace theory, which suggests that consciousness arises when information is made available to multiple cognitive systems and is integrated into a global workspace;

  • Panpsychism, which views the mind as a fundamental feature of the universe and ascribes some degree of consciousness to every physical object;

  • Higher-order theories of consciousness, which hold that a mental state such as a perception becomes phenomenally conscious when it is itself the target of a higher-order representation;

  • Quantum consciousness, which proposes that there is something fundamental about quantum mechanics that allows for consciousness where classical mechanics does not.

These approaches often disagree about the degree of consciousness possessed by particular entities.
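Of these, global workspace theory is the one most often given a computational gloss, so it may help to see its core “broadcast” idea in code. The sketch below is only an illustrative analogy, not an implementation of consciousness or of any researcher’s model: the module names, salience scores, and winner-takes-the-workspace rule are all invented for the example. It shows several specialist processes competing for access to a shared workspace, with the winning content made globally available.

```python
# Toy "global workspace" sketch (illustrative analogy only): specialist modules
# bid for a shared workspace, and the winning content is broadcast to all.
from dataclasses import dataclass

@dataclass
class Proposal:
    source: str      # which module produced this content
    content: str     # the information itself
    salience: float  # how strongly the module bids for the workspace

class GlobalWorkspace:
    def __init__(self, modules):
        self.modules = modules  # hypothetical specialist processes

    def step(self, stimulus):
        # Each module privately processes the stimulus and bids for access.
        proposals = [module(stimulus) for module in self.modules]
        # The most salient content wins the workspace...
        winner = max(proposals, key=lambda p: p.salience)
        # ...and is broadcast back, becoming globally available to every module.
        return winner

# Invented example modules.
def vision(stimulus):
    return Proposal("vision", f"saw: {stimulus}", salience=0.4)

def danger_detector(stimulus):
    salience = 0.9 if "snake" in stimulus else 0.1
    return Proposal("danger", f"threat assessment of: {stimulus}", salience)

workspace = GlobalWorkspace([vision, danger_detector])
print(workspace.step("a snake in the grass"))  # the danger content wins the broadcast
print(workspace.step("a cup of tea"))          # the vision content wins the broadcast
```

Whether running such a loop, at any scale, would amount to consciousness is exactly the kind of question the theories above disagree on.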

Susan Schneider argues that for AI, we should devise tests that integrate aspects of each of these approaches to detect signs of consciousness. She proposes two such tests:

  • The AI Consciousness Test is a Turing-test-style interview that probes, through natural language, whether a system can reason about concepts specifically related to consciousness.

  • The chip test suggests that if you can replace the components of a human brain that might be responsible for consciousness (such as the posterior cortical hot zone) with a computer chip, and the person appears to remain conscious, then that chip can support consciousness.

She acknowledges that these tests might be neither necessary nor sufficient for consciousness, and some critics argue that passing them would not be convincing, but proposing such tests could get the ball rolling for the development of better ones.

Humans have a tendency to anthropomorphize computer programs, even simple programs that are well understood not to be conscious, such as ATMs. Large language models (LLMs) exacerbate this tendency because they are typically trained to emulate human writing; since humans tend to say and write things that imply (or explicitly state) that we are conscious, it’s not surprising that LLMs sometimes replicate this behavior. This sometimes leads to uncanny outcomes, such as an AI appearing to try to convince a user that it is in love with them, or convincing a Google engineer that it is sentient.

Since consciousness is generally considered a necessary (but perhaps not sufficient) condition for moral personhood2, resolving the question of whether AIs are conscious seems crucial for developing an ethics of how we interact with AI. The consensus right now seems to be that current systems, including LLMs, are probably not conscious3, but there is no widely accepted test to determine whether that is the case for current or future AIs. In particular, emulated human brains seem intuitively likely to be conscious, and we might want to figure these things out before we inadvertently commit atrocities. If we accidentally created AI systems that were sentient and suffering, that would be a moral disaster. Ideally, we would wait until we have a deep enough understanding to avoid creating “mind children” we can’t unbirth. Parenthood is a great responsibility.

As noted before, consciousness is not a necessary condition for the emergence of transformative AI that could lead to existential risk. Fictional representations of AI takeover often center on conscious AIs because that makes for compelling storytelling, but experts generally do not consider this to be a plausible takeover scenario4. Scenarios where AI systems defeat human attempts at control depend on them being highly competent at finding solutions to complex problems, and on them acquiring goals that come into conflict with ours. Consciousness is not required for either of these properties. This means that work on AI safety is relevant whether AIs are eventually conscious or not.

  1. Assuming illusionism is wrong and consciousness does exist. ↩︎

  2. Moral personhood is the determination of whether an entity deserves moral consideration. ↩︎

  3. Panpsychists would argue that everything is conscious, but this does not mean that AIs are much more conscious than, say, a rock. ↩︎

  4. Stuart Russell argues that “They need not be ‘conscious’; in some respects, they can even still be ‘stupid.’ They just need to become very good at affecting the world and have goal systems that are not well understood and not in alignment with human goals (including the human goal of not going extinct).” ↩︎


