Take AISafety.info’s 3 minute survey to help inform our strategy and priorities

Take the survey

What is "agent foundations"?

Some things in the world behave like “agents”: their actions can be understood as directed toward achieving particular goals.

Agent foundations is a research program which tries to understand the nature of agents and their properties, often in a mathematically precise way. It asks questions such as “What are the necessary components of an agent?”, “Can we predict which systems will be agents, and how they will behave?”, and “Which agent designs are tolerant of human error?” There are many open problems in agent foundations research.

Some frameworks which various research groups are developing in order to study agent foundations are embedded agency, Infra-Bayesianism, and shard theory, among others.

Researchers disagree about which features of a system are essential for “agency". Some researchers have a fairly narrow view of what qualifies as an agent, requiring a long list of abilities. Other researchers (including the originators of the agent foundations research program) consider the term “agent” to be ambiguous, and include any system that optimizes towards a "goal” (in a broad sense) in their research on agent foundations.



AISafety.info

We’re a global team of specialists and volunteers from various backgrounds who want to ensure that the effects of future AI are beneficial rather than catastrophic.