Where can I find videos about AI safety?
Also available at aisafety.video
Top recommendation: AI Safety Intro Video playlist
Generally good sources
Channels
- Robert Miles AI Safety (and Rob's videos on Computerphile)
- AI Safety Talks (and its playlists)
- Relevant, but less focused on AI existential risk: Rational Animations, Centre for Effective Altruism, Future of Humanity Institute, Center for Security and Emerging Technology, CSER, SSC meetups, Foresight Institute, Science, Technology & the Future, Berkman Klein Center, Schwartz Reisman Institute, Stanford HAI, Carper AI, Lex Fridman, Digital Humanism, Cognitive Revolution Podcast, NIST AI Metrology Colloquia Series
- AI content without much AI safety: AI Explained, Andrej Karpathy, John Tan Chong Min, Edan Meyer, Yannic Kilcher, Mutual Information, Computerphile, CodeEmporium, sentdex, nPlan, Jay Alammar, Assembly AI, Aleksa Gordić, Simons Institute, 2 Minute Papers, Machine Learning Street Talk, ColdFusion, HuggingFace, AI Coffee Break, Alex Smola, Welcome AI Overlords, Valence Discovery, The Alan Turing Institute, Jordan Harrod, Cambridge Ellis Unit, Weights & Biases, UCL CSML Seminar Series, Harvard Medical AI, IARAI, Alfredo Canziani, Andreas Geiger, CMU AI Seminar, Jeremy Howard, Google Research, AI for Good, IPAM UCLA, One world theoretical ML, What's AI, Stanford MedAI, MILA neural scaling seminars, Digital Engine (sometimes misleading), Steve Brunton, PyTorch, What's AI by Louis Bouchard, Eye on AI, Super Data Science, Waterloo AI, Matt Wolfe, DeepLearningAI, TechTechPotato, Asianometry
- Other languages: Karl Olsberg (German)
Lists
- AI Alignment YouTube Playlists – an excellent resource, with slide-light (reordered) and slide-heavy playlists
- the gears to ascension lists many channels for understanding current capabilities trends
- A ranked list of all EA-relevant documentaries, movies, and TV | Brian Tan on EAF (“AI Safety / Risks” section)
- Towards AGI: Scaling, Alignment & Emergent Behaviors in Neural Nets
Specific suggestions
Note that:
- I haven’t watched all of these videos. Feel free to comment with more recommendations!
- This list does not focus on podcasts, although there are a few podcast recommendations. See this page for some AI safety podcasts.
Introductory
- See also AI safety intros for readings
- AI on the Hill: Why artificial intelligence is a public safety issue (Jeremie Harris)
- AI and Evolution (Dan Hendrycks)
- Intro to AI Safety, Remastered (Rob Miles)
- Connor Leahy, AI Fire Alarm, AI Alignment & AGI Fire Alarm - Connor Leahy (ML Street Talk), and Connor Leahy on AI Safety and Why the World is Fragile
- Eliezer Yudkowsky – AI Alignment: Why It's Hard, and Where to Start; Sam Harris 2018 - IS vs OUGHT, Robots of The Future Might Deceive Us with Eliezer Yudkowsky (full transcript here); and 159 - We’re All Gonna Die with Eliezer Yudkowsky
- Brian Christian, Ben Garfinkel, Richard Ngo, and Paul Christiano on the 80,000 Hours Podcast
- Richard Ngo and Paul Christiano on AXRP
- Ajeya Cotra on how Artificial Intelligence Could Cause Catastrophe
- Jeremie Harris - TDS Podcast Finale: The future of AI, and the risks that come with it
- Rohin Shah on the State of AGI Safety Research in 2021 and AI Alignment: An Introduction | Rohin Shah | EAGxOxford 22
- The Alignment Problem: Machine Learning and Human Values with Brian Christian (Q&A section)
- X-Risk Overview (Dan Hendrycks)
- Myths and Facts About Superintelligent AI (Max Tegmark + minutephysics)
- What happens when our computers get smarter than we are? | Nick Bostrom
- Risks from Advanced AI with Jakub Kraus and AI safety intro talk
- What is the alignment problem? (Samuel Albanie)
- SaTML 2023 - Jacob Steinhardt - Aligning ML Systems with Human Intent
- How We Prevent the AI’s from Killing us with Paul Christiano
Landscape
- Current work in AI alignment | Paul Christiano | EA Global: San Francisco 2019
- Paradigms of AI alignment: components and enablers | Victoria Krakovna | EAGxVirtual 2022
- How to build a safe advanced AI (Evan Hubinger) | What's up in AI safety? (Asya Bergal)
Inner alignment
Inner misalignment: when an AI system ends up pursuing a different objective than the one that was specified.
- The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment
- Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...
Outer alignment
Outer alignment: the problem of making sure that the precise formulation of what we train the AI to do matches what we intend it to do.
- 9 Examples of Specification Gaming
- How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification
- AI Toy Control Problem (Stuart Armstrong)
- AIS via Debate (Joe Collman)
Agent foundations
Agent foundations: a research agenda which tries to understand the nature of agents and their properties.
- Intro to Agent Foundations (Understanding Infra-Bayesianism Part 4)
- EC'21 Tutorial: Designing Agents' Preferences, Beliefs, and Identities (Part 3) and part 4 from FOCAL at CMU
Interpretability
- See the interpretability playground
- ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)
- Feature Visualization & The OpenAI Microscope and Building Blocks of AI Interpretability | Two Minute Papers #234
- 25. Interpretability (MIT 6.S897 Machine Learning for Healthcare, Spring 2019)
- Cohere For AI - Community Talks - Catherine Olsson on Mechanistic Interpretability: Getting Started
- A Walkthrough of A Mathematical Framework for Transformer Circuits
- A Walkthrough of Interpretability in the Wild Part 1/2: Overview (w/ authors Kevin, Arthur, Alex) and part 2
- Reliable and Interpretable Artificial Intelligence -- Lecture 1 (Introduction)
Organizations
- Training machine learning (ML) systems to answer open-ended questions | Andreas Stuhlmuller + Amanda Ngo, Ought | Automating Complex Reasoning (Ought)
Individual researchers
- Peter Railton - A World of Natural and Artificial Agents in a Shared Environment
- Provably Beneficial AI and the Problem of Control and Human-compatible artificial intelligence - Stuart Russell, University of California (Stuart Russell)
- Victoria Krakovna – AGI Ruin, Sharp Left Turn, Paradigms of AI Alignment
- David Krueger – AI Alignment, David Krueger: Existential Safety, Alignment, and Specification Problems
- Holden Karnofsky - Transformative AI & Most Important Century
- Prosaic Intent Alignment (Paul Christiano)
- Timelines for Transformative AI and Language Model Alignment | Ajeya Cotra
- AI Research Considerations for Existential Safety (Andrew Critch)
- Differential Progress in Cooperative AI: Motivation and Measurement (Jesse Clifton and Sammy Martin)
- Open-source learning: A bargaining approach | Jesse Clifton | EA Global: London 2019
- AGISF - Research questions for the most important century - Holden Karnofsky
- Ethan Perez | Discovering language model behaviors with model-written evaluations
Reasoning about future AI
- Optimal Policies Tend To Seek Power (Alex Turner at NeurIPS 2021)
- Why Would AI Want to do Bad Things? Instrumental Convergence
- Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover
Frontier AI regulation
International AI governance
US AI Policy
Compute governance
Hardware supply chain
- Notes on AI Hardware - Benjamin Spector | Stanford MLSys #88
- “The Decision of the Century”: Choosing EUV Lithography, Intel & AMD: The First 30 Years, A Brief History of Semiconductor Packaging, What Goes On Inside a Semiconductor Wafer Fab, and many more from Asianometry
- EUV lithography systems (scroll to “How does EUV work?”)
- The AI Hardware Show 2023, Episode 1: TPU, A100, AIU, BR100, MI250X
- All about AI Accelerators: GPU, TPU, Dataflow, Near-Memory, Optical, Neuromorphic & more (w/ Author)
- ASML's Secret: An exclusive view from inside the global semiconductor giant | VPRO Documentary
- Semiconductor Expert Reveals Why US Export Controls Have Failed
- How ASML, TSMC And Intel Dominate The Chip Market | CNBC Marathon
- How We DESTROYED the NVIDIA H100 GPU: The ULTIMATE Comino Tear Down! COMINO H100 WATERBLOCK TEASER
Misc AI governance
- AI Ethics Seminar with Matthijs Maas - Pausing AI & Technological Restraint - April 25, 2023
- Markus Anderljung – Regulating increasingly advanced AI: some hypotheses
- Sam Altman and William G. Gale discuss Taxation Solutions for Advanced AI
- Paul Scharre & Helen Toner on AI Capabilities & the Nature of Warfare
- Why governing AI is our opportunity to shape the long-term future | Jade Leung | TEDxWarwickSalon + Priorities in AGI governance research | Jade Leung | EA Global: SF 22
- An Introduction to AI Governance | Ben Garfinkel | EAGxVirtual 2022
- The Windfall Clause: Sharing the benefits of advanced AI | Cullen O'Keefe + Sharing the Benefits of AI: The Windfall Clause (Rob Miles)
- Margaret Roberts & Jeffrey Ding: Censorship’s Implications for Artificial Intelligence
- Preparing for AI: risks and opportunities | Allan Dafoe | EAG 2017 London + AI Strategy, Policy, and Governance | Allan Dafoe
- Simeon Campos – Short Timelines, AI Governance, Field Building
- More than Deepfakes (Katerina Sedova and John Bansemer)
- AI and the Development, Displacement, or Destruction of the Global Legal Order (Matthijs Maas)
- Future-proofing AI Governance | The Athens Roundtable on AI and the Rule of Law 2022
- Having Our Cake and Eating It Too with Amanda Askell (covers incentives in AI development)
Ethics
Career planning
- How I think students should orient to AI safety | Buck Shlegeris | EA Student Summit 2020
- AGISF - Careers in AI Alignment and Governance - Alex Lawsen
- Catherine Olsson & Daniel Ziegler on the 80,000 Hours Podcast
- AI Safety Careers | Rohin Shah, Lewis Hammond and Jamie Bernardi | EAGxOxford 22
- Early-Career Opportunities in AI Governance | Lennart Heim, Caroline Jeanmaire | EAGxOxford 22
- Artificial Intelligence Career Stories | EA Student Summit 2020
Forecasting
- Will AI end everything? A guide to guessing | Katja Grace | EAG Bay Area 23
- Neural Scaling Laws and GPT-3 (Jared Kaplan)
- Why and How of Scaling Large Language Models | Nicholas Joseph
- Reasons you might think human level AI soon is unlikely | Asya Bergal | EAGxVirtual 2020
- Existential Risk Pessimism and the Time of Perils | David Thorstad | EAGxOxford 22
- Alex Lawsen – Forecasting AI Progress and Alex Lawsen forecasting videos
- Betting on AI is like betting on semiconductors in the 70's | Danny Hernandez | EA Global: SF 22 + Danny Hernandez on the 80,000 Hours Podcast
- Economic Growth in the Long Run: Artificial Intelligence Explosion or an Empty Planet?
- Moore's Law, exponential growth, and extrapolation! (Steve Brunton)
Capabilities
- Satya Nadella Full Keynote Microsoft Ignite 2022 with Sam Altman (start at 12:25)
- Competition-Level Code Generation with AlphaCode (Paper Review)
- The text-to-image revolution, explained + How the World Cup’s AI instant replay works (Vox)
How AI works
- Transformers, explained: Understand the model behind GPT, BERT, and T5; Transformers for beginners | What are they and how do they work (see the attention sketch after this list)
- Reinforcement learning playlist (Steve Brunton)
- How AI Image Generators Work (Stable Diffusion / Dall-E) - Computerphile + Stable Diffusion in Code (AI Image Generation) - Computerphile
- What is a transformer? + Implementing GPT-2 from scratch (Neel Nanda)
- The spelled-out intro to neural networks and backpropagation: building micrograd
- DeepMind x UCL RL Lecture Series - Introduction to Reinforcement Learning [1/13]
- Deep Learning for Computer Vision (Justin Johnson) lecture videos
- CS25 I Stanford Seminar - Transformers United: DL Models that have revolutionized NLP, CV, RL
- Broderick: Machine Learning, MIT 6.036 Fall 2020 + course page
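To complement the transformer explainers above, here is a minimal sketch of scaled dot-product self-attention, the core operation inside GPT-style models. It is illustrative only: the weight matrices are random rather than learned, the dimensions are arbitrary, and real models add multiple attention heads, masking, and stacked layers.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Single-head self-attention. X: (seq_len, d_model) token embeddings."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv          # queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[-1])   # similarity of every token pair, scaled
    weights = softmax(scores, axis=-1)        # one attention distribution per token
    return weights @ V                        # each output is a weighted mix of value vectors

# Toy example: 4 tokens with 8-dimensional embeddings and random (untrained) weights.
rng = np.random.default_rng(0)
seq_len, d_model = 4, 8
X = rng.normal(size=(seq_len, d_model))
Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # (4, 8): one updated vector per token
```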
China
- Re-deciphering China’s AI dream | Jeffrey Ding | EA Global: London 2019
- Sino-Western cooperation in AI safety | Brian Tse | EA Global: San Francisco 2019
Rationality
- Effective behavior change | Spencer Greenberg | EA Global: San Francisco 2019
- Making high impact decisions | Anna Edmonds | EA Global: SF 22
- Decision-making workshop: learn how to make better decisions | Spencer Greenberg
- Decoupling: a technique for reducing bias | David Manley | EA Student Summit 2020
Debates / discussions between people with different perspectives
Misc
- Slaughterbots + Why We Should Ban Lethal Autonomous Weapons + A.I. Is Making it Easier to Kill (You). Here’s How. | NYT
- Tobias Baumann on Artificial Sentience and Reducing the Risk of Astronomical Suffering
- The Doomsday Argument | PBS Space Time
- Forming your own views on AI safety (without stress!) | Neel Nanda | EA Global: SF 22 – also see Neel's presentation slides and "Inside Views Resources" doc
- Applied Linear Algebra Lectures (John Wentworth)
- AI alignment, philosophical pluralism, and the relevance of non-Western philosophy | Tan Zhi Xuan
- Moloch section of Liv Boeree interview with Lex Fridman (and Ginsberg)
- ChatGPT in Context. Part 1 - The Transformer, a Revolution in Computation (Piero Scaruffi)
- Is AI (ChatGPT, etc.) Sentient? A Perspective from Early Buddhist Psychology
Watching videos in a group
Discussion prompts
- Paul Christiano on AI alignment - discussion + Paul Christiano alignment chart
- Allan Dafoe on AI strategy, policy, and governance - discussion
- Vael Gates: Researcher Perceptions of Current and Future AI - discussion
- Sam Harris and Eliezer Yudkowsky on “AI: Racing Toward the Brink” - discussion
Higher-level meeting tips
- Show the video → people discuss afterwards with prompts
- Active learning techniques: here
- You can skip around through parts of the video!
- See “Discussion Groups” from the EA Groups Resource Center
- Make the official meeting end after 1 hour so people are free to leave, but give people the option to linger and continue their discussion.
- You can also do readings instead of videos, similar to this. Or play around with a model, e.g. test out hypotheses about how a language model works (see the sketch after this list).
- Try to keep the video portion under 20 minutes unless the video is really interesting.
- For a short video you could watch one of Apart’s ML + AI safety updates. Some of these contain many topics, so people can discuss what they find interesting.
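For the “play around with a model” option, here is a minimal sketch of probing a small language model in a group setting. It assumes the Hugging Face transformers library is installed (pip install transformers torch); the model (GPT-2) and the prompt are arbitrary examples, not specific recommendations.

```python
from transformers import pipeline

# Load a small, freely available model; weights download on first run.
generator = pipeline("text-generation", model="gpt2")

# Example hypothesis a group might test: does the model continue a simple
# counting pattern from the prompt? Try variations and compare the outputs.
prompt = "one, two, three, four,"
result = generator(prompt, max_new_tokens=10, do_sample=False)
print(result[0]["generated_text"])
```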