Where can I find videos about AI safety?

23 min read

Suggest changes in Google Docs

aisafety.video redirects here

Top recommendation: AI Safety Intro Video playlist

Generally good sources

Channels

Robert Miles AI Safety (and Rob's videos on Computerphile)
Centre for the Governance of AI (GovAI)
Apart Research
Neel Nanda
Center for AI Safety
Towards Data Science
The Inside View
Future of Life Institute
SERI
AI Safety Talks (and its playlists)
AI Safety Reading Group
Mechanistic Interpretability
Intro to ML Safety
AISS discussion days and AISS YouTube
Victoria Krakovna AI talks
Jack Parker
PIBBSS
MIRI
The Future Society
Cooperative AI Foundation
Quantified Uncertainty Research Institute
Robin Hanson AI Risk Conversations
Short-form channels:
- Aric Floyd (for 80,000 Hours)
- Chana Messinger (for 80,000 Hours)
- AI Curious
Relevant, but less focused on AI existential risk: Rational Animations, Centre for Effective Altruism, Future of Humanity Institute, Center for Security and Emerging Technology, CSER, SSC meetups, Foresight Institute, Science, Technology & the Future, Berkman Klein Center, Schwartz Reisman Institute, Stanford HAI, Carper AI, Lex Fridman, Digital Humanism, Cognitive Revolution Podcast, NIST AI Metrology Colloquia Series
AI content without much AI safety: AI Explained, Andrej Karpathy, John Tan Chong Min, Edan Meyer, Yannic Kilcher, Mutual Information, Computerphile, CodeEmporium, sentdex, nPlan, Jay Alammar, Assembly AI, Aleksa Gordić, Simons Institute, 2 Minute Papers, Machine Learning Street Talk, ColdFusion, HuggingFace, AI Coffee Break, Alex Smola, Welcome AI Overlords, Valence Discovery, The Alan Turing Institute, Jordan Harrod, Cambridge Ellis Unit, Weights & Biases, UCL CSML Seminar Series, Harvard Medical AI, IARAI, Alfredo Canziani, Andreas Geiger, CMU AI Seminar, Jeremy Howard, Google Research, AI for Good, IPAM UCLA, One world theoretical ML, What's AI, Stanford MedAI, MILA neural scaling seminars, Digital Engine (sometimes misleading), Steve Brunton, PyTorch, What's AI by Louis Bouchard, Eye on AI, Super Data Science, Waterloo AI, Matt Wolfe, DeepLearningAI, TechTechPotato, Asianometry
Other languages: Karl Olsberg (German)

Lists

AI Alignment YouTube Playlists – excellent resource. Slide-light (reordered) and slide-heavy playlists.
the gears to ascenscion lists many channels for understanding current capabilities trends
AI Safety Support "Lots of Links": Videos
A ranked list of all EA-relevant documentaries, movies, and TV | Brian Tan on EAF (“AI Safety / Risks” section)
San Francisco Alignment Workshop 2023
Towards AGI: Scaling, Alignment & Emergent Behaviors in Neural Nets

Specific suggestions

Note that:

I haven’t watched all of these videos. Feel free to comment with more recommendations!
This list does not focus on podcasts, although there are a few podcast recommendations. See this page for some AI safety podcasts.

Introductory

Landscape

Inner alignment

Outer alignment

9 Examples of Specification Gaming
How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification
AI Toy Control Problem (Stuart Armstrong)
AIS via Debate (Joe Collman)
Another Outer Alignment Failure Story

Agent foundations

Interpretability

Organizations

Individual researchers

Reasoning about future AI

Frontier AI regulation

Hardware supply chain

Misc AI governance

Ethics

Career planning

Forecasting

Capabilities

How AI works

China

Rationality

Debates / discussions between people with different perspectives

Existential Risks of AI - Debate with Stuart Russell at Pakhuis de Zwijger

Misc

Watching videos in a group

Discussion prompts

Higher-level meeting tips

Show the video → people discuss afterwards with prompts
Active learning techniques: here
You can skip around through parts of the video!
See “Discussion Groups” from the EA Groups Resource Center
Make the official meeting end after 1 hour so people are free to leave, but give people the option to linger for longer and continue their discussion.
You can also do readings instead of videos, similar to this. Or play around with a model (e.g. test out hypotheses about how a language model works).
Try to keep the video portion to under 20 minutes unless the video is really interesting.
For a short video you could watch one of Apart’s ML + AI safety updates. Some of these contain many topics, so people can discuss what they find interesting.

What are some other introductions to AI safety?

What links are especially valuable to share on social media or other contexts?

What are some good podcasts about AI safety?

Where can I find videos about AI safety?

Generally good sources

Channels

Lists

Specific suggestions

Introductory

Landscape

Inner alignment

Outer alignment

Agent foundations

Interpretability

Organizations

Individual researchers

Reasoning about future AI

Frontier AI regulation

International AI governance

US AI Policy

Compute governance

Hardware supply chain

Misc AI governance

Ethics

Career planning

Forecasting

Capabilities

How AI works

China

Rationality

Debates / discussions between people with different perspectives

Misc

Watching videos in a group

Discussion prompts

Higher-level meeting tips