How can progress in non-agentic LLMs lead to capable AI agents?
AutoGPT is an example of an agent
A system that can be understood as taking actions towards achieving a goal.
A family of pretrained large language models by OpenAI, they power ChatGPT.
One threat model which includes a GPT-style AI is “misaligned model-based [reinforcement learning A machine learning method in which the machine gets rewards based on its actions, and is adjusted to be more likely to take actions that lead to high reward.
A risk of human extinction or the destruction of humanity’s long-term potential.
Something that can improve some process or physical artifact so that it is fit for a certain purpose or fulfills some set of requirements.
A more speculative possibility is that a sufficiently powerful world model may develop a mesa-optimizer An algorithm that is created by optimization and that is also itself an optimizer.