What does generative visualization look like in reinforcement learning?

1 min read

Suggest changes in Google Docs

Generative visualization for an image classifier means showing an input image which causes the classifier to strongly recognize a feature in that image.

In reinforcement learning (RL), generative visualization means showing a sequence of observations which make an RL agent strongly want to take a specific action.

The problem with generative visualization in RL is that the space of possible observations is constrained by the transition function. So, if we optimize the observations to maximize the activation for an action, this will likely result in a sequence of observations which is impossible (incompatible with the transition function).

A way around this is to compute an embedding of possible observation sequences and to optimize in the embedding space instead of the observation space. A relevant paper here is Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents.

What is reinforcement learning (RL)?

What is feature visualization?

What does generative visualization look like in reinforcement learning?

In progress