What is reinforcement learning (RL)?
Reinforcement learning (RL) trains an AI by rewarding behavior we want and punishing behavior we don't want. The AI will repeat behaviors that have helped it get higher rewards in the past, and avoid behaviors that gave negative or low reward. For example, a recommendation system (such as the one on Netflix) sees which recommendations you clicked on in the past and shows you other movies you are likely to click on, based on what people who liked similar things also liked.
RL is one of the three basic machine learning
An approach to AI in which, instead of designing an algorithm directly, we have the system search through possible algorithms based on how well they do on some training data.
Due to its generality, reinforcement learning is studied in many disciplines, such as game theory, control theory, operations research, information theory, simulation-based optimization, multi-agent systems, swarm intelligence, and statistics.