Регистрация | Вход
The basic reinforcement learning model consists of: 1. a set of environment and agent states S; 2. a set of actions A of the agent; 3. policies of transitioning from states to actions; 4. rules that determine the scalar immediate reward of a transition; and 5. rules that describe what the agent observes.