Reinforcement Learning
Policy Evaluation
- The intensity of the blue bar shows how strongly a state value influences the weighted average
- The history of each state's value is shown as a bar chart inside that state
- Green indicates positive state value
- Red indicates negative state value
- Dark gray indicates the terminal states
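
The weighted average in the legend above is the backup used by iterative policy evaluation: each state value is replaced by the probability-weighted average of reward plus discounted successor value. The following is a minimal sketch of that idea, not the code behind the visualization. The five-state corridor, the reward values, the uniform policy, and all names (`step`, `evaluate_policy`, `uniform_policy`) are assumptions made for illustration.

```python
# Hypothetical 1-D corridor: states 0..4, with 0 and 4 terminal (assumption).
N_STATES = 5
TERMINAL = {0, 4}
GAMMA = 1.0                      # no discounting in this sketch
REWARDS = {0: -1.0, 4: 1.0}      # reward for entering a state (assumption)


def step(state, action):
    """Deterministic move; action is -1 (left) or +1 (right)."""
    next_state = state + action
    return next_state, REWARDS.get(next_state, 0.0)


def evaluate_policy(policy, sweeps=50):
    """Run synchronous policy-evaluation sweeps.

    Returns the final state values and, for each state, the list of
    values it held after every sweep (the "state value history").
    """
    values = [0.0] * N_STATES
    history = [[0.0] for _ in range(N_STATES)]
    for _ in range(sweeps):
        new_values = values[:]
        for s in range(N_STATES):
            if s in TERMINAL:
                continue  # terminal states keep value 0
            # Weighted average over actions: the action probability is the
            # weight given to each successor's reward plus discounted value.
            new_values[s] = sum(
                prob * (reward + GAMMA * values[next_s])
                for action, prob in policy[s].items()
                for next_s, reward in [step(s, action)]
            )
        values = new_values
        for s in range(N_STATES):
            history[s].append(values[s])
    return values, history


# Uniform random policy: left and right with equal probability.
uniform_policy = {s: {-1: 0.5, +1: 0.5} for s in range(N_STATES)}
values, history = evaluate_policy(uniform_policy)
print(values)
```

In this sketch the state next to the negative terminal ends up with a negative value and the state next to the positive terminal with a positive one, which corresponds to the red and green coloring described above.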