Deep Reinforcement Learning Demysitifed (Episode 2) — Policy Iteration, Value Iteration and Q-learning | by Moustafa Alzantot | Medium
reinforcement learning - How can the policy iteration algorithm be model-free if it uses the transition probabilities? - Artificial Intelligence Stack Exchange
Bootcamp Summer 2020 Week 3 – Value Iteration and Q-learning
CS440 Lectures
What is an intuitive explanation of value iteration in reinforcement learning (RL)? - Quora
reinforcement learning - Why do value iteration and policy iteration obtain similar policies even though they have different value functions? - Artificial Intelligence Stack Exchange
4.3 Policy Iteration
Reinforcement Learning Chapter 4: Dynamic Programming (Part 3 — Value Iteration) | by Numfor Tiapo | Mar, 2023 | Medium