Reinforcement Learning by Sutton and Barto
Chapter 1