Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more. Author: Maxim Lapan. Publisher: Packt Publishing (2018-06-20).