Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more Author: Maxim Lapan Publisher: Packt Publishing (2018-06-20) fbd98b93-5496-4c7c-b