Deep Blue to AlphaGo – Why Is It So Much Better?

20 years after Deep Blue defeated the World Champion at Chess, Alpha Go did the same for the World Champion at Go. What are the key changes that make it so much better? Deep Blue Excerpts from: https://en.wikipedia.org/wiki/Deep_Blue_(chess_computer) The system derived its playing strength mainly from brute force computing power. It was a massively parallel, …

Deep Blue to AlphaGo – Why Is It So Much Better? Read More »

OpenAI Lunar Lander – Solving with Vanilla DQN (aka Reinforcement Learning with Experience Replay)

In order to understand what is all the buzz about DeepMind’s reinforcement learning papers, I decided to implement Deep Reinforcement Learning with: Double Q-learning Experience Replay The neural network was then trained on the OpenAI Lunar Lander environment. I did my best to implement the above in TensorFlow with just the paper published by DeepMind …

OpenAI Lunar Lander – Solving with Vanilla DQN (aka Reinforcement Learning with Experience Replay) Read More »