Cartpole DQN
This is an introductory tutorial to Reinforcement Learning. To build everything up from the basics, I will start with a simple game called CartPole.
Cartpole Double DQN
This is the second reinforcement learning tutorial, where we'll use two (Double) neural networks to train our main model.
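To give a rough idea of what "two networks" means here, the sketch below (hypothetical helper, assuming Keras-style models with a `predict` method, not the tutorial's exact code) shows how a Double DQN target can be formed: the online network chooses the next action, and the separate target network evaluates it.

```python
import numpy as np

def double_dqn_targets(model, target_model, states, actions, rewards,
                       next_states, dones, gamma=0.99):
    """Double DQN targets: the online model picks the next action,
    the target model evaluates it."""
    q_values = model.predict(states, verbose=0)                 # Q(s, .) from the online network
    next_online = model.predict(next_states, verbose=0)         # used to choose a* = argmax_a Q(s', a)
    next_target = target_model.predict(next_states, verbose=0)  # used to evaluate Q_target(s', a*)

    best_actions = np.argmax(next_online, axis=1)
    targets = q_values.copy()
    for i in range(len(states)):
        if dones[i]:
            targets[i, actions[i]] = rewards[i]
        else:
            targets[i, actions[i]] = rewards[i] + gamma * next_target[i, best_actions[i]]
    return targets
```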
Cartpole Dueling DDQN
In this post, we'll cover Dueling DQN networks for reinforcement learning. This architecture is an improvement over the agent from our previous DDQN tutorial.
Epsilon-Greedy DQN
In this part we'll cover the Epsilon-Greedy method used in Deep Q-Learning, and we'll prepare our source code for the PER method.
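For reference, here is a minimal sketch of the epsilon-greedy rule with a simple decay schedule (illustrative values, not the tutorial's exact settings):

```python
import random
import numpy as np

def epsilon_greedy_action(q_values, epsilon):
    """With probability epsilon explore (random action),
    otherwise exploit the action with the highest predicted Q-value."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return int(np.argmax(q_values))

# A common schedule: start fully random and decay toward a small floor.
epsilon, epsilon_min, epsilon_decay = 1.0, 0.01, 0.999
epsilon = max(epsilon_min, epsilon * epsilon_decay)  # applied once per step or episode
```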
Prioritized Experience Replay
Now we will change the sampling distribution by using a criterion that defines the priority of each experience tuple.
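To make the idea concrete, here is a toy sketch of proportional prioritization (list-based for clarity; a real implementation typically uses a SumTree, and this omits importance-sampling weights):

```python
import numpy as np

class SimplePER:
    """Toy proportional prioritized replay: sample transitions with probability
    proportional to priority^alpha, where priority comes from the TD error."""
    def __init__(self, capacity=10000, alpha=0.6, eps=1e-6):
        self.capacity, self.alpha, self.eps = capacity, alpha, eps
        self.buffer, self.priorities = [], []

    def add(self, transition, td_error):
        if len(self.buffer) >= self.capacity:   # drop the oldest entry when full
            self.buffer.pop(0)
            self.priorities.pop(0)
        self.buffer.append(transition)
        self.priorities.append((abs(td_error) + self.eps) ** self.alpha)

    def sample(self, batch_size):
        probs = np.array(self.priorities) / np.sum(self.priorities)
        idx = np.random.choice(len(self.buffer), batch_size, p=probs)
        return [self.buffer[i] for i in idx], idx
```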
DQN PER with CNN
Now I will show you how to implement DQN with a CNN. After this tutorial, you'll be able to create an agent that successfully plays almost 'any' game using only pixel inputs.
Pong with DQN
In this tutorial, I'll implement a deep neural network for reinforcement learning (a Deep Q-Network), and we'll watch it learn until it finally becomes good enough to beat the computer at Pong!
RL agents Beyond DQN
To wrap up deep reinforcement learning, I'll introduce the types of agents beyond DQNs (value optimization, model-based, policy optimization, and imitation learning). We'll implement Policy Gradient!
Advantage Actor-Critic (A2C)
Today, we'll study a reinforcement learning method we can call a 'hybrid method': Actor-Critic. This algorithm combines the value optimization and policy optimization approaches.
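Conceptually, the "hybrid" means one head (the critic) learns state values while the other (the actor) learns the policy. A minimal NumPy sketch of the two loss terms (illustrative only, not the tutorial's Keras code):

```python
import numpy as np

def actor_critic_losses(log_probs, values, returns):
    """Illustrative per-step losses: the critic regresses the return,
    the actor is pushed toward actions with positive advantage."""
    advantages = returns - values                  # how much better than expected
    actor_loss = -np.mean(log_probs * advantages)  # policy-gradient term
    critic_loss = np.mean(advantages ** 2)         # value-regression term
    return actor_loss, critic_loss
```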
Asynchronous Actor Critic (A3C)
In this tutorial I will provide an implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm in TensorFlow and Keras. We will use it to solve a simple challenge in the Pong environment!
Policy Optimization (PPO)
In this tutorial we'll dive into the PPO architecture and implement a Proximal Policy Optimization (PPO) agent that learns to play Pong-v0.
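The core of PPO is the clipped surrogate objective; here is a small NumPy sketch of it (illustrative only, not the tutorial's exact Keras loss):

```python
import numpy as np

def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """PPO's clipped surrogate: keep the probability ratio close to 1 so
    each policy update stays near the behaviour policy."""
    ratio = np.exp(new_log_probs - old_log_probs)
    unclipped = ratio * advantages
    clipped = np.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    return np.mean(np.minimum(unclipped, clipped))  # maximize this (negate for a loss)
```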