UDEMY
UDEMY Logo

Artificial Intelligence: Reinforcement Learning in Python 

  • Offered byUDEMY

Artificial Intelligence: Reinforcement Learning in Python
 at 
UDEMY 
Overview

Complete guide to Reinforcement Learning, with Stock Trading and Online Advertising Applications

Duration

15 hours

Total fee

2,999

Mode of learning

Online

Credential

Certificate

Artificial Intelligence: Reinforcement Learning in Python
 at 
UDEMY 
Highlights

  • Earn a certificate of completion from Udemy
  • Get full lifetime access of the course material
  • Comes with 30 days money back guarantee
Details Icon

Artificial Intelligence: Reinforcement Learning in Python
 at 
UDEMY 
Course details

Skills you will learn
Who should do this course?
  • For Anyone who wants to learn about artificial intelligence, data science, machine learning, and deep learning
  • For Both students and professionals
More about this course
  • You?ll learn in this course, there are many analogous processes when it comes to teaching an agent and teaching an animal or even a human
  • Understand the relationship between reinforcement learning and psychology
  • Apply gradient-based supervised machine learning methods to reinforcement learning

Artificial Intelligence: Reinforcement Learning in Python
 at 
UDEMY 
Curriculum

Welcome

Introduction

Where to get the Code

Strategy for Passing the Course

Course Outline

Return of the Multi-Armed Bandit

Problem Setup and The Explore-Exploit Dilemma

Applications of the Explore-Exploit Dilemma

Epsilon-Greedy

Updating a Sample Mean

Designing Your Bandit Program

Comparing Different Epsilons

Optimistic Initial Values

UCB1

Bayesian / Thompson Sampling

Thompson Sampling vs. Epsilon-Greedy vs. Optimistic Initial Values vs. UCB1

Nonstationary Bandits

Bandit Summary, Real Data, and Online Learning

(Optional) Alternative Bandit Designs

High Level Overview of Reinforcement Learning

What is Reinforcement Learning?

On Unusual or Unexpected Strategies of RL

Defining Some Terms

Build an Intelligent Tic-Tac-Toe Agent

Naive Solution to Tic-Tac-Toe

Components of a Reinforcement Learning System

Notes on Assigning Rewards

The Value Function and Your First Reinforcement Learning Algorithm

Tic Tac Toe Code: Outline

Tic Tac Toe Code: Representing States

Tic Tac Toe Code: Enumerating States Recursively

Tic Tac Toe Code: The Environment

Tic Tac Toe Code: The Agent

Tic Tac Toe Code: Main Loop and Demo

Tic Tac Toe Summary

Tic Tac Toe: Exercise

Markov Decision Proccesses

Gridworld

The Markov Property

Defining and Formalizing the MDP

Future Rewards

Value Function Introduction

Value Functions

Bellman Examples

Optimal Policy and Optimal Value Function

MDP Summary

Dynamic Programming

Intro to Dynamic Programming and Iterative Policy Evaluation

Gridworld in Code

Designing Your RL Program

Iterative Policy Evaluation in Code

Policy Improvement

Policy Iteration

Policy Iteration in Code

Policy Iteration in Windy Gridworld

Value Iteration

Value Iteration in Code

Dynamic Programming Summary

Monte Carlo

Monte Carlo Intro

Monte Carlo Policy Evaluation

Monte Carlo Policy Evaluation in Code

Policy Evaluation in Windy Gridworld

Monte Carlo Control

Monte Carlo Control in Code

Monte Carlo Control without Exploring Starts

Monte Carlo Control without Exploring Starts in Code

Monte Carlo Summary

Temporal Difference Learning

Temporal Difference Intro

TD(0) Prediction

TD(0) Prediction in Code

SARSA

SARSA in Code

Q Learning

Q Learning in Code

TD Summary

Approximation Methods

Approximation Intro

Linear Models for Reinforcement Learning

Features

Monte Carlo Prediction with Approximation

Monte Carlo Prediction with Approximation in Code

TD(0) Semi-Gradient Prediction

Semi-Gradient SARSA

Semi-Gradient SARSA in Code

Course Summary and Next Steps

Stock Trading Project with Reinforcement Learning

Stock Trading Project Section Introduction

Data and Environment

How to Model Q for Q-Learning

Design of the Program

Code pt 1

Code pt 2

Code pt 3

Code pt 4

Stock Trading Project Discussion

Appendix / FAQ

What is the Appendix?

Windows-Focused Environment Setup 2018

How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow

How to Code by Yourself (part 1)

How to Code by Yourself (part 2)

How to Succeed in this Course (Long Version)

Is this for Beginners or Experts? Academic or Practical? Fast or slow-paced?

Proof that using Jupyter Notebook is the same as not using it

Python 2 vs Python 3

What order should I take your courses in? (part 1)

What order should I take your courses in? (part 2)

BONUS: Where to get discount coupons and FREE deep learning material

Other courses offered by UDEMY

549
50 hours
– / –
3 K
10 hours
– / –
549
4 hours
– / –
599
10 hours
– / –
View Other 2344 CoursesRight Arrow Icon

Artificial Intelligence: Reinforcement Learning in Python
 at 
UDEMY 
Students Ratings & Reviews

3/5
Verified Icon1 Rating
S
SAURAV KUMAR SINGH
Artificial Intelligence: Reinforcement Learning in Python
Offered by UDEMY
3
Other: A few can be better but if u are a beginner I recommend it but if you have any knowledge on these topic don't take it
Reviewed on 12 Nov 2020Read More
Thumbs Up IconThumbs Down Icon
View 1 ReviewRight Arrow Icon
qna

Artificial Intelligence: Reinforcement Learning in Python
 at 
UDEMY 

Student Forum

chatAnything you would want to ask experts?
Write here...