Artificial Intelligence: Reinforcement Learning in Python
- Offered byUDEMY
Artificial Intelligence: Reinforcement Learning in Python at UDEMY Overview
Duration | 15 hours |
Total fee | ₹2,999 |
Mode of learning | Online |
Credential | Certificate |
Artificial Intelligence: Reinforcement Learning in Python at UDEMY Highlights
- Earn a certificate of completion from Udemy
- Get full lifetime access of the course material
- Comes with 30 days money back guarantee
Artificial Intelligence: Reinforcement Learning in Python at UDEMY Course details
- For Anyone who wants to learn about artificial intelligence, data science, machine learning, and deep learning
- For Both students and professionals
- You?ll learn in this course, there are many analogous processes when it comes to teaching an agent and teaching an animal or even a human
- Understand the relationship between reinforcement learning and psychology
- Apply gradient-based supervised machine learning methods to reinforcement learning
Artificial Intelligence: Reinforcement Learning in Python at UDEMY Curriculum
Welcome
Introduction
Where to get the Code
Strategy for Passing the Course
Course Outline
Return of the Multi-Armed Bandit
Problem Setup and The Explore-Exploit Dilemma
Applications of the Explore-Exploit Dilemma
Epsilon-Greedy
Updating a Sample Mean
Designing Your Bandit Program
Comparing Different Epsilons
Optimistic Initial Values
UCB1
Bayesian / Thompson Sampling
Thompson Sampling vs. Epsilon-Greedy vs. Optimistic Initial Values vs. UCB1
Nonstationary Bandits
Bandit Summary, Real Data, and Online Learning
(Optional) Alternative Bandit Designs
High Level Overview of Reinforcement Learning
What is Reinforcement Learning?
On Unusual or Unexpected Strategies of RL
Defining Some Terms
Build an Intelligent Tic-Tac-Toe Agent
Naive Solution to Tic-Tac-Toe
Components of a Reinforcement Learning System
Notes on Assigning Rewards
The Value Function and Your First Reinforcement Learning Algorithm
Tic Tac Toe Code: Outline
Tic Tac Toe Code: Representing States
Tic Tac Toe Code: Enumerating States Recursively
Tic Tac Toe Code: The Environment
Tic Tac Toe Code: The Agent
Tic Tac Toe Code: Main Loop and Demo
Tic Tac Toe Summary
Tic Tac Toe: Exercise
Markov Decision Proccesses
Gridworld
The Markov Property
Defining and Formalizing the MDP
Future Rewards
Value Function Introduction
Value Functions
Bellman Examples
Optimal Policy and Optimal Value Function
MDP Summary
Dynamic Programming
Intro to Dynamic Programming and Iterative Policy Evaluation
Gridworld in Code
Designing Your RL Program
Iterative Policy Evaluation in Code
Policy Improvement
Policy Iteration
Policy Iteration in Code
Policy Iteration in Windy Gridworld
Value Iteration
Value Iteration in Code
Dynamic Programming Summary
Monte Carlo
Monte Carlo Intro
Monte Carlo Policy Evaluation
Monte Carlo Policy Evaluation in Code
Policy Evaluation in Windy Gridworld
Monte Carlo Control
Monte Carlo Control in Code
Monte Carlo Control without Exploring Starts
Monte Carlo Control without Exploring Starts in Code
Monte Carlo Summary
Temporal Difference Learning
Temporal Difference Intro
TD(0) Prediction
TD(0) Prediction in Code
SARSA
SARSA in Code
Q Learning
Q Learning in Code
TD Summary
Approximation Methods
Approximation Intro
Linear Models for Reinforcement Learning
Features
Monte Carlo Prediction with Approximation
Monte Carlo Prediction with Approximation in Code
TD(0) Semi-Gradient Prediction
Semi-Gradient SARSA
Semi-Gradient SARSA in Code
Course Summary and Next Steps
Stock Trading Project with Reinforcement Learning
Stock Trading Project Section Introduction
Data and Environment
How to Model Q for Q-Learning
Design of the Program
Code pt 1
Code pt 2
Code pt 3
Code pt 4
Stock Trading Project Discussion
Appendix / FAQ
What is the Appendix?
Windows-Focused Environment Setup 2018
How to install Numpy, Scipy, Matplotlib, Pandas, IPython, Theano, and TensorFlow
How to Code by Yourself (part 1)
How to Code by Yourself (part 2)
How to Succeed in this Course (Long Version)
Is this for Beginners or Experts? Academic or Practical? Fast or slow-paced?
Proof that using Jupyter Notebook is the same as not using it
Python 2 vs Python 3
What order should I take your courses in? (part 1)
What order should I take your courses in? (part 2)
BONUS: Where to get discount coupons and FREE deep learning material
Other courses offered by UDEMY
Artificial Intelligence: Reinforcement Learning in Python at UDEMY Students Ratings & Reviews
- 2-31