Lecture: Partially Observable MDPs (POMDPs)

Andrew Ng - Stanford

Description

Lecture Description

Partially Observable MDPs (POMDPs), Policy Search, Reinforce Algorithm, Pegasus Algorithm, Pegasus Policy Search, Applications of Reinforcement Learning.

Course Description

Note: This course is offered by Stanford as an online course for credit.It can be taken individually, or as part of a masterâ€™s degree or graduate certificate earned online through the Stanford Center for Professional Development.

This course provides a broad introduction to machine learning and statistical pattern recognition.

Topics include: supervised learning (generative/discriminative learning, parametric/non-parametric learning, neural networks, support vector machines); unsupervised learning (clustering, dimensionality reduction, kernel methods); learning theory (bias/variance tradeoffs; VC theory; large margins); reinforcement learning and adaptive control.

The course will also discuss recent applications of machine learning, such as robotic control, data mining, autonomous navigation, bioinformatics, speech recognition, and text and web data processing.

Prerequisites: Knowledge of basic computer science principles and skills, at a level sufficient to write a reasonably non-trivial computer program; familiarity with basic probability theory; familiarity with basic linear algebra.

from course: Machine Learning

< < Previous Lecture
Advice for Applying Machine Learning

Comments

  
Related Lectures
State-action Rewards
The Rise of Athens (cont.)
Glycolysis Introduction
Virology/Tumor Viruses 
Integumentary System II, Course Review