Lecture: Partially Observable MDPs (POMDPs)

Andrew Ng - Stanford

 
Previous Lectureno lecture

Description

Lecture Description

Partially Observable MDPs (POMDPs), Policy Search, Reinforce Algorithm, Pegasus Algorithm, Pegasus Policy Search, Applications of Reinforcement Learning.

Course Description

Note: This course is offered by Stanford as an online course for credit.It can be taken individually, or as part of a master’s degree or graduate certificate earned online through the Stanford Center for Professional Development.

This course provides a broad introduction to machine learning and statistical pattern recognition.

Topics include: supervised learning (generative/discriminative learning, parametric/non-parametric learning, neural networks, support vector machines); unsupervised learning (clustering, dimensionality reduction, kernel methods); learning theory (bias/variance tradeoffs; VC theory; large margins); reinforcement learning and adaptive control.

The course will also discuss recent applications of machine learning, such as robotic control, data mining, autonomous navigation, bioinformatics, speech recognition, and text and web data processing.

Prerequisites: Knowledge of basic computer science principles and skills, at a level sufficient to write a reasonably non-trivial computer program; familiarity with basic probability theory; familiarity with basic linear algebra.

from course: Machine Learning

Comments

Related Lectures