Kochenderfer, Wheeler & Wray, 2022

Algorithms for Decision Making

From probabilistic reasoning to multiagent systems. Every algorithm derived, visualized, and made interactive. The complete decision-making toolkit.

26 Chapters · 80+ Simulations · 200+ Quizzes

Part I: Probabilistic Reasoning
Chapter 2

Representation

Bayesian networks, conditional independence, joint distributions.

Chapter 3

Inference

Variable elimination, belief propagation, sampling methods.

Chapter 4

Parameter Learning

MLE, Bayesian learning, EM algorithm.

Chapter 5

Structure Learning

Graph search, Markov equivalence classes.

Chapter 6

Simple Decisions

Utility, decision networks, value of information.

Part II: Sequential Problems
Chapter 7

Exact Solution Methods

MDPs, policy iteration, value iteration.

Chapter 8

Approximate Value Functions

Parametric methods, tile coding, neural networks.

Chapter 9

Online Planning

MCTS, heuristic search, rollout algorithms.

Chapter 10

Policy Search

Genetic algorithms, CEM, evolution strategies.

Chapter 11

Policy Gradient Estimation

Finite differences, likelihood ratio, REINFORCE.

Chapter 12

Policy Gradient Optimization

Natural gradient, trust region, PPO.

Chapter 13

Actor-Critic Methods

GAE, deterministic policy gradient, A3C.

Chapter 14

Policy Validation

Robustness analysis, adversarial testing.

Part III: Model Uncertainty
Chapter 15

Exploration and Exploitation

Bandits, UCB, Thompson sampling.

Chapter 16

Model-Based Methods

Bayesian RL, posterior sampling.

Chapter 17

Model-Free Methods

Q-learning, SARSA, experience replay, DQN.

Chapter 18

Imitation Learning

Behavioral cloning, DAgger, IRL, GAIL.

Part IV: State Uncertainty
Chapter 19

Beliefs

Kalman filter, EKF, UKF, particle filters.

Chapter 20

Exact Belief State Planning

POMDPs, conditional plans, alpha vectors.

Chapter 21

Offline Belief State Planning

PBVI, SARSOP, point-based methods.

Chapter 22

Online Belief State Planning

POMCP, DESPOT, online POMDP solvers.

Chapter 23

Controller Abstractions

Finite state controllers, policy graphs.

Part V: Multiagent Systems
Chapter 24

Multiagent Reasoning

Game theory, Nash equilibrium, correlated equilibrium.

Chapter 25

Sequential Problems

Stochastic games, MARL, fictitious play.

Chapter 26

State Uncertainty

Dec-POMDPs, I-POMDPs, belief-space games.

Chapter 27

Collaborative Agents

Coordination, communication, team decision-making.