Agents, RL & Decision Science

Agents, RL & Decision Science

Reinforcement learning, multi-agent systems, causality, and decision optimization.

39

Courses

3

Subcategories

1279h+

Total Hours

All levels

Difficulty Range

Visual

Step 7 Non Linear Models Examples 7.6 Agent Based Modeling

A generic course about Agent Based Modeling. Content coming soon.

Agent-Based Modeling4hIntermediateEnglish
Visual

Decision Theory & Robust Preferences

Foundations of rational decision-making: utility theory, risk measures, and robust preference models.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

Online Learning & Adversarial Bandits

Regret minimization in online learning: experts, adversarial bandits, and multiplicative weights.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

Contextual Bandits & Off-Policy Evaluation

Contextual bandits for personalization with off-policy evaluation methods for safe deployment.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

Policy Learning & Counterfactual Risk Minimization

Learn optimal policies from logged bandit data using counterfactual risk minimization.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

MDPs & Dynamic Programming

Markov decision processes: Bellman equations, value iteration, and policy iteration foundations.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

RL with Function Approximation

Reinforcement learning with linear and neural function approximation: DQN, policy gradient, and convergence analysis.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

Safe, Robust & Risk-Sensitive RL

RL under safety constraints: constrained MDPs, robust MDPs, and risk-sensitive objectives.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

Inverse RL & Imitation Learning

Learn reward functions from demonstrations: IRL, behavioral cloning, and DAgger.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

Off-Policy Evaluation: IS, DR, FQE Guarantees

Theory of off-policy evaluation in RL: importance sampling, doubly robust methods, and fitted Q evaluation.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

Sample Complexity & PAC-style Guarantees in RL

PAC-MDP framework, sample complexity bounds, and minimax rates for reinforcement learning.

Bandits, Causality & RL Theory4hAdvancedEnglish
Visual

POMDPs & Information-State Control

Partially observable MDPs: belief states, information states, and planning under partial observability.

Bandits, Causality & RL Theory4hAdvancedEnglish
Showing 12 resultsTotal: 39 courses
Pageof 4