Deck · Financial Engineering

Reinforcement Learning for Finance

MDPs, dynamic programming, Q-learning, policy gradients, and actor-critic methods applied to optimal execution, market making, and portfolio control.

75 cards · audited · SM-2 spaced repetition

or go All-Access →

Included with the full Financial Engineering program — 19 decks, 1,382 cards.

Sample cards

1

Markov Decision Process (MDP) — five components

2

Return Gₜ (discounted)

3

State-value function Vπ(s)

4

Action-value function Qπ(s,a)

5

Markov property in MDPs

Showing 5 of 75 cards. Unlock the program to study them all.

More in Financial Engineering

Master reinforcement learning for finance — and the rest of Financial Engineering.

One program. 1,382 audited cards across 19 decks.

or go All-Access →

See the full program →