Deck · Financial Engineering
Reinforcement Learning for Finance
MDPs, dynamic programming, Q-learning, policy gradients, and actor-critic methods applied to optimal execution, market making, and portfolio control.
75 cards · audited · SM-2 spaced repetition
Included with the full Financial Engineering program — 19 decks, 1,382 cards.
Sample cards
1
Markov Decision Process (MDP) — five components
2
Return Gₜ (discounted)
3
State-value function Vπ(s)
4
Action-value function Qπ(s,a)
5
Markov property in MDPs
Showing 5 of 75 cards. Unlock the program to study them all.
More in Financial Engineering
Master reinforcement learning for finance — and the rest of Financial Engineering.
One program. 1,382 audited cards across 19 decks.