Dynamically Augmented CVaR for MDPs
This paper introduces the time-consistent Dynamically Augmented CVaR (DCVaR) risk measure for Markov Decision Processes and presents a provably correct algorithm to optimize it by analyzing a specially defined Dynamically Augmented Robust MDP.