Vikas Agarwal and Narayan Y Naik. Risks and portfolio decisions involving hedge funds. The
Review of Financial Studies, 17(1):63–98, 2004.
Philippe Artzner, Freddy Delbaen, Jean-Marc Eber, and David Heath. Coherent measures of risk.
Mathematical finance, 9(3):203–228, 1999.
Nicole Bäuerle and Jonathan Ott. Markov decision processes with average-value-at-risk criteria.
Mathematical Methods of Operations Research, 74(3):361–379, 2011.
Marc G. Bellemare, Will Dabney, and Rémi Munos. A distributional perspective on reinforcement
learning. In Doina Precup and Yee Whye Teh, editors, Proceedings of the 34th International
Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research,
pages 449–458, International Convention Centre, Sydney, Australia, 06–11 Aug 2017. PMLR.
Tomas Bjork and Agatha Murgoci. A general theory of markovian time inconsistent stochastic
control problems. Available at SSRN 1694759, 2010.
George I Boutselis, Ziyi Wang, and Evangelos A Theodorou. Constrained sampling-based trajectory
optimization using stochastic approximation. arXiv preprint arXiv:1911.04621, 2019.
Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, and
Wojciech Zaremba. Openai gym. arXiv preprint arXiv:1606.01540, 2016.
Xin Chen, Melvyn Sim, David Simchi-Levi, and Peng Sun. Risk aversion in inventory management.
Operations Research, 55(5):828–842, 2007.
Yinlam Chow and Marco Pavone. A time consistent formulation of risk constrained stochastic
optimal control. arXiv preprint arXiv:1503.07461, 2015.
Yinlam Chow, Aviv Tamar, Shie Mannor, and Marco Pavone. Risk-sensitive and robust decision-
making: a cvar optimization approach. In Advances in Neural Information Processing Systems,
pages 1522–1530, 2015.
Antonio J Conejo, Miguel Carrión, Juan M Morales, et al. Decision making under uncertainty in
electricity markets, volume 1. Springer, 2010.
Dotan Di Castro, Aviv Tamar, and Shie Mannor. Policy gradients with variance related risk criteria.
arXiv preprint arXiv:1206.6404, 2012.
Ht M ElKholy. Dynamic modeling and control of a quadrotor using linear and nonlinear approaches.
Master of Science in Robotics, The American University in Cairo, 2014.
Wendell H Fleming and William M McEneaney. Risk-sensitive control on an infinite time horizon.
SIAM Journal on Control and Optimization, 33(6):1881–1915, 1995.
Abhijit Gosavi. Variance-penalized markov decision processes: Dynamic programming and re-
inforcement learning techniques. International Journal of General Systems, 43(6):649–669,
2014.