Loading [MathJax]/extensions/TeX/ietmacros.js
Heuristic mean-variance optimization in Markov decision processes using state-dependent risk aversion | OUP Journals & Magazine | IEEE Xplore

Heuristic mean-variance optimization in Markov decision processes using state-dependent risk aversion


Abstract:

In dynamic decision problems, it is challenging to find the right balance between maximizing expected rewards and minimizing risks. In this paper, we consider NP-hard mea...Show More

Abstract:

In dynamic decision problems, it is challenging to find the right balance between maximizing expected rewards and minimizing risks. In this paper, we consider NP-hard mean-variance (MV) optimization problems in Markov decision processes with a finite time horizon. We present a heuristic approach to solve MV problems, which is based on state-dependent risk aversion and efficient dynamic programming techniques. Our approach can also be applied to mean-semivariance (MSV) problems, which particularly focus on the downside risk. We demonstrate the applicability and the effectiveness of our heuristic for dynamic pricing applications. Using reproducible examples, we show that our approach outperforms existing state-of-the-art benchmark models for MV and MSV problems while also providing competitive runtimes. Further, compared to models based on constant risk levels, we find that state-dependent risk aversion allows to more effectively intervene in case sales processes deviate from their planned paths. Our concepts are domain independent, easy to implement and of low computational complexity.
Published in: IMA Journal of Management Mathematics ( Volume: 33, Issue: 2, May 2021)
Page(s): 181 - 199
Date of Publication: May 2021

ISSN Information:


Contact IEEE to Subscribe