A stationary policy and an initial state in an MDP (Markov decision process) induce a stationary probability distribution of the reward. The problem analyzed here is generating the Pareto optima in the sense of high mean and low variance of the stationary distribution. In the unichain case, Pareto optima can be computed either with policy improvement or with a linear program having the same number of variables and one more constraint than the formulation for gain-rate optimization. The same linear program suffices in the multichain case if the ergodic class is an element of choice.
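The mean-variance criterion above can be illustrated with a minimal sketch. It is not the policy-improvement or linear-programming method the abstract refers to: it simply enumerates the deterministic stationary policies of a small MDP, computes the mean and variance of the stationary reward distribution for each, and keeps the non-dominated ones. The two-state MDP (`TRANS`, `REWARD`), the action names, and all function names are hypothetical and chosen for illustration. The sketch also assumes every policy induces an aperiodic unichain, so that power iteration converges, and it ignores randomized policies.

```python
from itertools import product

# Hypothetical 2-state MDP: TRANS[s][a] is the next-state distribution,
# REWARD[s][a] is the (deterministic) one-step reward.
TRANS = {
    0: {"safe": [0.9, 0.1], "risky": [0.5, 0.5]},
    1: {"safe": [0.6, 0.4], "risky": [0.2, 0.8]},
}
REWARD = {
    0: {"safe": 1.0, "risky": 0.0},
    1: {"safe": 2.0, "risky": 5.0},
}
ACTIONS = ("safe", "risky")


def stationary_distribution(P, iters=10_000, tol=1e-12):
    """Power iteration for pi = pi P (assumes an aperiodic unichain)."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


def mean_variance(policy):
    """Mean and variance of the reward under the stationary distribution."""
    P = [TRANS[s][policy[s]] for s in range(len(policy))]
    r = [REWARD[s][policy[s]] for s in range(len(policy))]
    pi = stationary_distribution(P)
    mean = sum(p * x for p, x in zip(pi, r))
    var = sum(p * x * x for p, x in zip(pi, r)) - mean ** 2
    return mean, var


def pareto_front(results):
    """Keep policies not dominated in (higher mean, lower variance)."""
    front = []
    for pol, (m, v) in results.items():
        dominated = any(
            m2 >= m and v2 <= v and (m2 > m or v2 < v)
            for (m2, v2) in results.values()
        )
        if not dominated:
            front.append((pol, m, v))
    return front


# Evaluate every deterministic stationary policy (one action per state).
RESULTS = {pol: mean_variance(pol) for pol in product(ACTIONS, repeat=2)}

if __name__ == "__main__":
    for pol, m, v in pareto_front(RESULTS):
        print(pol, round(m, 4), round(v, 4))
```

Brute-force enumeration grows exponentially with the number of states, which is why the abstract's policy-improvement and linear-programming formulations matter in practice; the sketch only makes the objective concrete.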