Domestic Academic Electronic Journal System (Electronic Journal System of STPI)

Author: 鐘崑仁
Author affiliation: Department of Industrial Management, National Taiwan Institute of Technology
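Chinese abstract: Let X_1, X_2, ... be the sequence of single-period rewards of a Markov decision process. The present value B of this sequence is defined as (equation not extracted), where 0 < β < 1. This paper examines the problem of maximizing the expected utility of the present value B when the utility function is exponential.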
English abstract: Let X_1, X_2, ... be the sequence of single-period rewards of a Markov decision process. The present value B of this sequence is defined as (equation not extracted), where 0 < β < 1 is the discount factor. This paper examines the maximization of the expected utility of B when the utility function is exponential.
Chinese keywords: Markov decision processes; exponential utility; present value; maximization
English keywords: --
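
The defining equation of B was not captured in this record. As a hedged sketch only, the conventional discounted present value that matches the abstract's description could be written as below. The indexing from n = 1 with exponent n - 1, the risk parameter λ, the sign convention of the utility u, and the policy symbol π are all assumptions that the record does not state.

```latex
% Assumed conventional form; the record's own equation was not extracted.
B = \sum_{n=1}^{\infty} \beta^{\,n-1} X_n , \qquad 0 < \beta < 1 .

% Exponential utility (assumed risk-averse form with parameter \lambda > 0):
u(b) = -e^{-\lambda b} .

% Problem described in the abstract: maximize expected utility over policies \pi.
\sup_{\pi} \; \mathbb{E}_{\pi}\!\left[ u(B) \right]
  = \sup_{\pi} \; \mathbb{E}_{\pi}\!\left[
      -\exp\!\Big( -\lambda \sum_{n=1}^{\infty} \beta^{\,n-1} X_n \Big)
    \right] .

% Under these assumptions the exponential factorizes across periods:
\exp(-\lambda B) = \prod_{n=1}^{\infty} \exp\!\big( -\lambda \beta^{\,n-1} X_n \big) .
```

Under these assumptions, the reward of period n enters with an effective risk parameter λβ^(n-1), which shrinks as n grows.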