AN ANALYTICAL METHOD TO CALCULATE THE ERGODIC AND DIFFERENCE MATRICES OF THE DISCOUNTED MARKOV DECISION PROCESSES



In this same way, we decompose the other three elements of matrix and we rece-
ive:

4          5         5         _ 5

9 I__9                 9      _|__9

(I - βP)-1


(1) + (1-0,1β)   (1) + (1-0,1β)

4         _4        5         4

9                 9            9               9

(1) + (1-0,1β)   (1) + (1-0,1β)

Finally formula (15) becomes the following form:

(I - βP)-1


(1 - β)


0, 1β


We can check that the first matrix with factor 1/ (1 - β) is ergodic matrix of Mar-
kov Process for given stochastic matrix of transition P . The second matrix is so
named difference matrix. The sum of elements is equal zero in rows of this matrix.
Taking formula (19) into consideration, we receive total finite expected reward:

ν(β) =


1

(1 - β)


' 4

9

4

. 9


9  +--------

9      1 - 0,1β


• q.


Now we can find value ν (β) for two different β, β1 = 0, 5 and β2 = 0, 99.

After providing of values and simple calculations we receive:

ν(0, 5) =

5
9
5

9


+ 1, 052


-3


Hence we obtain for the starting state and n → ∞

ν1,∞ (0, 5) = 2 • 1 + 1, 052 • 5 =

7, 260,




More intriguing information

1. An Attempt to 2
2. Commuting in multinodal urban systems: An empirical comparison of three alternative models
3. The name is absent
4. Natural hazard mitigation in Southern California
5. Ahorro y crecimiento: alguna evidencia para la economía argentina, 1970-2004
6. Quality Enhancement for E-Learning Courses: The Role of Student Feedback
7. SLA RESEARCH ON SELF-DIRECTION: THEORETICAL AND PRACTICAL ISSUES
8. The name is absent
9. The name is absent
10. The name is absent