AN ANALYTICAL METHOD TO CALCULATE THE ERGODIC AND DIFFERENCE MATRICES OF THE DISCOUNTED MARKOV DECISION PROCESSES



In this same way, we decompose the other three elements of matrix and we rece-
ive:

4          5         5         _ 5

9 I__9                 9      _|__9

(I - βP)-1


(1) + (1-0,1β)   (1) + (1-0,1β)

4         _4        5         4

9                 9            9               9

(1) + (1-0,1β)   (1) + (1-0,1β)

Finally formula (15) becomes the following form:

(I - βP)-1


(1 - β)


0, 1β


We can check that the first matrix with factor 1/ (1 - β) is ergodic matrix of Mar-
kov Process for given stochastic matrix of transition P . The second matrix is so
named difference matrix. The sum of elements is equal zero in rows of this matrix.
Taking formula (19) into consideration, we receive total finite expected reward:

ν(β) =


1

(1 - β)


' 4

9

4

. 9


9  +--------

9      1 - 0,1β


• q.


Now we can find value ν (β) for two different β, β1 = 0, 5 and β2 = 0, 99.

After providing of values and simple calculations we receive:

ν(0, 5) =

5
9
5

9


+ 1, 052


-3


Hence we obtain for the starting state and n → ∞

ν1,∞ (0, 5) = 2 • 1 + 1, 052 • 5 =

7, 260,




More intriguing information

1. Spatial patterns in intermunicipal Danish commuting
2. The Value of Cultural Heritage Sites in Armenia: Evidence From a Travel Cost Method Study
3. A Rare Case Of Fallopian Tube Cancer
4. The name is absent
5. Motivations, Values and Emotions: Three Sides of the same Coin
6. The economic value of food labels: A lab experiment on safer infant milk formula
7. Whatever happened to competition in space agency procurement? The case of NASA
8. Long-Term Capital Movements
9. The name is absent
10. The name is absent