AN ANALYTICAL METHOD TO CALCULATE THE ERGODIC AND DIFFERENCE MATRICES OF THE DISCOUNTED MARKOV DECISION PROCESSES



P; that is, the control strategy should minimize the quality coefficient
for a given factor β.

The aspects mentioned above will be the subject of future papers. The analysis
of the MDP leads to the following conclusions:

1. Formula (19) allows the total reward ν_k(β) to be calculated analytically,
without the difficult inversion of the matrix (I - βP). Matrix inversion
on a computer proceeds iteratively, and the number of iterations grows
rapidly with the size N of the matrix, which leads to a loss of
computational accuracy.

2. The proposed analytical method for calculating the ergodic and difference
matrices makes it possible to separate the total reward into two components.
This widens the scope for analysing the Discounted Markov Decision Process.
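
Both conclusions can be illustrated on the smallest possible case. The sketch below does not reproduce Formula (19). It assumes a two-state chain P = [[1-a, a], [b, 1-b]], for which the well-known closed form P^n = S + (1-a-b)^n D holds, with S the ergodic matrix and D the difference matrix. Summing the discounted series term by term gives the total reward as an ergodic part S r / (1-β) plus a difference part D r / (1-β(1-a-b)), with no inversion of (I - βP). The function names, the reward vector r, and the numerical values are hypothetical and chosen only for illustration.

```python
def two_state_discounted_reward(a, b, r, beta):
    """Closed-form total discounted reward for P = [[1-a, a], [b, 1-b]].

    Assumes 0 < a + b and 0 <= beta < 1. Returns the ergodic component,
    the difference component, and their sum (one entry per start state).
    """
    s = a + b
    lam = 1.0 - s  # second eigenvalue of P (the first one is 1)
    ergodic = [[b / s, a / s], [b / s, a / s]]          # S: limit of P^n
    difference = [[a / s, -a / s], [-b / s, b / s]]     # D: P^n = S + lam^n * D
    ergodic_part = [
        sum(ergodic[i][j] * r[j] for j in range(2)) / (1.0 - beta)
        for i in range(2)
    ]
    difference_part = [
        sum(difference[i][j] * r[j] for j in range(2)) / (1.0 - beta * lam)
        for i in range(2)
    ]
    total = [e + d for e, d in zip(ergodic_part, difference_part)]
    return ergodic_part, difference_part, total


def series_discounted_reward(a, b, r, beta, n_terms=2000):
    """Reference value: truncated sum of beta^n * P^n * r over n."""
    P = [[1.0 - a, a], [b, 1.0 - b]]
    total = [0.0, 0.0]
    term = list(r)  # holds beta^n * P^n * r, starting at n = 0
    for _ in range(n_terms):
        total = [t + x for t, x in zip(total, term)]
        term = [beta * sum(P[i][j] * term[j] for j in range(2)) for i in range(2)]
    return total


# Hypothetical example values.
ergodic_part, difference_part, total = two_state_discounted_reward(
    a=0.3, b=0.2, r=[1.0, 0.0], beta=0.9
)
reference = series_discounted_reward(a=0.3, b=0.2, r=[1.0, 0.0], beta=0.9)
```

The ergodic part is the same for both start states and dominates as β approaches 1, while the difference part carries the dependence on the initial state. For larger N the closed form above no longer applies and the ergodic and difference matrices have to be obtained by the method of the paper.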



