AN ANALYTICAL METHOD TO CALCULATE THE ERGODIC AND DIFFERENCE MATRICES OF THE DISCOUNTED MARKOV DECISION PROCESSES



P, that is, the control strategy should minimize the quality coefficient when the
factor β is given.

The aspects mentioned above will be the subject of future papers. The analysis
of the MDP leads to the following conclusions:

1. Formula (19) allows the total reward ν_k(β) to be calculated analytically,
without the difficult inversion of the matrix (I - βP). Numerical matrix
inversion on a computer proceeds iteratively, and the number of iterations
rises rapidly with the size N of the matrix, which leads to a loss of
computational accuracy.

2. The proposed analytical method of calculating the ergodic and difference
matrices makes it possible to separate the two components of the total reward.
This widens the scope for analysis of the Discounted Markov Decision Process.
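
The two conclusions can be illustrated on the smallest nontrivial case. The sketch below is not Formula (19) itself; it assumes a two-state chain with transition matrix P = [[1-a, a], [b, 1-b]], where the ergodic and difference matrices are known in closed form, and it assumes the convention that the total reward is the sum over n >= 0 of β^n P^n r. The names `a`, `b`, `r`, `two_state_discounted_value` and `series_value` are hypothetical and chosen only for this illustration.

```python
def two_state_discounted_value(a, b, r, beta):
    """Discounted total reward for P = [[1-a, a], [b, 1-b]],
    computed without inverting (I - beta*P).

    Assumes 0 < a + b < 2 and 0 <= beta < 1.
    """
    s = a + b
    lam = 1.0 - s  # second eigenvalue of P (the first is 1)

    # Ergodic matrix: each row is the stationary distribution (b/s, a/s).
    ergodic = [[b / s, a / s], [b / s, a / s]]
    # Difference matrix D, defined by P**n = ergodic + lam**n * D.
    diff = [[a / s, -a / s], [-b / s, b / s]]

    # Component 1: contribution of the ergodic matrix, sum of beta**n.
    steady = [sum(ergodic[i][j] * r[j] for j in range(2)) / (1.0 - beta)
              for i in range(2)]
    # Component 2: contribution of the difference matrix, sum of (beta*lam)**n.
    transient = [sum(diff[i][j] * r[j] for j in range(2)) / (1.0 - beta * lam)
                 for i in range(2)]

    total = [steady[i] + transient[i] for i in range(2)]
    return steady, transient, total


def series_value(a, b, r, beta, n_terms=2000):
    """Reference value: truncated sum of beta**n * P**n * r."""
    P = [[1.0 - a, a], [b, 1.0 - b]]
    v = [0.0, 0.0]
    x = list(r)   # holds P**n * r
    w = 1.0       # holds beta**n
    for _ in range(n_terms):
        v = [v[i] + w * x[i] for i in range(2)]
        x = [P[i][0] * x[0] + P[i][1] * x[1] for i in range(2)]
        w *= beta
    return v


if __name__ == "__main__":
    steady, transient, total = two_state_discounted_value(0.3, 0.2, [1.0, 0.0], 0.9)
    print("ergodic component:   ", steady)
    print("difference component:", transient)
    print("total reward:        ", total)
    print("series reference:    ", series_value(0.3, 0.2, [1.0, 0.0], 0.9))
```

In this sketch the ergodic component is the same for both starting states, while the difference component carries the dependence on the initial state, so the two parts of the total reward can be examined separately.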
