Landing : Athabascau University

POST - Unit 1, Section 3, Activity 4

  • Public

POST - Unit 1, Section 3, Activity 4

Started by Duncan Robertson July 24, 2023 - 8:20pm

Discuss the following question in the discussion forum: What are the differences between the value-iteration algorithm and Q-learning?

Differences between the value-iteration algorithm and Q-learning:

- The value-iteration algorithm is designed to calculate learning parameters in the case where Markov Decision Process probabilities are known.  In Q-learning, the probabilities are not known.

- As such, the MDP probabilities that are unknown are estimated dynamically, and the Q and V values at the same time.

COMP667  Multiagent Systems

COMP667 Multiagent Systems

This group is created for complementing COMP667 Course in Moodle, providing additional course material, and facilitating discussions.