The Landing: POST - Unit 1, Section 3, Activity 4

Discussion
COMP667 Multiagent Systems
POST - Unit 1, Section 3, Activity 4

POST - Unit 1, Section 3, Activity 4

Started by Duncan Robertson July 24, 2023 - 8:20pm

Discuss the following question in the discussion forum: What are the differences between the value-iteration algorithm and Q-learning?

Differences between the value-iteration algorithm and Q-learning:

- The value-iteration algorithm is designed to calculate learning parameters in the case where Markov Decision Process probabilities are known. In Q-learning, the probabilities are not known.

- As such, the MDP probabilities that are unknown are estimated dynamically, and the Q and V values at the same time.

POST - Unit 1, Section 3, Activity 4

POST - Unit 1, Section 3, Activity 4

COMP667 Multiagent Systems

Help

Adding comments to this site

Disclaimer