E.S. Dam

2 records found

DMQL: Deep Maximum Q-Learning

Combatting Relative Overgeneralisation in Deep Independent Learners using Optimism and Similarity

Various pathologies can occur when independent learners are used in cooperative Multi-Agent Reinforcement Learning. One such pathology is Relative Overgeneralisation, which manifests when a suboptimal Nash Equilibrium in the joint action space of a problem is preferred over an op ...

Wisdom of the crowds is the idea that groups of people can collectively make wise decisions. Research suggests that these crowds can even outsmart experts. To gather the wisdom of the crowds, this project utilizes a prediction market. To successfully gather the wisdom of the crow ...