Searched for: subject%3A%22Deep%255C%252BReinforcement%255C%252BLearning%22
(1 - 1 of 1)
document
Mandersloot, A.V. (author)
The Decentralized Partially Observable Markov Decision Process is a commonly used framework to formally model scenarios in which multiple agents must collaborate using local information. A key difficulty in a Dec-POMDP is that in order to coordinate successfully, an agent must decide on actions not only using its own information, but also by...
master thesis 2020