Searched for: subject%3A%22Deep%255C%252BReinforcement%255C%252BLearning%22
(1 - 1 of 1)
- document
-
Mandersloot, A.V. (author)The Decentralized Partially Observable Markov Decision Process is a commonly used framework to formally model scenarios in which multiple agents must collaborate using local information. A key difficulty in a Dec-POMDP is that in order to coordinate successfully, an agent must decide on actions not only using its own information, but also by...master thesis 2020