Searched for: subject:"sample efficiency"
Koomen, Lenard (author)
The combination of reinforcement learning and deep neural networks has the potential to train intelligent autonomous agents on high-dimensional sensory inputs, with applications in flight control. However, the number of samples these methods need is often too large for real-world interaction to be practical. In this work, mirror-descent guided policy...
master thesis 2020