Searched for: faculty%3A%22Delft%255C%252BUniversity%255C%252Bof%255C%252BTechnology%22
(1 - 4 of 4)
- document
-
Li, G. (author), Hung, H. (author), Bradley Knox, W. (author), Whiteson, S.A. (author)Learning from rewards generated by a human trainer ob- serving the agent in action has been demonstrated to be an effective method for humans to teach an agent to perform challenging tasks. However, how to make the agent learn most efficiently from these kinds of human reward is still under-addressed. In this paper, we investigate the effect of...journal article 2014
- document
- Li, S.B. (author), Xiao, L.O. (author), Song, G.M. (author), Wu, X.M. (author), Sloof, W.G. (author), Van der Zwaag, S. (author) conference paper 2011
- document
- Sloof, W.G. (author), Li, S. (author), Song, G. (author), Kwakernaak, C. (author), Wu, X. (author), Van der Zwaag, S. (author) conference paper 2011
- document
- Song, G.M. (author), Li, S.B. (author), Sloof, W.G. (author), Van der Zwaag, S. (author) conference paper 2011
Searched for: faculty%3A%22Delft%255C%252BUniversity%255C%252Bof%255C%252BTechnology%22
(1 - 4 of 4)