Searched for: faculty:"Delft University of Technology"
(1 - 4 of 4)
document
Li, G. (author), Hung, H. (author), Bradley Knox, W. (author), Whiteson, S.A. (author)
Learning from rewards generated by a human trainer observing the agent in action has been demonstrated to be an effective method for humans to teach an agent to perform challenging tasks. However, how to make the agent learn most efficiently from these kinds of human reward is still under-addressed. In this paper, we investigate the effect of...
journal article 2014
document
Li, S.B. (author), Xiao, L.O. (author), Song, G.M. (author), Wu, X.M. (author), Sloof, W.G. (author), Van der Zwaag, S. (author)
conference paper 2011
document
Sloof, W.G. (author), Li, S. (author), Song, G. (author), Kwakernaak, C. (author), Wu, X. (author), Van der Zwaag, S. (author)
conference paper 2011
document
Song, G.M. (author), Li, S.B. (author), Sloof, W.G. (author), Van der Zwaag, S. (author)
conference paper 2011