Li, Guangliang (author), Whiteson, Shimon (author), Bradley Knox, W (author), Hung, H.S. (author)

Learning from rewards generated by a human trainer observing an agent in action has proven to be a powerful method for teaching autonomous agents to perform challenging tasks, especially for non-technical users. Since the efficacy of this approach depends critically on the reward the trainer provides, we consider how the...

journal article 2018