Using prior knowledge to accelerate online least-squares policy iteration
More Info
expand_more
expand_more