M.T.J. Spaan
55 records found
1
Performance of Decision Transformer in multi-task offline reinforcement learning
How does the introduction of sub-optimal data affect the performance of the model?
Multi-Task Offline Reinforcement Learning
Experimental Evaluation of the Generalizability of the Soft Actor-Critic + Behavioral Cloning Algorithm
Teaching How to Learn to Learn
Teacher-Student Curriculum Learning for Efficient Meta-Learning
Comparative Analysis of Curriculum Strategies in training Meta-Learning
Curriculum Strategies for Faster Meta-Learning
Exploration When Everything Looks New
Effect of the Local Uncertainty Source on Exploration
Multi-task Offline Reinforcement Learning with CQL
A study on how dataset size and diversity increase generalization performance
making an observation of the current state it is in, picking out an action to execute, and
observing the reward of said action, after which it receives the next state and repeats the
...
Effects of Partial Observability Solver Methods on Training and Final Policies in Autonomous Driver RL
How do different methods for dealing with partial observability in the environment influence training and the robustness of final policies under various testing conditions?
Evaluating robustness of deep reinforcement learning for autonomous driving
Effects of domain randomization on training and robustness
An empirical analysis of entropy search in batch bayesian optimisation
A comprehensive study of function shape, batch size, noise level, and dimensionality impact on information-theoretic methods
Replacing the acquisition function in Bayesian optimization by a neural network
How effectively do meta-learned acquisition functions in Bayesian optimization perform when optimizing for control variates of unknown functions, as compared to BO with standard acquisition functions
Effects of action space discretization and DQN extensions on algorithm robustness and efficiency
How do the discretization of the action space and various extensions to the well-known DQN algorithm influence training and the robustness of final policies under various testing conditions?
Evaluating Robustness of Deep Reinforcement Learning for Autonomous Driving
How does entropy maximization affect the training and robustness of final policies under various testing conditions?
Comparative Analysis of Exploration Algorithms in Deep Reinforcement Learning for Autonomous Driving
How does epsilon-greedy, random network distillation, bootstrapped DQN affect training and the robustness of final policies under various testing conditions in autonomous driving?