M.T.J. Spaan
Please Note
56 records found
1
Performance of Decision Transformer in multi-task offline reinforcement learning
How does the introduction of sub-optimal data affect the performance of the model?
Multi-Task Offline Reinforcement Learning
Experimental Evaluation of the Generalizability of the Soft Actor-Critic + Behavioral Cloning Algorithm
Teaching How to Learn to Learn
Teacher-Student Curriculum Learning for Efficient Meta-Learning
Comparative Analysis of Curriculum Strategies in training Meta-Learning
Curriculum Strategies for Faster Meta-Learning
Exploration When Everything Looks New
Effect of the Local Uncertainty Source on Exploration
Multi-task Offline Reinforcement Learning with CQL
A study on how dataset size and diversity increase generalization performance
Comparative Analysis of Exploration Algorithms in Deep Reinforcement Learning for Autonomous Driving
How does epsilon-greedy, random network distillation, bootstrapped DQN affect training and the robustness of final policies under various testing conditions in autonomous driving?
Evaluating robustness of deep reinforcement learning for autonomous driving
Effects of domain randomization on training and robustness
Effects of Partial Observability Solver Methods on Training and Final Policies in Autonomous Driver RL
How do different methods for dealing with partial observability in the environment influence training and the robustness of final policies under various testing conditions?
An empirical analysis of entropy search in batch bayesian optimisation
A comprehensive study of function shape, batch size, noise level, and dimensionality impact on information-theoretic methods
Evaluating Robustness of Deep Reinforcement Learning for Autonomous Driving
How does entropy maximization affect the training and robustness of final policies under various testing conditions?
Effects of action space discretization and DQN extensions on algorithm robustness and efficiency
How do the discretization of the action space and various extensions to the well-known DQN algorithm influence training and the robustness of final policies under various testing conditions?