Learning-Driven Torque Control for Skid-Steer Robots

Knowledge-Assisted Reinforcement Learning with Curriculum-Based System Identification for Trajectory Control

Master Thesis (2025)
Author(s)

M.T. Jansen (TU Delft - Mechanical Engineering)

Contributor(s)

Joris Sijs – Mentor (Mobile Robotics - Avular)

Hans Goosen – Graduation committee member (TU Delft - Computational Design and Mechanics)

J. Kober – Mentor (TU Delft - Learning & Autonomous Control)

Faculty
Mechanical Engineering
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
01-07-2025
Awarding Institution
Delft University of Technology
Programme
Mechanical Engineering
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Skid-steer mobile robots present a unique challenge for reinforcement learning (RL) due to their nonholonomic constraints, dynamic wheel coupling, and high susceptibility to slip. Standard RL methods often struggle to achieve stable torque control in such settings, particularly under long-horizon tasks with sparse or delayed rewards. This thesis introduces a knowledge-assisted RL framework that systematically integrates expert demonstrations and curriculum learning to improve sample efficiency, training stability, and final policy performance.

The foundation of the method is a torque-based Deep Deterministic Policy Gradient (DDPG) agent, augmented through two key innovations: (1) KAMMA (Knowledge-Assisted Mixed Mode Actioning), a probabilistic switching mechanism that alternates between expert and learned actions to avoid interference artifacts and accelerate early-stage convergence; and (2) Curriculum-Driven System Identification, where the task complexity is gradually increased via staged velocity profiles to reveal underlying terrain-robot dynamics in a structured manner.

Experiments conducted in Isaac Sim demonstrate that this integrated KAMMA + Curriculum approach outperforms both baseline KA-DDPG and imitation-only variants across key metrics, including trajectory tracking error, policy smoothness, and convergence speed. The results confirm that combining staged learning with adaptive knowledge infusion enables robust torque-level control and offers a scalable template for learning-driven system identification in robotics.

Files

TUD_Report_MJ.pdf
(pdf | 16.7 Mb)
License info not available