Learning-Driven Torque Control for Skid-Steer Robots

None, None

Learning-Driven Torque Control for Skid-Steer Robots

Knowledge-Assisted Reinforcement Learning with Curriculum-Based System Identification for Trajectory Control

Master Thesis (2025)

Author(s)

M.T. Jansen (TU Delft - Mechanical Engineering)

Contributor(s)

Joris Sijs – Mentor (Mobile Robotics - Avular)

Hans Goosen – Graduation committee member (TU Delft - Computational Design and Mechanics)

Jens Kober – Mentor (TU Delft - Learning & Autonomous Control)

Faculty

Mechanical Engineering

Deep Reinforcement Learning Imitation learning DDPG Trajectory tracking Nonholonomic Motion Planning Curriculum learning Torque-based control Skid‑steer dynamics Knowledge-assisted learning Dynamic Coupling

To reference this document use:

https://resolver.tudelft.nl/uuid:0bcc777f-cf1c-49fe-8dbf-c18237864841

More Info

expand_more

Publication Year

2025

Language

English

Graduation Date

01-07-2025

Awarding Institution

Delft University of Technology

Programme

['Mechanical Engineering']

Faculty

Mechanical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Skid-steer mobile robots present a unique challenge for reinforcement learning (RL) due to their nonholonomic constraints, dynamic wheel coupling, and high susceptibility to slip. Standard RL methods often struggle to achieve stable torque control in such settings, particularly under long-horizon tasks with sparse or delayed rewards. This thesis introduces a knowledge-assisted RL framework that systematically integrates expert demonstrations and curriculum learning to improve sample efficiency, training stability, and final policy performance.

The foundation of the method is a torque-based Deep Deterministic Policy Gradient (DDPG) agent, augmented through two key innovations: (1) KAMMA (Knowledge-Assisted Mixed Mode Actioning), a probabilistic switching mechanism that alternates between expert and learned actions to avoid interference artifacts and accelerate early-stage convergence; and (2) Curriculum-Driven System Identification, where the task complexity is gradually increased via staged velocity profiles to reveal underlying terrain-robot dynamics in a structured manner.

Experiments conducted in Isaac Sim demonstrate that this integrated KAMMA + Curriculum approach outperforms both baseline KA-DDPG and imitation-only variants across key metrics, including trajectory tracking error, policy smoothness, and convergence speed. The results confirm that combining staged learning with adaptive knowledge infusion enables robust torque-level control and offers a scalable template for learning-driven system identification in robotics.

Files

TUD_Report_MJ.pdf

(pdf | 16.7 Mb)

License info not available