Towards Sparse Hardware-Efficient Control of the DelFly
P.A. Bakker (TU Delft - Electrical Engineering, Mathematics and Computer Science)
C. Frenkel – Mentor (TU Delft - Electronic Instrumentation)
O. Yarovyi – Graduation committee member (TU Delft - Microwave Sensing, Signals & Systems)
M. Alonso Del Pino – Graduation committee member (TU Delft - Tera-Hertz Sensing)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Flapping-wing micro air vehicles (FWMAVs) present a significant control challenge due to their complex nonlinear dynamics and severe hardware constraints, which preclude the use of computationally intensive controllers. This thesis addresses this challenge by developing and validating a pipeline to convert a high-performance neural network policy, trained via Reinforcement Learning (RL), into a sparse, hardware-efficient symbolic controller using the Sparse Identification of Nonlinear Dynamics (SINDy) framework. The primary contribution of this work is the introduction and evaluation of novel, hardwareaware optimizations within the SINDy distillation process. Specifically, we introduce Sparse Bit Quantization (SBQ), a new quantization scheme that represents coefficients as combinations of powers of two to enable efficient implementation using bit-shift operations on an FPGA. We systematically analyze the impact of applying SBQ both post-training and during the optimization loop (Quantization-Aware Training), and further explore the use of a custom, hardware-efficient function library designed to map directly to DSP block structures. The complete pipeline was validated on the ‘Pendulum-v1‘ benchmark. Our results demonstrate that while standard SINDy can accurately approximate the RL teacher policy, our hardware-oriented function library, struggles to capture the full complexity of the control task. This highlights a key trade-off between hardware-efficiency and model expressiveness. This work serves as a successful proof-of-concept and contributes novel techniques essential for deploying modern control algorithms on resource-constrained robotic systems.
Files
File under embargo until 29-06-2026