Towards Sparse Hardware-Efficient Control of the DelFly

Bachelor Thesis (2025)

Author(s)

P.A. Bakker (TU Delft - Electrical Engineering, Mathematics and Computer Science)

A.H. Mohammad (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

C. Frenkel – Mentor (TU Delft - Electronic Instrumentation)

O. Yarovyi – Graduation committee member (TU Delft - Microwave Sensing, Signals & Systems)

Maria Alonso Del Pino – Graduation committee member (TU Delft - Tera-Hertz Sensing)

Faculty

Electrical Engineering, Mathematics and Computer Science

Reinforcement Learning Sparse SINDy DelFly

To reference this document use:

https://resolver.tudelft.nl/uuid:596edbc5-5695-407b-a0c6-6d2068216564

Publication Year

2025

Language

English

Graduation Date

27-06-2025

Awarding Institution

Delft University of Technology

Project

EE3L11 Bachelor graduation project Electrical Engineering

Programme

Electrical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Flapping-wing micro air vehicles (FWMAVs) present a significant control challenge due to their complex nonlinear dynamics and severe hardware constraints, which preclude the use of computationally intensive controllers. This thesis addresses this challenge by developing and validating a pipeline that converts a high-performance neural network policy, trained via Reinforcement Learning (RL), into a sparse, hardware-efficient symbolic controller using the Sparse Identification of Nonlinear Dynamics (SINDy) framework. The primary contribution of this work is the introduction and evaluation of novel, hardware-aware optimizations within the SINDy distillation process. Specifically, we introduce Sparse Bit Quantization (SBQ), a new quantization scheme that represents coefficients as combinations of powers of two, so that multiplications can be implemented with bit-shift operations on an FPGA. We systematically analyze the impact of applying SBQ both post-training and during the optimization loop (Quantization-Aware Training), and further explore a custom, hardware-efficient function library designed to map directly to DSP block structures. The complete pipeline was validated on the Pendulum-v1 benchmark. Our results demonstrate that while standard SINDy can accurately approximate the RL teacher policy, our hardware-oriented function library struggles to capture the full complexity of the control task. This highlights a key trade-off between hardware efficiency and model expressiveness. This work serves as a successful proof-of-concept and contributes novel techniques essential for deploying modern control algorithms on resource-constrained robotic systems.
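
The abstract describes SBQ only at a high level (coefficients expressed as combinations of powers of two, multiplication realized with bit shifts). The sketch below illustrates that general idea and is not the thesis's algorithm: the greedy term selection, the function names `sbq_quantize`, `reconstruct`, and `apply_shift_add`, and the parameters `num_terms`, `min_exp`, and `max_exp` are all assumptions made for illustration.

```python
import math


def sbq_quantize(coef, num_terms=2, min_exp=-8, max_exp=3):
    """Greedily approximate `coef` by at most `num_terms` signed powers of two.

    Hypothetical illustration only; returns a list of (sign, exponent) pairs.
    """
    terms = []
    residual = float(coef)
    for _ in range(num_terms):
        if residual == 0.0:
            break
        sign = 1 if residual > 0 else -1
        mag = abs(residual)
        # Pick whichever neighbouring power of two is closer to the residual.
        lo = math.floor(math.log2(mag))
        hi = lo + 1
        exp = lo if (mag - 2.0 ** lo) <= (2.0 ** hi - mag) else hi
        # Keep the exponent inside the assumed hardware shift range.
        exp = max(min_exp, min(max_exp, exp))
        term = sign * 2.0 ** exp
        # Stop if the clamped term would not reduce the approximation error.
        if abs(residual - term) >= abs(residual):
            break
        terms.append((sign, exp))
        residual -= term
    return terms


def reconstruct(terms):
    """Return the real value represented by a list of (sign, exponent) pairs."""
    return sum(sign * 2.0 ** exp for sign, exp in terms)


def apply_shift_add(x, terms):
    """Multiply the integer `x` by the quantized coefficient using shifts and adds."""
    acc = 0
    for sign, exp in terms:
        shifted = x << exp if exp >= 0 else x >> -exp
        acc += sign * shifted
    return acc


terms = sbq_quantize(0.75)
print(terms, reconstruct(terms), apply_shift_add(64, terms))
```

Here a coefficient of 0.75 becomes 2^-1 + 2^-2, so multiplying an integer input by it needs two right shifts and one addition rather than a hardware multiplier. A larger `num_terms` reduces quantization error at the cost of more adders.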

Files

BAP_Thesis_P_en_A_revised.pdf

(pdf)

License info not available

File under embargo until 29-06-2026