Radar-based gesture recognition with spiking neural networks

None, None

Radar-based gesture recognition with spiking neural networks

Master Thesis (2024)

Author(s)

L.M.S.J. de Ghellinck d'Elseghem (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

C. Frenkel – Mentor (TU Delft - Electronic Instrumentation)

Francesco Fioranelli – Mentor (TU Delft - Microwave Sensing, Signals & Systems)

Federico Corradi – Graduation committee member (Stichting IMEC Nederland)

Faculty

Electrical Engineering, Mathematics and Computer Science

Radar Gesture recognition Spiking Neural Network (SNN)

To reference this document use:

https://resolver.tudelft.nl/uuid:4c8ce805-04ca-42aa-b080-91c194697b57

More Info

expand_more

Publication Year

2024

Language

English

Graduation Date

26-08-2024

Awarding Institution

Delft University of Technology

Programme

['Electrical Engineering']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Radar-based sensors are used to perceive their environment and objects of interest in a contactless manner and with robust performance in all weather and light conditions. One of the main drawbacks is the energy needed for the processing of radar data in order to extract its valuable information. Spiking neural networks are an emerging type of neural networks that aim to reduce the energy footprint of their computations while maintaining acceptable performance. To do so, the data is encoded through time in binary spikes to help leverage the low cost of additions. This is in stark opposition to the much higher cost of multiplications that are highly present in conventional artificial neural networks. The drawback of this energy gain is that the rate encoding adds an extra time dimension, hence increasing the latency between the acquisition of the radar data and the recognition of the corresponding gesture class.
More specifically, this work uses an air-marshalling dataset from the literature to exemplify a gesture recognition problem. The first step is to replicate the well-known radar processing pipeline, and classification approach based on conventional neural networks to reach high classification accuracies. A validation accuracy of 98.5% and a test accuracy of 59.8% are reached on the full dataset (11 classes) and 86.7% on their 5 best classes (test set), which is about the same performance reported in the original dataset baseline.
The following steps propose an adaptation of this non-spiking pipeline to its spiking equivalent by optimising the trade-off between the model’s latency, its memory requirements and its accuracy. This work also develops a strategy to tune spiking networks’ thresholds to make the process of developing a spiking equivalent more efficient. For example, the spiking network can reach 94.1% validation accuracy using 100 encoding steps and only 4.7% of the initial memory requirements, and reach 46.8% on the test set. However, this trade-off can be shifted towards lower latency, lower memory, or higher accuracy according to the desired requirements.

Files

Thesis_Lucie_de_Ghellinck.pdf

(pdf | 0 Mb)

License info not available

File under embargo until 26-08-2025