Online IC parameter adjustment of an active knee prosthesis using Reinforcement Learning with frequency-domain state representations

None, None

Online IC parameter adjustment of an active knee prosthesis using Reinforcement Learning with frequency-domain state representations

Master Thesis (2023)

Author(s)

C. Cetindag (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R.D. McAllister – Mentor (TU Delft - Team Koty McAllister)

M. Penubaku – Mentor

P. Mohajerinesfahani – Graduation committee member (TU Delft - Team Peyman Mohajerin Esfahani)

Faculty

Electrical Engineering, Mathematics and Computer Science

Copyright

Reinforcement learning Policy iteration Frequency-domain state representation Active knee prosthesis

To reference this document use:

https://resolver.tudelft.nl/uuid:6fea6ee2-863f-4081-a9e2-b97f53009a19

More Info

expand_more

Publication Year

2023

Language

English

Copyright

Graduation Date

29-08-2023

Awarding Institution

Delft University of Technology

Programme

['Electrical Engineering | Signals and Systems']

Abstract

Active knee prostheses are potent in assisting users, providing symmetry in walking, reducing metabolic costs, and preventing long-term health problems. The heart of their complex control algorithm employs the Impedance Control (IC) Law, which controls the torque output of the device by three parameters: stiffness coefficient, equilibrium angle, and damping coefficient. Ideally, these parameters should be personalized and adaptive to address interpersonal and intrapersonal variations on level-ground walking. However, current practices achieve personalization only through basic normalization and do not address adaptiveness. This thesis aims to utilize an RL framework with a novel frequency-domain state representation to address personalization and adaptiveness simultaneously. A policy iteration algorithm from the Q-learning family was chosen as the essence of the RL framework, and bellman error (BE) was chosen as the primary evaluation metric. The study revolves around two hypotheses. First, does the RL framework is suitable for the system? Second, does the frequency-domain state representation perform better than the time-domain state representation? Within the scope of the thesis, a custom environment was created by modifying the Humanoid-v04 environment of OpenAI Gym using MuJoCo (Multi-Joint dynamics with Contact). This environment is used to train the RL framework and conduct the experiments. Results suggest that the proposed RL framework can improve the system, and the frequency-domain state representation is superior to its time-domain counterpart. The latter conclusion has an impact beyond the active knee prosthesis domain and can inspire any trajectory following tasks with periodic signals.

Files

Cetindag2023thesis.pdf

(pdf | 0 Mb)

License info not available

File under embargo until 22-08-2025