Reinforcement Learning Compensated Filter for Position and Orientation Estimation

None, None

Reinforcement Learning Compensated Filter for Position and Orientation Estimation

Master Thesis (2021)

Author(s)

Hao LI (TU Delft - Mechanical Engineering)

Contributor(s)

Wei Pan – Mentor (TU Delft - Robot Dynamics)

Y. Tang – Graduation committee member (TU Delft - Robot Dynamics)

Manon Kok – Graduation committee member (TU Delft - Team Manon Kok)

Faculty

Mechanical Engineering

Copyright

Orientation Estimation EKF DRL 2D Plane Localization

To reference this document use:

https://resolver.tudelft.nl/uuid:9e27b7eb-fa2d-419a-ad9d-d181b0953740

More Info

expand_more

Publication Year

2021

Language

English

Copyright

Graduation Date

03-09-2021

Awarding Institution

Delft University of Technology

Programme

Mechanical Engineering | Vehicle Engineering | Cognitive Robotics

Faculty

Mechanical Engineering

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Pose estimation provides accurate position and orientation information of the intelligent agents in real time. The accuracy of the estimation directly affects the performance of sequential tasks such as mapping, motion planning, and control. EKF (Extended Kalman Filter) is a standard theory for nonlinear pose estimation by modeling state uncertainty to Gaussian distribution. However, EKF has requirements for proper initial estimate and system noise to obtain bounded optimal estimate. Meanwhile, model nonlinearity and non-gaussian noise modeling affect the performance of EKF significantly in practical applications. In this thesis, we focus on improving the performance of nonlinear pose estimation by reinforcement learning. By formulating an EKF measurement update as a Markov Decision Process (MDP), reinforcement learning agents can be trained to learn the estimator gain through data samples and executed as the online estimator for pose estimation tasks.

Based on the above idea, we propose a novel reinforcement learning-compensated EKF estimator (RLC-EKF), where the RL agent serves as a second-time measurement update that subsides the residual error from the standard EKF estimate. The estimator is developed and testified on two specific pose estimation scenarios. Firstly, as a continuous work from the previous study, a framework for 3 DOF orientation estimation using inertial sensor and magnetometer is replicated. Then, the framework is extended by different RL algorithms training and multi-scale robustness validation. Besides, we implement the estimator on a feature-based 2D plane localization framework. The proposed framework shows the feasibility of the underlying algorithm on a localization task with a known map. As a result, the RLC-EKF estimator gives superior performance and convincible robustness compared to classical methods in severe conditions such as varying initial states, degree of noise intensities, and model covariance.

Files

Master_Thesis_Hao_5008891.pdf

(pdf | 20.2 Mb)

License info not available