Traffic Gesture Classification for Intelligent Vehicles

Abstract

Self-driving vehicles have developed rapidly in recent years and continue to move towards full autonomy. For high or full automation, self-driving vehicles must be able to handle a broad range of situations, one of which is interaction with traffic agents. For correct and safe maneuvering through these situations, reliable detection of agents followed by accurate classification of the traffic gestures they use is essential. This problem has received limited attention in the literature to date. The objective of this work is to establish and investigate a working traffic gesture pipeline by leveraging the latest developments in computer vision and machine learning. This work investigates and compares how well state-of-the-art methods translate to traffic gesture recognition and which application-specific problems are encountered. Multiple configurations based on skeletal features, estimated using OpenPose and classified using recurrent neural networks (RNNs), were investigated. Skeleton estimation using OpenPose and the feature representations were evaluated on an action recognition dataset with motion-capture ground truth. Three RNN architectures, varying in complexity and size, were evaluated on traffic gestures. The robustness of the developed system to viewpoint variation is explored, together with the viability of transfer learning for traffic gestures. To train and validate these methods, a new traffic gesture dataset is introduced, on which an mAP of 0.70 is achieved. The results show that the proposed methods can classify traffic gestures within reasonable computation time and illustrate the value of transfer learning for gesture recognition. These promising results validate the methodology and show that this direction warrants further research.
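To make the pipeline concrete, the sketch below illustrates the general shape of the approach described above: per-frame skeleton keypoints (such as the 25 joints produced by OpenPose's BODY_25 model) are flattened into feature vectors and fed through a recurrent network whose final hidden state is read out as gesture class probabilities. This is a minimal hand-rolled Elman RNN forward pass with random weights, not the thesis's actual trained architecture; all dimensions, class labels, and weights are illustrative assumptions.

```python
import numpy as np

# Illustrative dimensions (assumptions, not from the thesis).
N_JOINTS = 25               # OpenPose BODY_25 keypoints per person
N_FEATURES = 2 * N_JOINTS   # (x, y) coordinates per joint
HIDDEN = 32                 # recurrent hidden-state size
N_CLASSES = 4               # e.g. stop, go, slow down, turn (hypothetical)

rng = np.random.default_rng(0)
# Random (untrained) weights; a real system would learn these.
W_xh = rng.normal(0.0, 0.1, (HIDDEN, N_FEATURES))   # input -> hidden
W_hh = rng.normal(0.0, 0.1, (HIDDEN, HIDDEN))       # hidden -> hidden
W_hy = rng.normal(0.0, 0.1, (N_CLASSES, HIDDEN))    # hidden -> logits

def classify_sequence(frames):
    """Classify a gesture from a (T, N_FEATURES) keypoint sequence."""
    h = np.zeros(HIDDEN)
    for x in frames:
        # Elman recurrence: new state mixes current frame and past state.
        h = np.tanh(W_xh @ x + W_hh @ h)
    logits = W_hy @ h                     # read out the final hidden state
    exp = np.exp(logits - logits.max())   # numerically stable softmax
    return exp / exp.sum()                # probability per gesture class

# Usage: a synthetic 30-frame skeleton sequence stands in for real poses.
probs = classify_sequence(rng.normal(size=(30, N_FEATURES)))
print(probs.shape, float(probs.sum()))
```

In practice the recurrent cell would be an LSTM or GRU trained on labeled gesture sequences, but the data flow (keypoints per frame in, class distribution out) is the same.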