Interaction-aware Pedestrian Trajectory Prediction using Monocular Video in Automated Driving

Master Thesis (2022)
Author(s)

S.H.J. Tak (TU Delft - Mechanical Engineering)

Contributor(s)

Phillip Czech – Mentor (Mercedes-Benz)

D.M. Gavrila – Mentor (TU Delft - Intelligent Vehicles)

J.F.P. Kooij – Graduation committee member (TU Delft - Intelligent Vehicles)

Faculty
Mechanical Engineering
Copyright
© 2022 Seger Tak
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Seger Tak
Graduation Date
16-06-2022
Awarding Institution
Delft University of Technology
Programme
['Mechanical Engineering | Vehicle Engineering']
Faculty
Mechanical Engineering
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Pedestrian trajectory prediction is essential for developing safe autonomous driving systems. Such trajectories depend on various contextual cues, among which surrounding objects.

This work proposes the first pedestrian trajectory prediction method in the 2D on-board domain that models interactions between the pedestrian and surrounding static- and dynamic- contextual objects using a graph-based approach. Our two-stream model separately encodes past motion history and interactions. The encoded information from both streams is fused and decoded to generate future pedestrian trajectories. The interactions are modeled using spatial graphs, which are temporally connected using a Gated Recurrent Unit. The graph nodes represent the pedestrian and contextual objects, and the edges represent the interaction importance between nodes.

In experiments on the PIE and JAAD_full dataset, it is shown that our graph-based interaction-aware trajectory prediction method outperforms all considered baselines on nearly all metrics. Moreover, the performance gain on JAAD_full is most significant for the close-by pedestrians. Finally, modeling the interactions with all considered contextual objects, i.e. vehicles, crosswalks, and traffic lights, improves trajectory prediction performance most compared to only using a subset of these objects.

Files

Thesis_S_Tak_final.pdf
(pdf | 10.2 Mb)
License info not available