Deep end-to-end 3D person detection from Camera and Lidar

Conference paper (2019)

Authors

M. Roth Intelligent Vehicles - Mechanical, Maritime and Materials Engineering , Daimler AG

Dominik Jargot Student

D. Gavrila Intelligent Vehicles - Mechanical, Maritime and Materials Engineering

Research Group

Intelligent Vehicles (Mechanical, Maritime and Materials Engineering) (TU Delft)

DOI: https://doi.org/10.1109/ITSC.2019.8917366

To reference this document use:

http://resolver.tudelft.nl/uuid:83f7a017-a713-4009-9505-f758f58c07e1

More Info

expand_more

Published Date

2019

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Faculty

Mechanical, Maritime and Materials Engineering

Department

Cognitive Robotics

Research Group

Intelligent Vehicles

Abstract

We present a method for 3D person detection from camera images and lidar point clouds in automotive scenes. The method comprises a deep neural network which estimates the 3D location and extent of persons present in the scene. 3D anchor proposals are refined in two stages: a region proposal network and a subsequent detection network.For both input modalities high-level feature representations are learned from raw sensor data instead of being manually designed. To that end, we use Voxel Feature Encoders [1] to obtain point cloud features instead of widely used projection-based point cloud representations, thus allowing the network to learn to predict the location and extent of persons in an end-to-end manner.Experiments on the validation set of the KITTI 3D object detection benchmark [2] show that the proposed method outperforms state-of-the-art methods with an average precision (AP) of 47.06% on moderate difficulty.

Files

Roth2019itsc_lidar_person_dete... (pdf)

(pdf | 3.05 Mb)

Unknown license

08917366.pdf

(pdf | 3.06 Mb)

Unknown license

Download not available