Camera-and LiDAR-based Person Re-Identification

None, None; None, None

Camera-and LiDAR-based Person Re-Identification

Conference Paper (2025)

Author(s)

S.A. Krebs (TU Delft - Intelligent Vehicles, Mercedes-Benz)

D. Gavrila (TU Delft - Intelligent Vehicles)

Research Group

Intelligent Vehicles

DOI related publication

https://doi.org/10.1109/IV64158.2025.11097607

To reference this document use:

https://resolver.tudelft.nl/uuid:79c130be-98b3-4a5d-98e4-94b5dd9f6d3f

More Info

expand_more

Publication Year

2025

Language

English

Research Group

Intelligent Vehicles

Bibliographical Note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/publishing/publisher-deals Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Pages (from-to)

1408-1414

Publisher

IEEE

ISBN (electronic)

979-8-3315-3803-3

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In this paper, we introduce a novel method for creating appearance embeddings to identify individual persons using an object re-identification (ReID) framework. We present CLFormer (Camera LiDAR Transformer), a transformer-based architecture that incorporates multi-modal data from both camera and LiDAR sensors. We introduce the 3D Cuboid-Inclusive Point Embedding (3D-CIPE), which leverages rich data from LiDAR point clouds and 3D cuboids to add a learnable embedding into the transformer structure. Additionally, through ablation studies, we explore and analyze various strategies for the early and late fusion of multi-modal input data. To evaluate our proposed CLFormer, we reinterpret the nuScenes dataset [1] for ReID purposes and use it for our experiments. Our method demonstrates a significant improvement in performance, outperforming the image-only baseline with an increase of 2.3 in mean Average Precision (mAP).

Files

Camera-and_LiDAR-based_Person_... (pdf)

(pdf | 0.68 Mb)

- Embargo expired in 06-02-2026

License info not available