Circular Image

A. Palffy

info

Please Note

10 records found

Conference paper (2025) - M. Hassan, A. Palffy, F. Fioranelli, A. Yarovoy, S. Ravindran, D. Gavrila
A data driven method is proposed to obtain free space segmentation using automotive radar point clouds. It aggregates automotive radar detection points from multiple timestamps, projects them into a Birds-Eye-View grid-based representation, and applies a semantic segmentation Neural Network (NN) to classify each grid for free space segmentation. A lidar based supervision is used to generate the ground truth for training. Moreover, debris objects are manually annotated to enable the NN model to learn to detect these uncommon objects. Experimental results on a proprietary 4D Imaging Radar dataset demonstrate that the proposed method gives improved free space segmentation as compared to other baseline methods. ...
Journal article (2025) - F. Fent, A. Palffy, H. Caesar
The perception of autonomous vehicles has to be efficient, robust, and cost-effective. However, cameras are not robust against severe weather conditions, lidar sensors are expensive, and the performance of radar-based perception is still inferior to the others. Camera-radar fusion methods have been proposed to address this issue, but these are constrained by the typical sparsity of radar point clouds and often designed for radars without elevation information. We propose a novel camera-radar fusion approach called Dual Perspective Fusion Transformer (DPFT), designed to overcome these limitations. Our method leverages lower-level radar data (the radar cube) instead of the processed point clouds to preserve as much information as possible and employs projections in both the camera and ground planes to effectively use radars with elevation information and simplify the fusion with camera data. As a result, DPFT has demonstrated state-of-the-art performance on the K-Radar dataset while showing remarkable robustness against adverse weather conditions and maintaining a low inference time. ...

A Data-Driven Radar Detector Trained by Lidar

Conference paper (2024) - Ignacio Roldan, Andras Palffy, Julian F.P. Kooij, Dariu M. Gavrila, Francesco Fioranelli, Alexander Yarovoy
In this paper, we address the limitations of traditional constant false alarm rate (CFAR) target detectors in automotive radars, particularly in complex urban environments with multiple objects that appear as extended targets. We propose a data-driven radar target detector exploiting a highly efficient 2D CNN backbone inspired by the computer vision domain. Our approach is distinguished by a unique cross-sensor supervision pipeline, enabling it to learn exclusively from unlabeled synchronized radar and lidar data, thuseliminating the need for costly manual object annotations. Using a novel large-scale, real-life multi-sensor dataset recorded in various driving scenarios, we demonstrate that the proposed detector generates dense, lidar-like point clouds, achieving a lower Chamfer distance to the reference lidar point clouds than CFAR detectors. Overall, it significantly outperforms CFAR baselines detection accuracy. ...
The detection of multiple extended targets in complex environments using high-resolution automotive radar is considered. A data-driven approach is proposed where unlabeled synchronized lidar data are used as ground truth to train a neural network (NN) with only radar data as input. To this end, the novel, large-scale, real-life, and multisensor RaDelft dataset has been recorded using a demonstrator vehicle in different locations in the city of Delft, The Netherlands. The dataset, as well as the documentation and example code, is publicly available for those researchers in the field of automotive radar or machine perception. The proposed data-driven detector can generate lidar-like point clouds (PCs) using only radar data from a high-resolution system, which preserves the shape and size of extended targets. The results are compared against conventional constant false alarm rate (CFAR) detectors as well as variations of the method to emulate the available approaches in the literature, using the probability of detection, the probability of false alarm, and the Chamfer distance (CD) as performance metrics. Moreover, an ablation study was carried out to assess the impact of Doppler and temporal information on detection performance. The proposed method outperforms different baselines in terms of CD, achieving a reduction of 77% against conventional CFAR detectors and 28% against the modified state-of-the-art deep learning (DL)-based approaches. ...
Conference paper (2023) - Fangqiang Ding, Andras Palffy, Dariu Gavrila, Chris Xiaoxuan Lu
This work proposes a novel approach to 4D radar-based scene flow estimation via cross-modal learning. Our approach is motivated by the co-located sensing redundancy in modern autonomous vehicles. Such redundancy implicitly provides various forms of supervision cues to the radar scene flow estimation. Specifically, we introduce a multi-task model architecture for the identified cross-modal learning problem and propose loss functions to opportunistically engage scene flow estimation using multiple cross-modal constraints for effective model training. Extensive experiments show the state-of-the-art performance of our method and demonstrate the effectiveness of cross-modal super-vised learning to infer more accurate 4D radar scene flow. We also show its usefulness to two subtasks - motion segmentation and ego-motion estimation. Our source code will be available on https://github.com/Toytiny/CMFlow. ...
Early and accurate detection of crossing pedestrians is crucial in automated driving in order to perform timely emergency manoeuvres. However, this is a difficult task in urban scenarios where pedestrians are often occluded (not visible) behind objects, e.g., other parked vehicles. We propose an occlusion aware fusion of stereo camera and radar sensors to address scenarios with crossing pedestrians behind such parked vehicles. Our proposed method adapts both the expected rate and properties of detections in different areas according to the visibility of the sensors. In our experiments on a real-world dataset, we show that the proposed occlusion aware fusion of radar and stereo camera detects the crossing pedestrians on average 0.26 seconds earlier than using the camera alone, and 0.15 seconds earlier than fusing the sensors without occlusion information. Our dataset containing 501 relevant recordings of pedestrians behind vehicles will be publicly available on our website for non-commercial, scientific use. ...
Doctoral thesis (2022) - A. Palffy
This thesis addresses the problem of object detection with automotive radar sensors in the field of intelligent vehicles with special attention to vulnerable road users: pedestrians, cyclists, and motorcyclists. It is not the goal of this work to improve the hardware design or signal processing algorithms of the radar sensors themselves, but to take the output of these sensors as “given”, and to propose various ways to use them for road user detection. To facilitate the reading, a brief introduction to the operating principles of radar sensors and their advantages and limitations is first given. Subsequently, the thesis discusses automotive radar based detection of road users based on three different types of radar data.... ...
Next-generation automotive radars provide elevation data in addition to range-, azimuth- and Doppler velocity. In this experimental study, we apply a state-of-the-art object detector (PointPillars), previously used for LiDAR 3D data, to such 3+1D radar data (where 1D refers to Doppler). In ablation studies, we first explore the benefits of the additional elevation information, together with that of Doppler, radar cross section and temporal accumulation, in the context of multi-class road user detection. We subsequently compare object detection performance on the radar and LiDAR point clouds, object class-wise and as a function of distance. To facilitate our experimental study, we present the novel View-of-Delft (VoD) automotive dataset. It contains 8693 frames of synchronized and calibrated 64-layer LiDAR-, (stereo) camera-, and 3+1D radar-data acquired in complex, urban traffic. It consists of 123106 3D bounding box annotations of both moving and static objects, including 26587 pedestrian, 10800 cyclist and 26949 car labels. Our results show that object detection on 64-layer LiDAR data still outperforms that on 3+1D radar data, but the addition of elevation information and integration of successive radar scans helps close the gap. The VoD dataset is made freely available for scientific benchmarking. ...
Journal article (2020) - A. Palffy, Jiaao Dong, J. F. P. Kooij, D. M. Gavrila
This letter presents a novel radar based, single-frame, multi-class detection method for moving road users ( pedestrian, cyclist, car ), which utilizes low-level radar cube data. The method provides class information both on the radar target- and object-level. Radar targets are classified individually after extending the target features with a cropped block of the 3D radar cube around their positions, thereby capturing the motion of moving parts in the local velocity distribution. A Convolutional Neural Network (CNN) is proposed for this classification step. Afterwards, object proposals are generated with a clustering step, which not only considers the radar targets’ positions and velocities, but their calculated class scores as well. In experiments on a real-life dataset we demonstrate that our method outperforms the state-of-the-art methods both target- and object-wise by reaching an average of 0.70 (baseline: 0.68) target-wise and 0.56 (baseline: 0.48) object-wise F1 score. Furthermore, we examine the importance of the used features in an ablation study. ...
Conference paper (2019) - András Pálffy, Julian Kooij, Dariu Gavrila
Early and accurate detection of crossing pedestrians is crucial in automated driving to execute emergency manoeuvres in time. This is a challenging task in urban scenarios however, where people are often occluded (not visible) behind objects, e.g. other parked vehicles. In this paper, an occlusion aware multi-modal sensor fusion system is proposed to address scenarios with crossing pedestrians behind parked vehicles. Our proposed method adjusts the detection rate in different areas based on sensor visibility. We argue that using this occlusion information can help to evaluate the measurements. Our experiments on real world data show that fusing radar and stereo camera for such tasks is beneficial, and that including occlusion into the model helps to detect pedestrians earlier and more accurately. ...