The Curse of Class Imbalance and Conflicting Metrics with Machine Learning for Side-channel Evaluations

Journal article (2018)

Authors

S. Picek University of Paris 8 (and 13 and CNRS), Cyber Security

Annelie Heuser INRIA/IRISA, University of Rennes, CNRS

Alan Jovic University of Zagreb

Shivam Bhasin Nanyang Technological University

Francesco Regazzoni University of Lugano

Research Group

Cyber Security (TU Delft)

DOI: https://doi.org/10.13154/tches.v2019.i1.209-237

Metrics, SMOTE, Profiled side-channel attacks, Imbalanced datasets, Synthetic examples

To reference this document use:

http://resolver.tudelft.nl/uuid:c7a9143f-7e0e-420b-9fa7-8ebe346adb6f

Published Date

2018

Language

English

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Intelligent Systems

Research Group

Cyber Security

Abstract

We concentrate on machine learning techniques used for profiled side-channel analysis in the presence of imbalanced data. Such scenarios are realistic and occur often, for instance in the Hamming weight or Hamming distance leakage models. To deal with the imbalanced data, we use various balancing techniques and show that most of them help in mounting successful attacks when the data is highly imbalanced. The results with the SMOTE technique are especially encouraging, since we observe some scenarios where it reduces the number of necessary measurements by more than a factor of 8. Next, we provide extensive results comparing machine learning and side-channel metrics, where we show that machine learning metrics (and especially accuracy, the most often used one) can be extremely deceptive. This finding creates a need to revisit previous works and their results in order to properly assess the performance of machine learning in side-channel analysis.
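The imbalance the abstract refers to can be illustrated with a short Python sketch. It is not the authors' code and uses no measured traces. It assumes an 8-bit intermediate value that is uniformly distributed, counts how many values fall into each Hamming weight class, and shows the accuracy of a classifier that always predicts the largest class. The helper `smote_like` is a hypothetical, simplified stand-in for the interpolation idea behind SMOTE (a synthetic point is placed between a minority sample and one of its nearest minority neighbours); it is not a library API and not the implementation evaluated in the paper.

```python
import random
from collections import Counter
from math import comb


def hamming_weight(x: int) -> int:
    """Number of set bits in x."""
    return bin(x).count("1")


# Class sizes under the assumption that all 256 byte values are equally likely.
hw_counts = Counter(hamming_weight(v) for v in range(256))
print(dict(sorted(hw_counts.items())))
# {0: 1, 1: 8, 2: 28, 3: 56, 4: 70, 5: 56, 6: 28, 7: 8, 8: 1}

# The class sizes follow the binomial coefficients C(8, k).
expected = {k: comb(8, k) for k in range(9)}

# A classifier that always outputs the majority class never looks at a trace,
# yet its accuracy equals the majority-class share of the data.
majority_class, majority_size = hw_counts.most_common(1)[0]
majority_accuracy = majority_size / 256
print(majority_class, majority_accuracy)  # 4 0.2734375


def smote_like(minority, n_new, k=2, rng=None):
    """Hypothetical helper: create n_new synthetic points, each interpolated
    between a minority sample and one of its k nearest minority neighbours."""
    rng = rng or random.Random(0)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of base inside the minority class (base excluded).
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from base to other
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, other)))
    return synthetic


# Toy two-feature "minority class"; real inputs would be trace features.
toy_minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
toy_synthetic = smote_like(toy_minority, n_new=5)
```

Under these assumptions the Hamming weight 4 class is 70 times larger than the classes for weights 0 and 8, and the constant predictor reaches roughly 27% accuracy while giving an attacker nothing that distinguishes one key candidate from another. This is one way to see why accuracy alone can be a misleading measure of attack performance.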

Files

Document_1_.pdf

(.pdf | 1.2 Mb)