Strength in Numbers

Improving Generalization with Ensembles in Machine Learning-based Profiled Side-channel Analysis

Journal Article (2020)
Author(s)

Guilherme Perin (Riscure, TU Delft - Cyber Security)

Łukasz Chmielewski (Riscure)

Stjepan Picek (TU Delft - Cyber Security)

Research Group
Cyber Security
Copyright
© 2020 G. Perin, Łukasz Chmielewski, S. Picek
DOI related publication
https://doi.org/10.13154/tches.v2020.i4.337-364
More Info
expand_more
Publication Year
2020
Language
English
Copyright
© 2020 G. Perin, Łukasz Chmielewski, S. Picek
Research Group
Cyber Security
Issue number
4
Volume number
2020
Pages (from-to)
337-364
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The adoption of deep neural networks for profiled side-channel attacks provides powerful options for leakage detection and key retrieval of secure products. When training a neural network for side-channel analysis, it is expected that the trained model can implement an approximation function that can detect leaking side-channel samples and, at the same time, be insensible to noisy (or non-leaking) samples. This outlines a generalization situation where the model can identify the main representations learned from the training set in a separate test set.
This paper discusses how output class probabilities represent a strong metric when conducting the side-channel analysis. Further, we observe that these output probabilities are sensitive to small changes, like selecting specific test traces or weight initialization for a neural network. Next, we discuss the hyperparameter tuning, where one commonly uses only a single out of dozens of trained models, where each of those models will result in different output probabilities. We show how ensembles of machine learning models based on averaged class probabilities can improve generalization. Our results emphasize that ensembles increase a profiled side-channel attack’s performance and reduce the variance of results stemming from different hyperparameters, regardless of the selected dataset or leakage model.