Minimising Data-Layout and Copy Overhead

None, None

Minimising Data-Layout and Copy Overhead

A Memory-Management Study of an EEG Biomarker Pipeline

Bachelor Thesis (2026)

Author(s)

S.L. Lelie (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

R. Guerra Marroquim – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Arthur Avramiea – Mentor

Faculty

Electrical Engineering, Mathematics and Computer Science

Python EEG Memory management Performance Optimization Zero-copy Numpy Lazy evaluation Data-layout overhead

To reference this document use

https://resolver.tudelft.nl/uuid:9ee0d6a0-557f-4770-aa4e-e04da715f776

More Info

expand_more

Publication Year

2026

Language

English

Graduation Date

26-06-2026

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Downloads counter

3

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Electroencephalography (EEG) biomarker pipelines are usually assumed to be limited by statistical computation, but much of their cost is data movement: copying, reshaping, and indexing arrays through high-level Python abstractions. We profile the data-reshaping (reduce) stage of the Neurophysiological Biomarker Toolbox, a Python EEG-analysis framework, and find it performs no arithmetic at all: its cost is an eager deep copy of the per-subject container, which also doubles the stage’s peak memory by allocating a full duplicate of the data.
We evaluate three memory-management strategies (zero-copy array views, layout pinning, and lazy materialisation) against an unmodified baseline, verifying that every variant reproduces the baseline’s statistical outputs exactly. Zero-copy views remove the duplicate, cutting reduce-stage peak memory from gigabytes to near zero; this lowers worst-case (tail) latency and, under the parallel load of a cohort sweep, lifts throughput by up to 4.4× by keeping concurrent workers out of swap. Layout pinning and lazy materialisation act only when subjects have repeated sessions, where lazy materialisation cuts reduce-stage peak memory by two orders of magnitude.
The reduce stage is thus effectively eliminated as a cost. The end-to-end speedup is a more modest 1.3×, bounded not by the optimisation but by a separate, arithmetic-bound statistics step that lies outside this paper’s scope and which we flag as the natural next target. The practical recommendation is to eliminate eager deep copies first: a small change that removes the memory doubling and, under parallel load, keeps a cohort sweep out of swap.

Files

Final_paper_Sem_Lelie.pdf

(pdf | 0.543 Mb)

License info not available