B. Özkan | TU Delft Repository

Towards Minimal Certificates for Federated Space Public Key Infrastructure

Master thesis (2025) - A. Roșu (author) , G. Smaragdakis (mentor) , E.A. Markatou (mentor) , O.A. Graur (mentor) , Burcu Külahçıoğlu Özkan (graduation committee member) , M.A. Costea (graduation committee member)

Federated Space Public Key Infrastructure (PKI) can offer a scalable foundation for secure and interoperable communications in collaborative space missions. Yet, its deployment faces challenges stemming from resource-constrained assets, architectural complexity, and the transition to post-quantum (PQ) cryptography. Current CCSDS space guidelines rely on the Internet X.509 profile, whose extensive feature set—if left unrestricted—can increase implementation complexity, certificate size (especially under PQ algorithms), and the risk of interoperability issues. In parallel, the IETF C509 Certificates draft emerges as a streamlined subset of X.509 with a compact encoding specifically tailored for constrained environments. This paper provides an empirical comparison between X.509 and C509 to inform space mission designers about the associated advantages and costs of each, specifically when PQ cryptography is incorporated into space PKIs. To help pave the way for interoperability in federated space missions, a minimal certificate profile for space PKI is proposed.

In addition, the work introduces the first open-source native C509 toolkit that supports PQ algorithms and evaluates open-source and proprietary certificate parsers. While the IETF C509 draft proposal reports a size reduction of over 50%, our evaluation confirms approximately 40% savings for classical certificates generated according to our proposed minimal certificate profile. For PQ certificates, the savings plateau at around 200 bytes, rendering the size gains negligible. However, revocation lists consistently achieve a 60% reduction for 30,000 entries, independent of the cryptographic scheme (PQ or traditional). To quantify and compare the software implementation complexity of X.509 and C509, we conduct software complexity analysis using well-established heuristic metrics (e.g., cyclomatic complexity, Halstead metrics, logical lines of code). The findings further highlight the relative simplicity of the C509 parser implementation in software. Defining a standardised certificate profile for federated space would advance interoperability; however, adopting C509 requires carefully balancing modest PQ size savings against software simplification and the uncertainties associated with a draft standard.
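The contrast between the roughly 40% saving for classical certificates and the roughly 200-byte plateau for PQ certificates comes down to simple arithmetic: a fixed absolute saving shrinks as a fraction when the certificate grows. The sketch below illustrates this. The 200-byte figure is from the abstract; the certificate sizes of 500 and 4000 bytes and the function name `relative_saving` are hypothetical values chosen for illustration, not measurements from the thesis.

```python
def relative_saving(x509_bytes, saved_bytes):
    """Fraction of the X.509 certificate size removed by a fixed byte saving."""
    return saved_bytes / x509_bytes


# Hypothetical certificate sizes (assumptions, not thesis data).
examples = [
    ("classical certificate, assumed 500 B", 500),
    ("PQ certificate, assumed 4000 B", 4000),
]

for label, size in examples:
    # The same 200-byte saving is applied to both sizes.
    print(f"{label}: {relative_saving(size, 200):.0%} saved")
```

Under these assumed sizes, the same 200 bytes is a large share of a small classical certificate but only a few percent of a PQ certificate dominated by its key and signature.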

Through the Dependency Maze: A Data-Driven Approach to Dependency Risk Prioritization

Master thesis (2025) - T. Tataru (author) , Georgios Smaragdakis (mentor) , Y. Zhauniarovich (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member) , Eduardo Bárbaro (mentor)

Modern software depends heavily on third-party open-source libraries, with the vast majority of applications incorporating external components. While this dependency-driven development accelerates innovation, it creates significant security risks through complex, deeply nested dependency graphs where vulnerabilities can propagate across thousands of downstream systems.

While Software Composition Analysis (SCA) tools effectively identify known vulnerabilities, they generate overwhelming alert volumes in large organizations. Our analysis shows that over 8% of dependencies have known vulnerabilities, with each vulnerable version appearing multiple times across projects. This results in dozens of alerts per project, making manual triage infeasible.

This thesis presents a data-driven approach to prioritizing dependency risk, addressing the challenge of identifying the most critical security threats within the overwhelming volumes of alerts generated by SCA tools. The methodology integrates multiple risk indicators, including severity scores, exploit prediction metrics, known exploitation evidence, dependency freshness measures, and license compliance risks into a unified feature set. To capture transitive risk propagation while maintaining focus on actionable components, the framework applies a depth-weighted aggregation technique that assigns exponentially decreasing weights to deeper dependencies. Prioritization is performed using an autoencoder-based model, which leverages reconstruction error to rank dependencies by risk.
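The depth-weighted aggregation step can be illustrated with a minimal sketch. It assumes a weight of `decay ** depth` and uses the shallowest depth at which each dependency is reached. The decay factor of 0.5, the toy graph, the risk values, and the function name `aggregate_risk` are all assumptions for illustration and are not taken from the thesis. The autoencoder ranking is not shown.

```python
from collections import deque


def aggregate_risk(graph, own_risk, root, decay=0.5):
    """Sum the risk of root and its transitive dependencies.

    A dependency first reached at depth d contributes own_risk * decay**d,
    so deeper dependencies receive exponentially smaller weights.
    """
    depth = {root: 0}
    queue = deque([root])
    while queue:  # breadth-first search yields the shallowest depth per node
        node = queue.popleft()
        for child in graph.get(node, []):
            if child not in depth:
                depth[child] = depth[node] + 1
                queue.append(child)
    return sum(own_risk.get(n, 0.0) * decay ** d for n, d in depth.items())


# Toy dependency graph: app -> a, b; a -> c; b -> c (hypothetical data).
graph = {"app": ["a", "b"], "a": ["c"], "b": ["c"], "c": []}
own_risk = {"app": 0.0, "a": 2.0, "b": 4.0, "c": 8.0}

score = aggregate_risk(graph, own_risk, "app")
print(score)  # 2*0.5 + 4*0.5 + 8*0.25 = 5.0
```

Here the transitive dependency `c` has the highest individual risk but contributes only a quarter of it, which keeps the aggregate focused on components closer to the project.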

The framework was evaluated on thousands of real-world dependencies and showed promise in ranking components based on complex, multi-dimensional risk signals. It prioritized not only dependencies with extreme values in individual indicators but also those with unusual combinations across dimensions, including risks buried in transitive relationships. In a preliminary validation study, expert reviewers agreed with the model’s prioritizations in 96.7% of cases, highlighting its practical relevance and alignment with expert opinion.

By integrating diverse risk indicators, modeling transitive influence, and leveraging autoencoders, this work provides a practical framework for identifying high-risk dependencies in complex software ecosystems. It reduces noise in vulnerability alerts, highlights truly critical components, and supports more focused remediation. While not a replacement for expert judgment, the framework complements existing practices, representing a step toward more adaptive and risk-aware approaches within modern software ecosystems.

Global-State Querying in Stream Processing using Snapshots

Master thesis (2025) - S.S. Kshirsagar (author) , Asterios Katsifodimos (mentor) , Kyriakos Psarakis (mentor) , George Christodoulou (mentor) , George Iosifidis (graduation committee member) , Burcu Kulahcioglu Ozkan (graduation committee member)

Stateful Functions-as-a-Service (SFaaS) platforms, such as Styx, are emerging as powerful abstractions for building distributed, serverless cloud applications. By combining the abilities of FaaS with strong transactional guarantees, they enable complex, stateful workflows without ...

Fair Transaction Ordering on DAGs

Preventing MEV extraction without sacrificing practicality

Master thesis (2025) - G. Segalini (author) , Jérémie Decouchant (mentor) , Georgios Smaragdakis (graduation committee member) , Burcu Kulahcioglu Ozkan (graduation committee member)

Blockchain technologies enable the decentralized storage and verification of records, such as financial transactions.
Systems like Bitcoin and Ethereum see considerable usage and have market values on the order of tens of billions of dollars.
A recent evolution of blockc ...

Program Matching with Semantic Patterns

Master thesis (2025) - P.J. Vunderink (author) , L. Miljak (mentor) , J.G.H. Cockx (mentor) , Burcu Külahçıoğlu Özkan (graduation committee member)

Current tools for pattern matching computer programs often operate on abstract syntax trees or other static representations of programs. These approaches, though efficient, are fundamentally limited when it comes to capturing the dynamic behavior of programs. For example, it is not always possible to express (concise) syntactic patterns that capture programs which are semantically equivalent but differ in their syntactic representation. A tool that takes into account the behavior (or dynamic semantics) of programs would be able to capture programs that are semantically equivalent in a more concise manner with a single pattern. Additionally, taking into account program behavior leads to more precise pattern matching, by excluding unreachable paths of computation. In this thesis, we explore a novel method, based on behavioral models of programs, that allows patterns to take into account the dynamic semantics of a program. We propose the Dyno pattern language, in which concrete object language syntax can be used to express intuitive semantic patterns of programs. Pattern matching is performed by translating Dyno patterns to μ-calculus formulas and model checking these formulas against models extracted from object programs. Because our method is based on dynamic models of programs, we are fundamentally limited by the halting problem. In favor of precision, our method compromises on efficiency and termination guarantees. In particular, termination is not guaranteed when the extracted model of a program has infinitely many states. To recover termination in some cases, we provide the facility to express bounds on input parameters, limiting the search space while compromising on soundness. We recognize some limitations in our work, including a lack of match evidence (e.g. the location of a match in the object program’s syntax tree), as well as holes in Dyno’s expressiveness. To address the latter issue, we suggest operators that could be added to Dyno in the future.
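To give a flavour of model checking a μ-calculus formula against a behavioral model, the sketch below evaluates the reachability formula μX. p ∨ ◇X on a small finite transition system by fixpoint iteration. It is not the Dyno translation or the thesis's model checker. The function name `eventually`, the state space, and the transition relation are hypothetical.

```python
def eventually(states, succ, holds):
    """Evaluate mu X. p or <>X on a finite transition system.

    Returns the set of states from which some path reaches a state where
    `holds` is true, computed as a least fixpoint starting from the empty set.
    """
    current = set()
    while True:
        nxt = {
            s
            for s in states
            if holds(s) or any(t in current for t in succ.get(s, ()))
        }
        if nxt == current:  # fixpoint reached
            return current
        current = nxt


# Hypothetical model: 0 -> 1 -> 2 -> 2, and an isolated loop 3 -> 3.
states = {0, 1, 2, 3}
succ = {0: [1], 1: [2], 2: [2], 3: [3]}

result = eventually(states, succ, lambda s: s == 2)
print(sorted(result))  # [0, 1, 2]
```

The iteration terminates here because the state set is finite. With infinitely many states the set could keep growing, which is the termination problem the abstract describes.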

PGFuzz: Coverage Guided Testing of Graph Processing Applications

Master thesis (2024) - M.W.M. Oudemans (author) , Burcu Külahçıoğlu Özkan (graduation committee member) , Arie Van Deursen (graduation committee member) , J.G.H. Cockx (graduation committee member) , Stafania Dumbrava (graduation committee member)

The rise of graph processing has led to an increase in the usage of graph databases and the availability of various frameworks. Graph databases have become more accessible and, in specific instances, can compete with relational databases. Testing an application with a relational ...

Self-Supervised Representation Learning for Relational Multimodal Data

Should we combine multiple pretext tasks?

Bachelor thesis (2024) - I. Mc Auliffe (author) , Kubilay Atasu (mentor) , T.A. Akyıldız (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member)

Deep Learning models can use pretext tasks to learn representations on unlabelled datasets. Although there have been several works on representation learning and pre-training, to the best of our knowledge combining pretext tasks in a multi-task setting for relational multimodal d ...

A Comparative Study of Fine-Tuning Pipelines for Integrating Large Language Models in Multimodal Data Analysis

Bachelor thesis (2024) - C. Grîu (author) , Kubilay Atasu (mentor) , T.A. Akyıldız (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member)

While LLMs are proficient in processing textual information, integrating them with other models presents significant challenges.
This study evaluates the effectiveness of various configurations for integrating a large language model (LLM) with models capable of handling multi ...

How to improve the performance of the fused architecture consisting of a tabular transformer and a graph neural network used for representation learning for multimodal data?

Bachelor thesis (2024) - D.D. Drashkov (author) , Kubilay Atasu (mentor) , T.A. Akyıldız (mentor) , Burcu Ozkan (graduation committee member)

The substantial amount of tabular data can be attributed to its storage convenience. There is a high demand for learning useful information from the data. To achieve that, machine learning models, called transformers, have been created. They can find patterns in the data, learn f ...

Continuous Improvement of Driving Automation

Using Safety Performance Indicators and Hazardous Scenario Identification

Master thesis (2024) - M.M. Selva Kumar (author) , RR Venkatesha Prasad (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member) , Andrei Terechko (mentor)

The rapid advancement of automated vehicles (AVs) can potentially improve transportation. However, ensuring the safety and reliability of Automated Driving Systems (ADS) remains a critical challenge, particularly when facing the expansion of Operational Design Domains (ODDs) and the continuous emergence of unknown hazardous scenarios. This thesis aims to address these challenges by developing a framework for monitoring the safety of multi-channel ADS and identifying hazardous scenarios using Safety Performance Indicators (SPIs) and Hazardous Scenario Identification (HSI) techniques.

The proposed SPI framework, based on the principles outlined in the UL 4600 standard, encompasses a comprehensive set of metrics for assessing the safety and performance of ADS. These metrics cover various critical functionalities, such as ego localization, object detection, trajectory planning, and overall ADS behaviour. By defining appropriate thresholds for each SPI, the framework enables the identification of potential safety issues and supports the continuous monitoring and improvement of ADS.
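The idea of flagging potential safety issues by comparing each SPI against a threshold can be sketched as follows. The SPI names, threshold values, and the `Spi` and `violations` identifiers are hypothetical and chosen only for illustration. They are not the metrics or thresholds defined in the thesis or in UL 4600.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Spi:
    name: str
    threshold: float
    higher_is_worse: bool = True  # if False, values below the threshold are flagged


def violations(spis, measurements):
    """Return the names of SPIs whose measured value breaches its threshold."""
    flagged = []
    for spi in spis:
        value = measurements.get(spi.name)
        if value is None:  # no measurement available for this SPI
            continue
        if spi.higher_is_worse:
            breached = value > spi.threshold
        else:
            breached = value < spi.threshold
        if breached:
            flagged.append(spi.name)
    return flagged


# Hypothetical SPIs: localization error (metres) and time headway (seconds).
spis = [
    Spi("localization_error_m", 0.5),
    Spi("time_headway_s", 1.0, higher_is_worse=False),
]

flagged = violations(spis, {"localization_error_m": 0.8, "time_headway_s": 1.4})
print(flagged)  # ['localization_error_m']
```

A threshold set too tightly would flag benign situations and one set too loosely would miss hazards, so the threshold values strongly influence false positives and negatives.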

The HSI module, developed as part of this thesis, leverages the SPI framework and the NXP Daruma cross-channel analysis to detect hazardous scenarios. The HSI module's performance is evaluated using the CARLA simulator and advanced ADS software stacks (LAV and TFUSE) across diverse driving scenarios. The results demonstrate the HSI module's effectiveness in identifying hazardous scenarios such as ego vehicle tailgating, inconsistent ego localization, and ego vehicle being tailgated. However, our analysis also reveals challenges in terms of false positives and negatives, highlighting the need for further improvements in the ADS's perception and localization functionalities and in tuning the SPI thresholds appropriately based on testing as well as the characteristics of the ADS.

This thesis contributes to advancing ADS safety by developing a comprehensive SPI framework and implementing a proof of concept HSI module. We propose an architecture that integrates these components in a closed-loop process involving vehicle fleet data collection, cloud-based analysis, and targeted software updates. This framework enables the identification of areas for improvement and supports generating OpenSCENARIO files for reproducing and analyzing hazardous scenarios ad hoc. The findings from the experimental evaluation provide valuable insights into the performance and limitations of the SPI safety monitoring and HSI techniques, guiding the safe deployment and continuous improvement of ADS. This research ultimately paves the way for the widespread adoption of automated vehicles (AVs) in driving environments.

Optimizing Dataset Quality for Enhanced Machine Learning Performance

A Study on the Impact of Dataset Metrics

Bachelor thesis (2024) - E. Ünlüyurt (author) , Kubilay Atasu (mentor) , T.A. Akyıldız (graduation committee member) , Burcu Kulahcioglu Ozkan (graduation committee member)

With the increase of machine learning applications in our everyday lives, high-quality datasets are becoming necessary to train accurate and reliable models. This research delves into the factors that contribute to a high-quality dataset and examines how different dataset metrics ...

Beyond Traditional Lexing

Exploiting SIMD Instructions for Tokenizing C

Bachelor thesis (2024) - A. Bolfă (author) , D.G. Sprokholt (mentor) , S.S. Chakraborty (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member)

Over the past decades, Single Instruction, Multiple Data (SIMD) instructions have become commonplace in conventional hardware. Lexical analysis, the first stage of compilation, can take advantage of this by splitting its workload across sub-lexers that identify groups of tokens ...

Efficient Task Scheduling in Build Systems

Bachelor thesis (2024) - A. Khanna (author) , S.S. Chakraborty (mentor) , D.G. Sprokholt (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member)

Build systems are essential tools for compiling codebases of any complexity. In order to maximize performance, they use parallelism to complete multiple build steps simultaneously. In this thesis, we examine the effectiveness with which common build systems distribute work acro ...

Memory Layout Optimisation on Abstract Syntax Trees

Impact on Utilisation Speed During Type Checking and Code Generation Phases

Bachelor thesis (2024) - I.R.E. de Zwart (author) , D.G. Sprokholt (mentor) , Soham Chakraborty (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member)

In the field of software engineering, the speed of compilation plays a crucial role in enhancing development productivity. This thesis investigates the impact of optimising the memory layout of Abstract Syntax Trees (ASTs) on the performance of the type checking and code generati ...

Comparative Analysis of Linking Efficiency

Evaluating LLD and mold through Insights into Performance Metrics and Architectural Differences in Software Linking Processes

Bachelor thesis (2024) - A.M. Szymkowiak (author) , D.G. Sprokholt (mentor) , S.S. Chakraborty (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member)

This study examines the differences between two modern linkers, LLD and mold, focusing on their efficiency during software development. Although the linking process, which combines multiple object files into a single executable, typically occupies a minor fraction of the total co ...

Efficient Term-Rewriting Super-Optimisation

Specialising Rulesets to Reduce Time Requirements for Compiler Optimisation

Bachelor thesis (2024) - M.A. Ardman (author) , D.G. Sprokholt (mentor) , S.S. Chakraborty (mentor) , Burcu Kulahcioglu Ozkan (graduation committee member)

Term-rewriting super-optimisation during compilation uses rewrite rules in order to restructure a provided code expression into the optimal form, comparing different expressions using a cost function. To reduce the compilation time taken by term-rewriting, the ruleset can be opti ...

A Generic Translation from Case Trees to Eliminators

Master thesis (2024) - K.Z. Lieverse (author) , J.G.H. Cockx (mentor) , L.F.B. Escot (graduation committee member) , Burcu Külahçıoğlu Özkan (graduation committee member)

Dependently-typed languages allow one to guarantee correctness of a program by providing formal proofs. The type checkers of such languages elaborate the user-friendly high-level surface language to a small and fully explicit core language. A lot of trust is put into this elabora ...

AkkaRef: Actor model refactoring into typed actors

Master thesis (2024) - M. Zdanavičius (author) , Burcu Külahçıoğlu Ozkan (mentor) , Burcu Ozkan (graduation committee member) , R.R. Venkatesha Prasad (graduation committee member)

This thesis addresses the challenge of refactoring untyped actor systems into typed ones, particularly within the Scala ecosystem using the Akka framework. The actor model, with its message-passing architecture, offers a solution to concurrency and scalabi ...

Confidentiality-Preserving Collaborative Bayesian Networks

Master thesis (2023) - A.M. Mălan (author) , Y. Chen (mentor) , Jérémie Decouchant (mentor) , Thiago Guzella (mentor) , Burcu Ozkan (graduation committee member)

Effective large-scale process optimization in manufacturing industries requires close cooperation between different parties of human experts who encode their knowledge of related domains as Bayesian network models. For example, parties in the steel industry must collaboratively u ...

Liveness checking of Streamlined Blockchain Consensus

Master thesis (2023) - Y. Zhou (author) , Jérémie Decouchant (mentor) , Burcu Külahçıoğlu Özkan (graduation committee member) , Johan Pouwelse (graduation committee member)

Byzantine consensus protocols are designed to build resilient systems to achieve consensus under Byzantine settings, maintaining safety guarantees under any network synchrony model and providing liveness in partially or fully synchronous networks.
However, several Byzantine c ...