W. Sun | TU Delft Repository

Database as Runtime

Compiling LLMs to SQL for In-database Model Serving

Conference paper (2025) - W. Sun (author), Z. Li (author), R. Hai (author)

Deploying large language models (LLMs) often requires specialized hardware and complex frameworks, creating barriers for CPU-based environments with resource constraints. These systems, common in air-gapped or edge scenarios, lack support for maintenance due to security, budget, ...

Accelerating machine learning queries with linear algebra query processing

Journal article (2025) - Wenbo Sun (author), Asterios Katsifodimos (author), R. Hai (author)

The rapid growth of large-scale machine learning (ML) models has led numerous commercial companies to utilize ML models for generating predictive results to help business decision-making. As two primary components in traditional predictive pipelines, data processing, and model pr ...

Database is All You Need

Serving LLMs with Relational Queries

Conference paper (2025) - W. Sun (author), Z. Li (author), Vaishnav Srinidhi (author), R. Hai (author)

Large language models (LLMs) have become central to many applications, but their deployment often requires high-performance hardware, specialized libraries, and complex engineering, limiting accessibility for smaller organizations. Meanwhile, relational database systems (RDBMS) a ...

Amalur

The Convergence of Data Integration and Machine Learning

Journal article (2024) - Z. Li (author), W. Sun (author), Danning Zhan (author), Yan Kang (author), Y. Chen (author), Alessandro Bozzon (author), R. Hai (author)

Machine learning (ML) training data is often scattered across disparate collections of datasets, called data silos. This fragmentation poses a major challenge for data-intensive ML applications: integrating and transforming data residing in different ...

Optimizing ML Inference Queries Under Constraints

Conference paper (2023) - Ziyu Li (author), W. Sun (author), Rihan Hai (author), Alessandro Bozzon (author), A. Katsifodimos (author)

The proliferation of pre-trained ML models in public Web-based model zoos facilitates the engineering of ML pipelines to address complex inference queries over datasets and streams of unstructured content. Constructing an optimal plan for a query is hard, especially when constraints ...

Amalur

Data Integration Meets Machine Learning

Conference paper (2023) - R. Hai (author), Christos Koutras (author), A. Ionescu (author), Ziyu Li (author), Wenbo Sun (author), Jessie van Schijndel (author), Yan Kang (author), A. Katsifodimos (author)

Machine learning (ML) training data is often scattered across disparate collections of datasets, called data silos. This fragmentation poses a major challenge for data-intensive ML applications: integrating and transforming data residing in different sources demand a lot of manua ...

An Empirical Performance Comparison between Matrix Multiplication Join and Hash Join on GPUs

Conference paper (2023) - Wenbo Sun (author), Asterios Katsifodimos (author), R. Hai (author)

Recent advances in Graphics Processing Units (GPUs) have facilitated a significant performance boost for database operators, in particular, joins. It has been intensively studied how conventional join implementations, such as hash joins, benefit from the massive parallelism of GPU ...

Accelerating Machine Learning Queries with Linear Algebra Query Processing

Conference paper (2023) - Wenbo Sun (author), Asterios Katsifodimos (author), R. Hai (author)

The rapid growth of large-scale machine learning (ML) models has led numerous commercial companies to utilize ML models for generating predictive results to help business decision-making. As two primary components in traditional predictive pipelines, data processing, and model pr ...