Database as Runtime

Compiling LLMs to SQL for In-database Model Serving

Conference Paper (2025)
Author(s)

Wenbo Sun (TU Delft - Web Information Systems)

Ziyu Li (TU Delft - Web Information Systems)

Rihan Hai (TU Delft - Web Information Systems)

Research Group
Web Information Systems
DOI
https://doi.org/10.1145/3722212.3725093
Publication Year
2025
Language
English
Pages (from-to)
231-234
Publisher
ACM
ISBN (electronic)
9798400715648
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Deploying large language models (LLMs) often requires specialized hardware and complex frameworks, creating barriers for CPU-based environments with resource constraints. Such systems, common in air-gapped or edge scenarios, often lack maintained deep learning support due to security, budget, or technical constraints. To address this, we introduce TranSQL+, a compiler that translates LLM inference into SQL queries, enabling deployment on relational databases. By converting transformer operations into relational algebra, TranSQL+ generates vector-oriented SQL queries that leverage native database features (buffer management, indexing) to carry out computations without hardware accelerators or deep learning frameworks. A demonstration with the LLaMA3.1 8B model on DuckDB shows that relational databases can effectively serve LLMs, reducing deployment barriers and expanding access to advanced AI.
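The core idea, expressing linear algebra as relational algebra, can be illustrated with a matrix-vector product written as a join plus aggregation. The sketch below is a minimal, hypothetical example of the technique, not TranSQL+'s actual schema or generated SQL; it uses Python's built-in sqlite3 for portability, whereas the paper targets DuckDB. Table and column names (`W`, `x`, `row`, `col`, `idx`, `val`) are illustrative assumptions.

```python
import sqlite3

con = sqlite3.connect(":memory:")
cur = con.cursor()

# Coordinate-style storage: each weight and vector entry is one row.
cur.execute("CREATE TABLE W (row INTEGER, col INTEGER, val REAL)")
cur.execute("CREATE TABLE x (idx INTEGER, val REAL)")

# W = [[1, 2], [3, 4]], x = [10, 20]
cur.executemany("INSERT INTO W VALUES (?, ?, ?)",
                [(0, 0, 1.0), (0, 1, 2.0), (1, 0, 3.0), (1, 1, 4.0)])
cur.executemany("INSERT INTO x VALUES (?, ?)", [(0, 10.0), (1, 20.0)])

# y = W @ x: join on the shared (inner) dimension, multiply matching
# entries, and SUM per output row -- a plain relational aggregate.
rows = cur.execute("""
    SELECT W.row, SUM(W.val * x.val) AS y
    FROM W JOIN x ON W.col = x.idx
    GROUP BY W.row
    ORDER BY W.row
""").fetchall()
print(rows)  # [(0, 50.0), (1, 110.0)]
```

Chaining such queries layer by layer is, in spirit, how a transformer's matrix multiplications can be lowered to SQL, with the database's buffer manager standing in for explicit memory management.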