Database is All You Need
Serving LLMs with Relational Queries
Wenbo Sun (TU Delft - Web Information Systems)
Ziyu Li (TU Delft - Team Arjan Mol)
Vaishnav Srinidhi (Student TU Delft)
Rihan Hai (TU Delft - Web Information Systems)
Abstract
Large language models (LLMs) have become central to many applications, but their deployment often requires high-performance hardware, specialized libraries, and complex engineering, limiting accessibility for smaller organizations. Meanwhile, relational database management systems (RDBMS) are widely used for their portability, efficiency, and native support for large-scale data operations. This paper presents TranSQL, a toolkit that enables transformer-based LLM inference within an RDBMS. By translating neural operations into SQL queries and representing model weights as relational tables, TranSQL leverages database features such as dynamic disk-to-memory data management and caching to reduce the hardware and engineering demands of serving LLMs. Using the LLaMA3 8B model, we demonstrate TranSQL's ability to implement attention layers, a KV-cache, and end-to-end text generation through SQL queries. TranSQL offers a cost-effective, portable, and scalable approach to making advanced AI technologies more accessible.
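To make the core idea concrete, the following is a minimal sketch (not the actual TranSQL implementation) of how a neural operation can be expressed relationally: a weight matrix stored as a coordinate-format table and a matrix-vector product, the building block of a linear layer, computed with a single join-and-aggregate query. The schema and table names here are illustrative assumptions, shown with SQLite for portability.

```python
import sqlite3

# Hypothetical schema for illustration: W(row, col, val) holds a weight
# matrix in coordinate format; x(idx, val) holds an input vector.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE W (row INTEGER, col INTEGER, val REAL)")
cur.execute("CREATE TABLE x (idx INTEGER, val REAL)")

# W = [[1, 2], [3, 4]], x = [10, 20]
cur.executemany("INSERT INTO W VALUES (?, ?, ?)",
                [(0, 0, 1.0), (0, 1, 2.0), (1, 0, 3.0), (1, 1, 4.0)])
cur.executemany("INSERT INTO x VALUES (?, ?)", [(0, 10.0), (1, 20.0)])

# One SQL query computes y = W @ x: join on the shared dimension,
# multiply element-wise, and SUM per output row.
y = cur.execute("""
    SELECT W.row, SUM(W.val * x.val) AS val
    FROM W JOIN x ON W.col = x.idx
    GROUP BY W.row
    ORDER BY W.row
""").fetchall()
print(y)  # -> [(0, 50.0), (1, 110.0)]
```

Stacking such queries (with nonlinearities and softmax expressed via SQL functions) is the general pattern the abstract describes; the database engine then handles buffering weight tables between disk and memory.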