Advanced RAG-LLM prototype AI on PubMed for Cardiac Health

Conference Paper (2025)
Author(s)

L.P.A. Simons (TU Delft - Interactive Intelligence)

P.K. Murukannaiah (TU Delft - Interactive Intelligence)

B.S. Han (Student TU Delft)

M.A. Neerincx (TU Delft - Interactive Intelligence)

Research Group
Interactive Intelligence
DOI related publication
https://doi.org/10.18690/um.fov.4.2025
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Interactive Intelligence
Pages (from-to)
265-279
ISBN (electronic)
978-961-286-998-4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Healthy lifestyle behaviours are effective in preventing and treating cardiovascular disease. However, the growing body of scientific literature and the prevalence of conflicting studies make it challenging for healthcare practitioners and patients to stay informed. Large Language Models (LLMs), combined with Retrieval-Augmented Generation (RAG), enable automated claim verification and summarization. We enhanced RAG-LLM with extra modules and evaluated performance. Inclusion-Criteria-based filtering of PubMed papers improved verdict performance. Next, for health claims, PICO-based (Population, Intervention, Comparison, Outcome) paper mapping and summarization improves transparency of evidence used for verdict generation (like ‘Berries reduce blood pressure’). Still, the RAG-LLM models we tested have biases towards positivity (too many foods deemed heart healthy) and neutrality (no clear direction). We discuss mechanisms at play and challenges on the route forward.