Belief Updating and Delegation in Multi-Task Human-AI Interaction

Evidence from Controlled Simulations

Conference Paper (2026)
Author(s)

Shreyan Biswas (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Alexander Erlei (University of Göttingen, TU Delft - Electrical Engineering, Mathematics and Computer Science)

Ujwal Gadiraju (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Web Information Systems
DOI related publication
https://doi.org/10.1145/3772318.3790775 Final published version
More Info
expand_more
Publication Year
2026
Language
English
Research Group
Web Information Systems
Article number
1042
Publisher
ACM
ISBN (electronic)
979-8-4007-2278-3
Event
2026 CHI Conference on Human Factors in Computing Systems, CHI 2026 (2026-04-13 - 2026-04-17), Barcelona, Spain
Downloads counter
7
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Large language models (LLMs) increasingly support heterogeneous tasks within a single interface, requiring users to form, update, and act upon beliefs about one system across domains with different reliability profiles. Understanding how such beliefs transfer across tasks and shape delegation is therefore critical for the design of multipurpose AI systems. We report a preregistered experiment (N = 240, 7,200 trials) in which participants interacted with a controlled AI simulation across grammar checking, travel planning, and visual question answering, each with fixed, domain-typical accuracy levels. Delegation was operationalized as a binary reliance decision - accepting the AI's output versus acting independently and belief dynamics were evaluated against Bayesian benchmarks. We find three main results. First, participants do not reset beliefs between tasks: priors in a new task depend on posteriors from the previous task, with a 10-point increase predicting a 3-4 point higher subsequent prior. Second, within tasks, belief updating follows the Bayesian direction but is substantially conservative, proceeding at roughly half the normative Bayesian rate. Third, delegation is driven primarily by subjective beliefs about AI accuracy rather than self-confidence, though confidence independently reduces reliance when beliefs are held constant. Together, these findings show that users form global, path-dependent expectations about multipurpose AI systems, update them conservatively, and rely on AI primarily based on subjective beliefs rather than objective performance. We discuss implications for expectation calibration, reliance design, and the risks of belief spillovers in deployed LLM-based interfaces.