S.N.R. Buijsman
Please Note
30 records found
1
Machine Learning Models as Mathematics
Interpreting Explainable AI in Non-causal Terms
We would like to have a wide range of explanations for the behaviour of machine learning systems. However, how should we understand these explanations? Typically, attempts to clarify what an explanations for questions such as ‘why am I getting this output for these inputs?’ have been approached from the philosophy of science, through an analogy with scientific (and often causal) explanations. I show that ML systems are best thought of as noncausal, specifically mathematical objects. We should therefore interpret these explanations differently, through analogy with mathematical explanations. I show that this still allows us to use much of the same theoretical apparatus, and argue that the asymmetry of many of the standard ML explanations can be accounted for in virtue of the link these systems have with concrete implementations.
Inauthentic Value Shifts
More than Manipulation
Artificial Intelligence (AI) in healthcare holds transformative potential but faces critical challenges in ethical accountability and systemic inequities. Biases in AI models, such as lower diagnosis rates for Black women or gender stereotyping in Large Language Models, highlight the urgent need to address historical and structural inequalities in data and development processes. Disparities in clinical trials and datasets, often skewed toward high-income, English-speaking regions, amplify these issues. Moreover, the underrepresentation of marginalized groups among AI developers and researchers exacerbates these challenges. To ensure equitable AI, diverse data collection, federated data-sharing frameworks, and bias-correction techniques are essential. Structural initiatives, such as fairness audits, transparent AI model development processes, and early registration of clinical AI models, alongside inclusive global collaborations like TRAIN-Europe and CHAI, can drive responsible AI adoption. Prioritizing diversity in datasets and among developers and researchers, as well as implementing transparent governance will foster AI systems that uphold ethical principles and deliver equitable healthcare outcomes globally.
Importance: Artificial intelligence (AI) presents transformative opportunities to address the increasing challenges faced by health care systems globally. Particularly, in data-rich environments, such as intensive care units (ICUs), AI could assist in enhancing clinical decision-making, streamline workflows, and improve patient outcomes. Despite these promising applications, the practical implementation of AI in clinical settings remains limited. Objective: To systematically evaluate AI system operationalization in the ICU, focusing on the AI field's progress over time, technical maturity, and risk of bias. Evidence Review: In this systematic review, 5 databases (Embase, MEDLINE ALL, Web of Science Core Collection, Cochrane Central Register of Controlled Trials, and Google Scholar) were searched for studies published from July 28, 2020, to June 10, 2024. Eligible studies evaluated AI applications designed for use within ICUs for adults (aged ≥16 years) and used data collected during ICU stays. Two reviewers independently screened titles and abstracts, with a third reviewer resolving disagreements. Data extraction included AI application aims, dataset origins, technology readiness level (TRL) categorization, and the use of reporting standards. Risk of bias was assessed using the PROBAST (Prediction Model Study Risk of Bias Assessment Tool). Findings: Of 17 401 screened records, 1263 studies met the inclusion criteria. A total of 936 studies (74% of all studies) were classified as TRL 4 or below, indicating early-stage development or initial validation. Among these, 447 (37%) used internal datasets, 562 (46%) used MIMIC (Medical Information Mart for Intensive Care) datasets (I-IV), and 78 (6%) used the open-source eICU Collaborative Research Database. External validation (TRL 5) was achieved by 24% of studies. Only 25 (2%) progressed to clinical integration (TRL≥6), with no studies reaching full implementation (TRL 9). Although approximately half of generative AI models reached a higher TRL (14 [47%] with TRL 5), none reached clinical integration. Additionally, only 207 studies (16%) referenced reporting standards, with adherence modestly increasing from 14% in 2021 to 23% in 2024. High risk of bias was identified in 581 of 1103 studies (53%), primarily due to methodologic shortcomings in the analysis domain. Conclusions and Relevance: Despite substantial growth in AI research within intensive care medicine in recent years, the transition from development to clinical implementation still remains limited and has made little progress over time. A paradigm shift is urgently required in the medical literature-one that moves beyond retrospective validation toward the operationalization and prospective testing of AI for tangible clinical impact.
Autonomy by Design
Preserving Human Autonomy in AI Decision-Support
AI systems increasingly support human decision-making across domains of professional, skill-based, and personal activity. While previous work has examined how AI might affect human autonomy globally, the effects of AI on domain-specific autonomy—the capacity for self-governed action within defined realms of skill or expertise—remain understudied. We analyze how AI decision-support systems affect two key components of domain-specific autonomy: skilled competence (the ability to make informed judgments within one's domain) and authentic value-formation (the capacity to form genuine domain-relevant values and preferences). By engaging with prior investigations and analyzing empirical cases across medical, financial, and educational domains, we demonstrate how the absence of reliable failure indicators and the potential for unconscious value shifts can erode domain-specific autonomy both immediately and over time. We then develop a constructive framework for autonomy-preserving AI support systems. We propose specific socio-technical design patterns—including careful role specification, implementation of defeater mechanisms, and support for reflective practice—that can help maintain domain-specific autonomy while leveraging AI capabilities. This framework provides concrete guidance for developing AI systems that enhance rather than diminish human agency within specialized domains of action.
The Role of Digital Literacy in Maintaining Autonomy in AI Decision-Support
Balancing the Burdens
Integrating AI systems into workflows risks undermining the competence of the people supported by them, specifically due to a loss of meta-cognitive competence. We discuss a recent suggestion to mitigate this through better uncertainty quantification. While this is certainly a step in the right direction, there is a question whether users are sufficiently supported to engage in critical reflection with literacy and tools alone. We therefore suggest that socio-technical system design focused on the role of AI systems is crucial to preserving autonomy, even when supported by uncertainty quantification.
Is Meaningful Human Control Over Personalised AI Assistants Possible?
Ethical Design Requirements for The New Generation of Artificially Intelligent Agents
Responsible artificial intelligence in healthcare
A systematic review on the use of ethical principles in the development and deployment of artificial intelligence
As hospitals increasingly adopt artificial intelligence (AI) to manage rising patient volumes, workforce shortages and healthcare costs, concerns about ethical implementation have become prominent. This systematic review aims to assess how hospital-focused AI literature addresses the WHO’s six ethical AI principles—autonomy; well-being and safety; transparency and explainability; responsibility and accountability; inclusiveness and equity; and responsiveness and sustainability.
Methods and analysis
A systematic review (PROSPERO registration: CRD42022347871) was conducted by searching Embase, MEDLINE ALL, Web of Science and the Cochrane Central Register of Controlled Trials from inception to December 2023, supplemented by Google Scholar. English-language studies describing AI (machine learning, deep learning, predictive analytics) relevant to inpatient settings and referencing at least one WHO principle were included. Two reviewers independently screened titles, abstracts and full texts, extracting data on publication year, country, study design, AI type, technology readiness level and ethical considerations. Discrepancies were resolved by consensus.
Results
Of 4770 unique records, 673 were included. Most (83%) originated from high-income countries, with publication volume rising sharply after 2021. Of these, 558 (83%) addressed at least one WHO principle in depth, most frequently inclusiveness and equity (49%), transparency and explainability (45%) and autonomy (42%). Well-being and safety (26%) and responsibility and accountability (29%) were less frequently covered, while responsiveness and sustainability (6%) was rarely explored. Among 44 studies developing AI applications with technology readiness levels 1–6, ethical principles were acknowledged but rarely operationalised.
Conclusion
Hospital-based AI research demonstrates increasing attention to ethical principles but lacks comprehensive application, particularly regarding sustainability. High-income countries dominate this discourse, underscoring the need for broader global engagement. To achieve equitable, safe and sustainable AI in clinical practice, clearer operational guidance and more inclusive collaboration is warranted. ...
As hospitals increasingly adopt artificial intelligence (AI) to manage rising patient volumes, workforce shortages and healthcare costs, concerns about ethical implementation have become prominent. This systematic review aims to assess how hospital-focused AI literature addresses the WHO’s six ethical AI principles—autonomy; well-being and safety; transparency and explainability; responsibility and accountability; inclusiveness and equity; and responsiveness and sustainability.
Methods and analysis
A systematic review (PROSPERO registration: CRD42022347871) was conducted by searching Embase, MEDLINE ALL, Web of Science and the Cochrane Central Register of Controlled Trials from inception to December 2023, supplemented by Google Scholar. English-language studies describing AI (machine learning, deep learning, predictive analytics) relevant to inpatient settings and referencing at least one WHO principle were included. Two reviewers independently screened titles, abstracts and full texts, extracting data on publication year, country, study design, AI type, technology readiness level and ethical considerations. Discrepancies were resolved by consensus.
Results
Of 4770 unique records, 673 were included. Most (83%) originated from high-income countries, with publication volume rising sharply after 2021. Of these, 558 (83%) addressed at least one WHO principle in depth, most frequently inclusiveness and equity (49%), transparency and explainability (45%) and autonomy (42%). Well-being and safety (26%) and responsibility and accountability (29%) were less frequently covered, while responsiveness and sustainability (6%) was rarely explored. Among 44 studies developing AI applications with technology readiness levels 1–6, ethical principles were acknowledged but rarely operationalised.
Conclusion
Hospital-based AI research demonstrates increasing attention to ethical principles but lacks comprehensive application, particularly regarding sustainability. High-income countries dominate this discourse, underscoring the need for broader global engagement. To achieve equitable, safe and sustainable AI in clinical practice, clearer operational guidance and more inclusive collaboration is warranted.
The advances in machine learning (ML)-based systems in medicine give rise to pressing epistemological and ethical questions. Clinical decisions are increasingly taken in highly digitised work environments, which we call artificial epistemic niches. By considering the case of ML systems in life-critical healthcare settings, we investigate (1) when users’ reliance on these systems can be characterised as epistemic dependence and (2) how this dependence turns into what we refer to as harmful epistemic dependence of clinical professionals on medical ML. The latter occurs when the impossibility of critically assessing the soundness of a system’s output in situ implies a moral obligation to comply with its recommendation since a failure to do so constitutes a moral risk that cannot be justified then and there. We analyse the epistemic and moral consequences of harmful epistemic dependence on the status of medical professionals. We conclude by assessing how a suitable design of the epistemic niche can address the problem.
Transparency for AI systems
A value-based approach
Machine learning techniques are driving — or soon will be driving — much of scientific research and discovery. Can they function as models similar to more traditional modeling techniques in scientific contexts? Or might they replace models altogether if they deliver sufficient predictive accuracy? These questions cut across at least two types of applications of machine learning models. First, machine learning models are used to study the brain, where neural networks might represent aspects of neural activity. The principal question here is: Can the uses of neural networks provide scientific explanations and models for neuroscience? Second, machine learning models are applied in science more broadly, where representational links are less clear. What are the epistemic implications of machine learning in those areas? Can they replace more traditional scientific models of phenomena, or is mere predictive accuracy sufficient? These two strands are finally brought together to create an overview of the (epistemic) role machine learning can play in scientific modeling.
Opening the Analogical Portal to Explainability
Can Analogies Help Laypeople in AI-assisted Decision Making?
Concepts are an important construct in semantics, based on which humans understand the world with various levels of abstraction. With the recent advances in explainable artificial intelligence (XAI), concept-level explanations are receiving an increasing amount of attention from the broad research community. However, laypeople may find such explanations difficult to digest due to the potential knowledge gap and the concomitant cognitive load. Inspired by prior work that has explored analogies and sensemaking, we argue that augmenting concept-level explanations with analogical inference information from commonsense knowledge can be a potential solution to tackle this issue. To investigate the validity of our proposition, we first designed an effective analogy-based explanation generation method and collected 600 analogy-based explanations from 100 crowd workers. Next, we proposed a set of structured dimensions for the qualitative assessment of such explanations, and conducted an empirical evaluation of the generated analogies with experts. Our findings revealed significant positive correlations between the qualitative dimensions of analogies and the perceived helpfulness of analogy-based explanations, suggesting the effectiveness of the dimensions. To understand the practical utility and the effectiveness of analogybased explanations in assisting human decision-making, we conducted a follow-up empirical study (N = 280) on a skin cancer detection task with non-expert humans and an imperfect AI system. Thus, we designed a between-subjects study spanning five different experimental conditions with varying types of explanations. The results of our study confirmed that a knowledge gap can prevent participants from understanding concept-level explanations. Consequently, when only the target domain of our designed analogy-based explanation was provided (in a specific experimental condition), participants demonstrated relatively more appropriate reliance on the AI system. In contrast to our expectations, we found that analogies were not effective in fostering appropriate reliance. We carried out a qualitative analysis of the open-ended responses from participants in the study regarding their perceived usefulness of explanations and analogies. Our findings suggest that human intuition and the perceived plausibility of analogies may have played a role in affecting user reliance on the AI system. We also found that the understanding of commonsense explanations varied with the varying experience of the recipient user, which points out the need for further work on personalization when leveraging commonsense explanations. In summary, although we did not find quantitative support for our hypotheses around the benefits of using analogies, we found considerable qualitative evidence suggesting the potential of high-quality analogies in aiding non-expert users in their decision making with AI-assistance. These insights can inform the design of future methods for the generation and use of effective analogy-based explanations.
ChatGPT is a powerful language model from OpenAI that is arguably able to comprehend and generate text. ChatGPT is expected to greatly impact society, research, and education. An essential step to understand ChatGPT’s expected impact is to study its domain-specific answering capabilities. Here, we perform a systematic empirical assessment of its abilities to answer questions across the natural science and engineering domains. We collected 594 questions on natural science and engineering topics from 198 faculty members across five faculties at Delft University of Technology. After collecting the answers from ChatGPT, the participants assessed the quality of the answers using a systematic scheme. Our results show that the answers from ChatGPT are, on average, perceived as “mostly correct”. Two major trends are that the rating of the ChatGPT answers significantly decreases (i) as the educational level of the question increases and (ii) as we evaluate skills beyond scientific knowledge, e.g., critical attitude.