Petter Ericson

Journal article (2)

2 records found

Helpful, harmless, honest?

Sociotechnical limits of AI alignment and safety through Reinforcement Learning from Human Feedback

Journal article (2025) - Adam Dahlgren Lindström (author) , Leila Methnani (author) , Lea Krause (author) , Petter Ericson (author) , I. Martinez de Rituerto de Troya (author) , Dimitri Coelho Mollo (author) , R.I.J. Dobbe (author)

This paper critically evaluates the attempts to align Artificial Intelligence (AI) systems, especially Large Language Models (LLMs), with human values and intentions through Reinforcement Learning from Feedback methods, involving either human feedback (RLHF) or AI feedback (RLAIF ...

Tracing Class and Capitalism in Critical AI Research

Journal article (2024) - Petter Ericson (author) , R.I.J. Dobbe (author) , Simon Lindgren (author)

This article explores the rapidly developing field of Critical AI Studies and its relation to issues of class and capitalism through a hybrid approach based on distant reading of a newly collected corpus of 300 full-text scientific articles, the creation of which is itself a firs ...