LM
Leila Methnani
2 records found
1
Helpful, harmless, honest?
Sociotechnical limits of AI alignment and safety through Reinforcement Learning from Human Feedback
This paper critically evaluates the attempts to align Artificial Intelligence (AI) systems, especially Large Language Models (LLMs), with human values and intentions through Reinforcement Learning from Feedback methods, involving either human feedback (RLHF) or AI feedback (RLAIF
...
MLOps for Cyber-Physical Production Systems
Challenges and Solutions
Machine Learning Operations (MLOps) involves software development practices for Machine Learning (ML), including data management, preprocessing, model training, deployment, and monitoring. While MLOps have received significant interest, much less work has been published addressin
...