Safe Optimization of Steel Manufacturing with Reinforcement Learning