Routing Optimization for the Train Unit Shunting Problem using a Multi-Agent Deep Reinforcement Learning Framework