Bus management using multi-agent reinforcement learning