Reinforcement Learning for Minimizing the Total Waiting Time of Passengers in a Taxi Dispatch Problem