Optimal Decision Tree Policies for Markov Decision Processes