A Unifying Framework for Reinforcement Learning and Planning