On the road from Model-Based Dynamical Programming to Model-Free Reinforcement Learning