Learning state representation for deep actor-critic control