Segmented Active Reward Learning