Unlocking the Flexibility of District Heating Pipeline Energy Storage with Reinforcement Learning