Action-driven Reinforcement Learning for Improving Localization of Brace Sleeve in Railway Catenary