Deep Reinforcement Learning for Orchestrating Cost-Aware Reconfigurations of vRANs