Learning scalable and efficient communication policies for multi-robot collision avoidance