Locality in space and time for data-efficient visual recognition