Self-supervised Audio-reactive Music Video Synthesis