Multi-Modal End-to-End Learning for Real-Time Monitoring of Sustainable Energy Systems