Deep Model Compression and Inference Speedup of Sum-Product Networks on Tensor Trains