Optimizing ML Inference Queries Under Constraints