by Michele Laurelli
The process of using a trained model to make predictions on new data.
Inference happens after training. Model processes input and generates output. Can be real-time or batch. Optimized for speed.
Real-time image classification
Batch prediction on 1000 samples
Model deployment