Generate an optimization plan for a model. The server resolves the model's latest catalog version size and delegates to
POST/api/v1/models/:model_id/optimize
Generate an optimization plan for a model. The server resolves the model's latest catalog version size and delegates to OptimizationService.get_optimization_plan_from_metadata(). The plan describes recommended quantization strategies, format targets, and expected size/accuracy tradeoffs per device profile.
Request
Responses
- 200
- default
Success
Error response