POST /api/v1/models/:model_id/optimize

Generate an optimization plan for a model. The server resolves the size of the model's latest catalog version and delegates to OptimizationService.get_optimization_plan_from_metadata(). The plan describes recommended quantization strategies, format targets, and expected size/accuracy tradeoffs per device profile.
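
As a rough illustration of how a client might address this endpoint, the sketch below builds (but does not send) the POST request using only the Python standard library. The base URL, the example model id, the empty JSON body, and the headers are all assumptions made for illustration; the request schema and any authentication requirements are not shown on this page, and `build_optimize_request` is a hypothetical helper, not part of any SDK.

```python
from urllib.parse import quote
from urllib.request import Request


def build_optimize_request(base_url: str, model_id: str) -> Request:
    """Build (without sending) a POST request for the optimize endpoint."""
    # Percent-encode the id so it stays a single path segment.
    path = f"/api/v1/models/{quote(model_id, safe='')}/optimize"
    return Request(
        url=base_url.rstrip("/") + path,
        # Assumption: an empty JSON object; the real request schema is not documented here.
        data=b"{}",
        # Assumption: JSON in and out; add auth headers if the server requires them.
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )


# Hypothetical host and model id, used only to show the resulting URL.
req = build_optimize_request("https://example.invalid", "my-model")
print(req.get_method(), req.full_url)
```

To actually send the request, pass `req` to `urllib.request.urlopen` and decode the JSON response; the shape of the returned plan should be taken from the Responses section rather than inferred from this sketch.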

Request

Responses

Success