POST /api/v1/models/:model_id/optimize

Generates an optimization plan for a model. The server resolves the size of the model's latest catalog version and delegates to OptimizationService.get_optimization_plan_from_metadata(). The plan describes recommended quantization strategies, format targets, and the expected size/accuracy tradeoffs for each device profile.
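
As a rough illustration of calling this endpoint, the sketch below builds the POST request with Python's standard library. Only the path and the HTTP method come from this page. The host (`https://example.invalid`), the helper names `build_optimize_request` and `fetch_optimization_plan`, the empty request body, and the assumption that the response is JSON are all hypothetical, and any authentication the server requires is not shown.

```python
import json
import urllib.request
from urllib.parse import quote

# Hypothetical host: replace with the real server address.
BASE_URL = "https://example.invalid"


def build_optimize_request(model_id: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request for the optimize endpoint."""
    # Percent-encode the model ID so characters such as "/" or spaces
    # cannot alter the path structure.
    path = f"/api/v1/models/{quote(model_id, safe='')}/optimize"
    return urllib.request.Request(
        url=base_url.rstrip("/") + path,
        method="POST",
        # Assumption: the server returns JSON. This page does not say so.
        headers={"Accept": "application/json"},
    )


def fetch_optimization_plan(model_id: str, base_url: str = BASE_URL, timeout: float = 10.0):
    """Send the request and decode the body, assuming a JSON response."""
    req = build_optimize_request(model_id, base_url)
    # urlopen raises urllib.error.HTTPError for non-2xx status codes.
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Separating request construction from sending keeps the URL logic easy to check without network access. The shape of the returned plan is not documented here, so the sketch returns the decoded body as-is rather than assuming field names.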

Request

Responses

200
default

Success
