Skip to main content

Single non-streaming cloud inference. Server proxies to the configured LLM backend and returns the full response in one

POST /api/v1/inference

Single non-streaming cloud inference. Server proxies to the configured LLM backend and returns the full response in one shot.

Request

Responses

200
default

Success