Single non-streaming cloud inference. Server proxies to the configured LLM backend and returns the full response in one
POST/api/v1/inference
Single non-streaming cloud inference. Server proxies to the configured LLM backend and returns the full response in one shot.
Request
Responses
- 200
- default
Success
Error response