Octomil

Octomil is the production control plane for on-device AI. Use it to serve models locally, ship them to devices, and monitor rollout health, quality, and fleet behavior from one system.

Quickstart

Run your first model locally, or deploy one to a phone.

Download

Download Octomil on macOS, Windows, or Linux.

Cloud

Dashboard for fleet health, rollouts, routing, and model versions.

API Reference

View Octomil's API reference.

SDKs

Python SDK

Model registry, responses, rollouts, and control-plane operations.

iOS SDK

On-device inference, deployment, and updates with CoreML.

Android SDK

On-device inference, deployment, and updates with LiteRT and TFLite.

Browser SDK

Run models in the browser with WebGPU and WASM.

Community

GitHub

View source, report issues, and contribute.

Discord

Join the Octomil community.