
Supported AI models

InnerZero runs open-source language models directly on your PC via Ollama or LM Studio. The main families are Qwen 3, Gemma 3, and gpt-oss, covering everything from a 1B voice model on a basic laptop to a 235B reasoning model on a datacenter-class workstation. Local AI is free forever: no subscription and no account required.

Cloud support is optional. If you want frontier-quality models for hard tasks, subscribe to InnerZero's managed cloud plans, or bring your own API keys for any of seven providers with zero markup. Keys stay encrypted on your machine.

Local models

Local models are installed automatically based on your hardware tier. They run entirely on your machine, offline if you want.
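As a rough illustration of what running entirely on your machine looks like with the Ollama backend, the sketch below sends a prompt to a locally running Ollama server over its HTTP API. It talks to Ollama directly rather than through InnerZero. The default address on port 11434 and the model tag qwen3:8b are assumptions about the local setup, and the helper names build_request and ask_local are hypothetical.

```python
import json
import urllib.request

# Assumed default address of a locally running Ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "qwen3:8b") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_local(prompt: str, model: str = "qwen3:8b") -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With Ollama running and the model already pulled, ask_local("Say hello in one short sentence.") returns the generated text. The request goes to localhost only, so it works without an internet connection.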

Qwen 3

Sizes: 4B, 8B, 14B, 30B, 32B, 235B
Minimum hardware: Basic tier
Backend: Ollama or LM Studio
Licence: Apache 2.0

Official page

Gemma 3

Sizes: 1B, 4B, 12B
Minimum hardware: Entry tier
Backend: Ollama or LM Studio
Licence: Gemma Terms of Use

Official page

gpt-oss

Sizes: 120B
Minimum hardware: Pro tier
Backend: Ollama or LM Studio
Licence: Apache 2.0

Official page
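The minimum-hardware entries on the cards above can be read as a simple lookup. The sketch below lists which families are available at a given tier. The tier ranking (Entry below Basic below Pro) is an assumption, since the page names the tiers without ordering them, and the function name families_for is hypothetical; InnerZero's actual selection logic is not shown here.

```python
# Assumed ranking, lowest to highest; the page names these tiers but does not
# order them, and other tiers may exist.
TIER_ORDER = ["Entry", "Basic", "Pro"]

# Minimum tier per family, copied from the cards above.
MIN_TIER = {"Gemma 3": "Entry", "Qwen 3": "Basic", "gpt-oss": "Pro"}


def families_for(tier: str) -> list[str]:
    """Return the families whose minimum tier is at or below the given tier."""
    rank = TIER_ORDER.index(tier)  # raises ValueError for an unknown tier
    return [
        name
        for name, minimum in MIN_TIER.items()
        if TIER_ORDER.index(minimum) <= rank
    ]
```

Under that assumed ranking, families_for("Basic") yields Gemma 3 and Qwen 3, while gpt-oss only appears at the Pro tier.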

How we choose supported models

Supported local families run reliably on consumer hardware, ship under permissive licences, are actively maintained by their upstream teams, and work cleanly through Ollama or LM Studio without custom glue. We prefer families with a range of sizes so the same recipe scales from a budget laptop up to a frontier workstation. Models that satisfy those constraints land in the default hardware tiers and get tested as new versions ship.

Cloud models

This section is optional. Use these models only if you choose to add an API key or subscribe to a managed cloud plan. Managed plans bundle multiple providers under one bill; bring-your-own-key (BYO) sends your prompts directly to the provider you configure.

DeepSeek

DeepSeek V3.2 (deepseek-chat)
Managed | BYO | Pricing
DeepSeek R1 (deepseek-reasoner)
Managed | BYO | Pricing

OpenAI

GPT-5.4 (gpt-5.4)
GPT-5.4 Mini (gpt-5.4-mini)
GPT-4.1 (gpt-4.1)
GPT-4o (gpt-4o)
o3 (o3)
o4 Mini (o4-mini)
GPT-5.3 Codex (gpt-5.3-codex)
GPT-5.3 Chat (gpt-5.3-chat)

Anthropic

Claude Opus 4.7 (claude-opus-4-7)
Managed | BYO | Pricing
Claude Opus 4.6 (claude-opus-4-6)
Managed | BYO | Pricing
Claude Sonnet 4.6 (claude-sonnet-4-6)
Managed | BYO | Pricing
Claude Haiku 4.5 (claude-haiku-4-5-20251001)
Managed | BYO | Pricing

Google

Gemini 2.5 Flash (gemini-2.5-flash)
Managed | BYO | Pricing
Gemini 2.5 Pro (gemini-2.5-pro)
Managed | BYO | Pricing

Qwen

Qwen Max (qwen-max)
Qwen Plus (qwen-plus)
Qwen Turbo (qwen-turbo)
Qwen3 Coder Plus (qwen3-coder-plus)

xAI

Grok 4.20 Reasoning (grok-4.20-reasoning)
Grok 4.20 (grok-4.20-non-reasoning)
Grok 4.1 Fast Reasoning (grok-4-1-fast-reasoning)
Grok 4.1 Fast (grok-4-1-fast-non-reasoning)
Grok 4 (grok-4)
Grok 3 (grok-3)

Kimi

Kimi K2.5 (kimi-k2.5)

Model support last reviewed 19 April 2026. Pricing is set by each provider and changes over time; follow the pricing links above for current rates. Local models are free forever.