Changelog
What's new in InnerZero.
v0.1.7June 2026Latest
New
- Multi-language interface: use InnerZero in 26 languages. Pick yours in Settings and the whole app, its menus, and the assistant's replies follow, with right-to-left support for Arabic, Hebrew, Persian, and Urdu. A one-time upgrade keeps your saved memories searchable across every language, and it all works offline.
- Action Hub: an opt-in research and apply assistant on the Tasks page. With your own Apify key, Zero can gather sources from the web; with your saved job profile, it can help fill in and submit job applications. Every action is gated behind your approval, and the apply browser runs in a fresh, isolated session.
- Proxy support: behind a corporate or university proxy? Enter your HTTP or HTTPS proxy in Settings and InnerZero routes model downloads, cloud calls, and connector traffic through it. Local AI traffic is never proxied.
Improved
- Privacy egress guard: every outbound connection now passes through a single fail-closed guard, so Offline and Private modes hold even if a tool tries to reach out.
- Smaller, faster Windows install: dev-only tooling is excluded from the shipped runtime and the heavier components now download on first run.
Fixed
- macOS now opens cleanly. The v0.1.7 DMG is signed with Developer ID, notarised by Apple, and stapled, which fixes the "InnerZero is damaged and can't be opened" message some macOS users saw on v0.1.6. No right-click workaround is needed.
- Project-scoped memory isolation: memories saved under one project no longer surface in another project's context.
- Voice reliability: the text-to-speech button no longer hangs on a slow first load, and it now shows clear loading and stop states.
v0.1.6May 2026
New
- Proactive Assistant: schedule briefings, daily summaries, and reminders that arrive when you want them. Natural-language scheduling like "remind me at 5pm tomorrow", quiet hours, multi-channel notifications, and an optional Telegram bridge if you want briefings on your phone.
- Smart briefings: a higher-quality briefing mode that pulls from your memory, mail, and knowledge packs as needed instead of relying on a single prompt-and-response.
- Automation Specialist: route automation tasks to a small local model, your main assistant, or a dedicated cloud model based on what you want to optimise for (speed, quality, or privacy).
- Slash commands in chat: type /help to see what is available, /clear to reset the visible conversation, and more for quick actions without typing prompts.
- First-run hardware detection: InnerZero now tests your GPU on first run and picks the right model tier automatically. You can also override it manually in the setup wizard.
- Bundled Ollama upgraded to v0.22.1 with full GPU acceleration: modern NVIDIA via CUDA 13, Apple Silicon via Metal, AMD and cross-vendor via Vulkan, and Intel via OneAPI.
- AMD GPU users can now opt in to GPU acceleration in Settings. AMD support is conservative-by-default for this release while we collect real-world feedback.
- Single-instance lock: launching InnerZero a second time now focuses the existing window instead of opening a duplicate.
Improved
- Chat reliability for LM Studio backends: better handling of replies that drift from the expected response shape, and replies that contain emoji or non-ASCII characters now render correctly on Windows.
- Linux installer streamlined for modern NVIDIA (Turing onward) via CUDA 13, plus Vulkan for AMD and cross-vendor.
- Mac code-signing hygiene: cleaner Gatekeeper experience on first launch.
Fixed
- Bundled Ollama on Mac and Linux now resolves correctly. Users on previous versions whose bundled Ollama silently failed to start should see chat working out of the box on v0.1.6.
- LM Studio chat replies containing emoji or smart quotes no longer crash on Windows.
- LM Studio: when a model returns a response shape that does not match what InnerZero expects, the actual answer text is recovered where possible instead of showing a raw fallback.
- Several internal coverage gaps in the installer bundle that affected slash commands and other modules in installed builds.
Known limitations
- The macOS DMG ships unstapled in v0.1.6. The app is signed with Developer ID, runs under hardened runtime, and applies the necessary entitlements. On first launch with internet, Gatekeeper does a one-time online check with Apple and the app opens normally. If you are offline on first launch, right-click the app, select Open, then click Open again. Stapling returns in a future release.
- macOS minimum is now Sonoma 14. Bundled Ollama 0.22.1 no longer supports Monterey or Ventura.
- Pre-Turing NVIDIA Linux users on the bundled mode (GTX 9 series Maxwell, GTX 10 series Pascal, V100 Volta) should use Vulkan or install Ollama separately for full CUDA support.
- Installer sizes have grown vs v0.1.5 because of the Ollama 0.22.1 bundling. See the download page for current sizes per platform.
v0.1.5April 2026
New
- Calendar page with Month, Week, Day, and Agenda views. Click an empty slot to create an event. Drag events to reschedule.
- Two-way Google Calendar sync. Connect your Google account in Settings; events from Google appear locally and events you create at home publish back. Private events stay on your machine.
- Gmail integration (read-only). Sender, subject, and a short snippet of recent inbox emails so Zero can answer questions about your mail. Message bodies are never fetched or stored.
- Tasks page with a live queue. Kanban lanes, progress bars, ETAs, and resource coordination so you can see what Zero is doing in the background.
- Dashboard 7-day calendar widget with a next-up section.
- AI agency over your calendar. Zero can create, find, update, and delete events directly from chat with approval gates on writes.
- Source provenance on memories. Every memory now records where it came from (chat, voice, document, Gmail, calendar) and surfaces that source to the AI.
- Mac code signing with Developer ID and hardened runtime. The macOS installer is now signed by Summers Solutions Ltd.
- Windows installer hardened with Azure Trusted Signing and a deferred-swap auto-updater that resolves the WebView2 file-lock during in-app upgrades.
Improved
- Anthropic prompt caching is now active on Director calls, reducing cost on repeated cache-hit prompts within the 5-minute window.
- Calendar-aware memory. Time-sensitive questions surface upcoming 7-day events directly into Zero's context.
- Knowledge pack search quality. Two-phase title boosting, prose extraction, and question-prefix stripping produce cleaner answers.
- Faster voice shortcuts. Time, weather, calculator, dictionary, system info, and timer queries now respond in under two seconds.
- Privacy hardening for cloud features. Every cloud dispatch path (initial messages, retries, and multi-round agent loops) now routes through one privacy-blacklist chokepoint.
Fixed
- Windows in-app updater no longer fails on the WebView2 DLL file-lock. The new deferred-swap pattern applies the upgrade at next cold start.
- Archive page now shows archived memories correctly.
- Settings hover styling and tab consistency across all 9 tabs.
- Mac and Linux launches are now reliable: pywebview backend bindings ship with the correct platform markers.
- Memory system: sleep pipeline routes correctly on the LM Studio backend, preference-type memories reach the Director prompt, and project-scoped retrieval backfills bidirectionally.
v0.1.4April 2026
New
- Claude Opus 4.7 support. Use it with your own Anthropic API key in chat or with the coding specialist.
- Frontier model tier for datacenter-class hardware (256 GB+ RAM, 120 GB+ VRAM).
- Enthusiast coding model tier for high-end workstations.
- Four new coding models: Qwen3 Coder Next, DeepSeek Coder V2, Codestral, and CodeGemma.
- Tier switch preview shows disk space required before you commit to the change.
- Uninstall downloaded models individually, with protection on any model currently assigned to an active role.
- Theme redeem codes now work reliably end to end.
Improved
- Coding agent reliability on long-running tasks.
- Model downloads auto-resume if your connection drops mid-way.
- LM Studio voice model picker now has feature parity with the Ollama picker.
- Cloud account connection has better error handling and retry behaviour.
- Memory system correctness across projects and specialists.
Fixed
- Specialist now connects correctly to remote Ollama servers on your network.
- Coding model dropdown shows all installed compatible models.
- Coding agent parser no longer strips code fences when writing markdown files.
- Coding agent can read files it wrote earlier in the same run.
- Removed duplicate strategy content in cloud prompts.
- Preference memories reach the assistant again.
- Working state no longer leaks between sessions.
- Fact verification works on the LM Studio backend.
v0.1.3April 2026
New
- AI Specialists: delegate coding tasks to a specialist AI agent. Full file review and approve/reject before any changes are applied.
- LM Studio support: use LM Studio as an alternative local AI backend alongside Ollama. Switch instantly in Settings.
- Offline mode: completely block all outbound network requests with a single toggle. Nothing leaves your machine.
- Connection log: see every outbound request Zero makes, with destination, timing, and status.
- Privacy blacklist: define sensitive terms that are automatically scrubbed from all cloud messages before they leave your machine.
- My Privacy page: centralised privacy dashboard with mode selector, blacklist management, connection log, and data controls.
- Telegram remote access: control Zero from your phone via a Telegram bot. Encrypted token storage, chat ID whitelisting, and desktop chat mirroring.
- xAI Grok and Kimi (Moonshot) cloud providers. 7 cloud AI providers now supported.
- Neon Tokyo exclusive theme: cyberpunk purple and cyan with animated synthwave perspective grid. Unlock with a founder code from Discord.
- Theme unlock system: redeem exclusive codes for special themes.
- Costs page with currency selector (7 currencies), period filters, and per-request cost breakdown.
- Windows installer now signed by Summers Solutions Ltd via Azure Trusted Signing.
Improved
- Cloud voice now offers Standard mode (split reasoning and TTS), roughly 15x cheaper than Premium mode.
- Cloud token usage reduced by approximately 80% for chat messages through context optimisation.
- Choose different cloud models for Director and Specialist roles independently.
- Specialist agent memories are now processed separately during sleep, with dedicated fact extraction and cleanup.
- Cloud billing is now idempotent. Retried requests after timeouts will not be double-charged.
- Account tokens refresh automatically on expiry. No more manual re-login after sessions expire.
- Settings page loads significantly faster with lazy tab loading.
v0.1.2April 2026
New
- macOS support: .dmg installer with .app bundle (Intel and Apple Silicon)
- Linux support: AppImage for x86_64 with bundled Python runtime
- Auto-updater: checks for new versions on startup, one-click update with SHA256 verification
- GPU detection for NVIDIA, AMD (ROCm + HSA override), Intel Arc (oneAPI), and Apple Silicon (Metal)
- Vulkan toggle for GPU acceleration on non-NVIDIA hardware (experimental, manual only)
- Ollama mode persistence: bundled or system Ollama config saved from first setup, prevents model-not-found errors
- Discord community link in the sidebar
- System dependency notices for macOS and Linux on first launch
- GPU acceleration section in Settings with detected backend display
- Model location info in Settings for debugging
- CI/CD pipeline: GitHub Actions builds all three platforms in parallel on tag push
Improved
- Linux AppImage reduced from 2.7 GB to 356 MB (torch CPU-only install in CI)
- Setup wizard shows retry and skip buttons with troubleshooting guidance when model downloads fail
- Settings shows "No compatible GPU detected, CPU mode will be used" with manual tier override note
- Sleep subprocess uses python.exe instead of pythonw.exe for reliable .pyc execution
- Auto-sleep defaults to off on new installs
- Consent modal checkbox text alignment improved
Fixed
- Model not found (404) error when using system Ollama alongside InnerZero
- Sleep pipeline crash in installed app (load_dotenv .pyc incompatibility)
- Sleep subprocess failing silently (pythonw.exe, missing PYTHONPATH)
- Version display showing "?" in Settings before background check completes
- Discord sidebar showing raw template literal instead of icon
- fetch_url Unicode crash on Windows (arrow character in print statement)
- Auto-sleep toggle snapping back to off when dropdown closes on click
v0.1.1April 2026
New
- Remote Ollama support: connect to Ollama running on another machine on your network
- Unrestricted Mode with full age verification and consent flow
- Automated memory backup system (weekly, up to 10 backups)
- Version update gate: older installs prompted to update on launch
- Business licence validation for commercial users
- Star ratings on AI responses (1-5 stars, replaces thumbs up/down)
- Custom alarm sounds: pick your own audio file for alarms
- Memory import: paste text or upload files to teach Zero new facts
Improved
- Clock system redesigned: preset timer pills, phone-style AM/PM alarm picker
- Settings reorganised from 11 tabs to 9 (cleaner, less overwhelming)
- Memory page replaced by Settings Memory tab (Core Facts, Recent Memories, Archive)
- Sleep progress estimates are more accurate and never increase during a run
- Auto-sleep with configurable idle timer (15-60 minutes)
- Wake buttons in chat and status bar during sleep
Fixed
- Dictionary Unicode crash on Windows (IPA phonetic symbols)
- Weather API updated (Open-Meteo deprecated old endpoint)
- Voice name confusion (reordered confidence checks)
- Voice math shortcuts ("seven times seven" now uses calculator, not the AI model)
v0.1.0April 2026
New
- First public release of InnerZero
- AI chat with streaming responses and cancel support
- Full voice mode: speech recognition, natural TTS, voice shortcuts
- 30+ built-in tools (web search, calculator, timers, notes, file operations, and more)
- Persistent memory system with local storage
- Sleep/reflection pipeline for overnight memory processing
- Knowledge packs (offline Wikipedia)
- 5 themes (Dark Zero, Light Zero, Classic Carbon, Soft Pink, Dark Teal)
- Cloud mode with BYO API keys (DeepSeek, OpenAI, Anthropic, Google AI, Qwen)
- Cloud voice (OpenAI Audio with 13 ChatGPT voices)
- AI personality system (Professional, Friendly, Concise, or custom)
- Screen automation (read screen, click, type, scroll other apps)
- Document upload and Q&A (.txt, .md, .pdf, .docx, .xlsx, .csv)
- Project system for organising work and scoping memory
- Hardware auto-detection and model selection
- Setup wizard with guided first-run experience
- Chat session persistence across restarts