Cue

A native iOS AI assistant powered by Apple Intelligence, with persistent memory and an extensible skill system — no setup, no cloud

iOS Dev, Swift, AI

2025–2026

A port of the ZeroClaw agentic framework to iOS — built entirely in Swift, running on-device via Apple's Foundation Models, with no account and no mandatory network calls.

I use ZeroClaw on my laptop pretty much every day. It's an agentic framework with persistent memory, a skill system, and tool-calling built around whatever model you point it at. The frustrating part was that none of that existed on iOS. Every mobile AI app is just a chat interface that forgets everything when you close it.

Cue was built to fix that. It's a full port of the ZeroClaw runtime to Swift — same memory file format, same skill dispatch model, same agent loop — but running natively on iPhone using Apple's on-device Foundation Models. You open the app and you're immediately talking to an assistant that already knows who you are and what you were working on.

Rebuilt the ZeroClaw agent loop as a Swift actor using async/await and structured concurrency — same loop that runs on desktop, now running on-device on iOS 18
Full memory system: soul.md, memory.md, and agents.md in the same flat-file format as ZeroClaw, indexed with SQLite via GRDB
Skill system with bundled integrations for Calendar, Reminders, Files, Clipboard, and web search using iOS-native APIs
External provider support (OpenAI, Anthropic, Groq) as optional BYOK fallbacks when you want a more capable model

Contribution

iOS Architecture, Swift/SwiftUI,
Agent Runtime, Skill System

Memory System, Provider Abstraction, Security Policy

Stack

Swift 6, SwiftUI, Apple Foundation Models
GRDB (SQLite), EventKit, FileManager
AppIntents, Keychain, URLSession

Platform

iOS 18+ · iPhone 15 Pro or later
(Apple Intelligence for local inference)

Duration

Architecture & Design: 2 weeks
Agent Runtime: 4 weeks
Skills & Memory: 3 weeks
Ongoing

AgentActor.swift

AgentActor main loop

CueDispatcher tool routing

MemoryManager soul · memory · agents

SkillRegistry calendar · files · web

InferenceProvider apple · openai · anthropic

Every mobile AI app is just a chat window. Cue is an agent with memory that runs entirely on your device.

Setup time. Open the app, start chatting immediately.

Cue

Why can't my phone's AI assistant actually remember who I am?

The problem with mobile AI:

I was a daily ZeroClaw user on desktop. It had persistent memory so it knew my projects, my preferences, my ongoing context. It could use skills to actually do things — search the web, read files, create calendar events. Then I'd pick up my phone and every mobile AI app was back to square one. No memory, no skills, no continuity. It felt like a step backwards. The problem wasn't that on-device AI was weak — Apple Intelligence on iPhone 15 Pro is actually quite capable. The problem was that no one had built the infrastructure around it.

No Persistent Memory

Every conversation starts from scratch. The assistant has no idea what you were working on yesterday, what your preferences are, or what context matters to you. This makes mobile AI feel like a toy compared to what you can build on desktop.

No Agentic Capability

Mobile AI apps can chat but can't act. They can't read your calendar, create a reminder, search the web, or operate on your files. The hardware to do all of this exists on every modern iPhone — the software layer just hadn't been built.

Cloud Dependency

Most capable AI assistants send your queries to a server. That's a problem for sensitive conversations, for offline use, and for anyone who cares about where their data goes. Apple Intelligence runs on-device by default — it just needed the right runtime around it.

The core architectural decision was to keep the ZeroClaw memory file format exactly as-is. soul.md, memory.md, agents.md — same plain text format, same paths, just stored in Application Support/cue/memory/ instead of ~/.zeroclaw/. This meant the iOS app would be compatible with the desktop agent's memory files. You could theoretically sync them with iCloud Drive and have continuity across both.

I used GRDB.swift for the SQLite layer. It gives you a Swift-native interface and WAL mode out of the box for concurrent reads, which matters when the agent loop is running on one thread and the UI is reading memory on another. All the API keys go into Keychain with kSecAttrAccessibleWhenUnlockedThisDeviceOnly.

Porting the agent loop to Swift

The ZeroClaw agent loop is written in Node.js with an async event loop managing tool calls. Swift's actor model is actually a great fit for this — AgentActor is a Swift actor that handles the entire turn loop serially, with async/await for the inference calls and AsyncStream for streaming responses back to the UI.

Tool calling works via structured output: the model generates a ToolCallEnvelope JSON struct, and CueDispatcher routes it to the matching skill. The skill executes, returns a result string, and the loop re-prompts with the result. The hardest part wasn't the logic — it was getting Apple Foundation Models to produce reliable structured output for tool calls. Guided generation helped a lot.

The skill system and iOS permissions

iOS has a stricter permission model than desktop, which shaped how the skill system works. On desktop, ZeroClaw has a shell execution skill. On iOS, there's no equivalent — the sandbox won't allow it. Instead Cue uses Apple Shortcuts via AppIntents as the OS automation layer.

The bundled skills cover the things I actually want an assistant to do on my phone: add calendar events, set reminders, search the web, read and write files via the document picker, and manage the clipboard. Each skill is a CueSkill protocol conformance, registered with SkillRegistry on launch.

Reflection:

Cue has been one of the most satisfying projects I've worked on because it's solving a problem I actually have. Porting the ZeroClaw runtime to Swift taught me a lot about how Swift's concurrency model works in practice, and working with Apple's Foundation Models API gave me a much better sense of what on-device AI can actually do today.

Swift actors are excellent for agent loops:

The serialized execution model of Swift actors maps almost perfectly to what you want in an agent loop — one turn at a time, no concurrent mutations to state, async/await for I/O. The final architecture is clean in a way that the Node.js version wasn't.

The right language feature for a problem makes the code almost explain itself.

File format compatibility matters more than I expected:

Deciding early to keep the ZeroClaw memory file format exact was the right call. It unlocked cross-device compatibility for free and made the iOS app feel like a genuine extension of the desktop experience rather than a separate product.

Compatibility is a feature — don't break formats without a good reason.

On-device AI is ready for real use:

Apple Foundation Models on iPhone 15 Pro are genuinely capable for everyday assistant tasks. For most things users actually want — summarize this, create a reminder, what was I working on — on-device inference is fast, private, and good enough.

The infrastructure matters more than the model size for most use cases.