Three commands. One terminal. Working agent in 10 minutes.
macOS · Resolve 20.0+
Zero-shot computer use (CU) figures it out from scratch on every call. Failures are silent. Retries compound bugs. Every call hits the frontier model. No number you can put in a contract.
The same procedure runs across 51,204 customer calls on Resolve 20.2. Failures return {ok, repair_hint, retry_safe}. Idempotent procedures retry. Non-idempotent ones don't. You ship the number to your own customers.
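A minimal sketch of what that retry rule could look like on the caller's side. The field names ok, repair_hint, and retry_safe come from the line above; the `Result` class, the `run_with_retry` helper, and the `max_attempts` parameter are hypothetical names invented for illustration, not the product's actual API.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class Result:
    """Hypothetical result shape mirroring {ok, repair_hint, retry_safe}."""
    ok: bool
    repair_hint: Optional[str] = None  # human-readable suggestion on failure
    retry_safe: bool = False           # True only when a retry cannot cause harm


def run_with_retry(procedure: Callable[[], Result], max_attempts: int = 3) -> Result:
    """Run a procedure, retrying only while the failure is marked retry_safe."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    result = procedure()
    attempts = 1
    # Stop on success, on a failure that is not safe to retry, or when the
    # attempt budget is spent. The last result is always returned, never hidden.
    while not result.ok and result.retry_safe and attempts < max_attempts:
        result = procedure()
        attempts += 1
    return result
```

Under these assumptions, an idempotent procedure that fails with retry_safe set is called again up to the attempt limit, while a non-idempotent one returns its first failure, with its repair_hint, straight to the caller.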
99.6% success across 51,204 calls on Resolve 20.2. 30-day uptime, p95 latency, and per-version success rates, all measured on real customer machines.
Drop per-procedure stats into your customer-facing reliability page. Numbers become yours to defend, backed by trajectory data and replay-deterministic audit trails.
Models will improve. "We know how to drive Resolve" erodes as an advantage. Verification contracts, idempotency, audit trails, and versioning compound.
Every developer shipping a desktop-touching agent runs the same monologue — "this will fail at demo time and I'll debug it under deadline." Structured failures, idempotency labels, versioned reliability, recovery docs — engineered for one thing: turning that dread into a typed function call with a published number.
If your agent product lives or dies on desktop reliability, we should talk.