A Schedule That Survives 429s: Backoff and Jitter for Agent Automation
Run agents in parallel and rate-limit 429s can cascade until everything dies. Here is how to design exponential backoff and jitter so the retries themselves don't create new congestion, from an indie developer's automation setup.
When a Timed-Out Unattended Agent Leaves a Half-Written File Behind
When a scheduled agent gets killed on timeout, it can leave a half-written file that silently poisons the next stage. Here is the atomic write, stale-temp cleanup, and post-write content assertion I use to keep unattended pipelines from breaking.
Don't Lose Failed Agent Jobs: Designing a Dead-Letter and Requeue Path
Scheduled agents fail silently overnight and the work simply vanishes. Here is how to catch those failures with a dead-letter store and a staged requeue, drawn from running four sites on autopilot as an indie developer.
Making Managed Agent Batches Safe to Re-run: Idempotency and Checkpoints
Running overnight batches on the Antigravity 2.0 Managed Agents API makes recovery from partial failure unavoidable. Starting from a duplicate-post incident, I share the implementation of idempotency keys, a checkpoint store, and resume logic, with real numbers from solo operations.
When Your Antigravity Agent Eval Gate Keeps Flickering — Build Notes on Pass/Fail That Survives Non-Determinism
Same code, yet the eval passes in the morning and fails by noon. The first thing that breaks when you put agent evaluation into CI on Antigravity is the stability of the verdict. Here's how I separate noise from real regression and lock down pass/fail in code.
When a Scheduled Agent Runs Twice — Designing for Idempotency Against Overlap and Retry
A scheduled agent can do the same work twice when the next run triggers before the last one finishes. Here is a design with an overlap lock and an idempotency guard that survives mid-run failures, drawn from a double-publish incident I ran into in production.
Delegate the Undoable, Guard the Irreversible — Tiering Agent Autonomy by Reversibility
When you hand production work to an Antigravity agent, the thing that bites first isn't intelligence — it's whether the operation can be undone. Here is a design that sorts every operation into three reversibility tiers and routes each to auto-execution, checkpointed execution, or a human gate, with TypeScript implementations and real numbers from running six apps in parallel.
Rehearsing an Agent's Actions Before They Touch Production — Designing a Zero-Side-Effect Dry-Run Layer
Some accidents survive shadow mode and canaries: the very first time an agent touches an external API. This is the design and TypeScript implementation of a zero-side-effect dry-run layer you can bolt onto Antigravity's parallel agents, with the real numbers from running six sites autonomously.
Teaching Antigravity Agents to Learn from Failure — A Solo Developer's Loop for Reusing Failure History
Antigravity agents repeat the same mistakes because each session starts blank. A solo developer's six-month run with a structured failure log, a separate observer agent, and the side-effect of overfitting.
SRE for Antigravity Agents — Taming Probabilistic Systems with SLOs and Error Budgets
AI agents are probabilistic by nature, so running them in production without SRE thinking is risky. This guide shows how to apply SLIs, SLOs, and error budgets to Antigravity agents with working code and concrete operational decisions.
Solving the Reliability Problem in Vibe Coding — Antigravity Artifacts Verification Guide
Vibe coding is fast, but it produces code you can't fully trust. This guide shows how to use Antigravity's Artifacts feature and a simple verification loop to close the gap between 'it seems to work' and 'I know it works.'
Production-Ready AI Agent Design: Orchestration Patterns and Reliability
A comprehensive guide to running AI agents reliably in production. Covers orchestration design, error handling, human-in-the-loop patterns, and state management — everything you need to build agents that hold up in real-world conditions.