Antigravity Basics / 2026-06-12 / Intermediate

Measuring the Break-Even Point Between Google AI Pro and Ultra — 14 Days of Quota Data from Parallel Agent Runs

Is AI Ultra ($100/month, 5x the Pro limits) actually worth it? A Python harness that aggregates daily quota consumption from agent logs, 14 days of real measurements, and a formula that converts wait time into money to settle the question.

June's plan changes added a new option: AI Ultra at $100 per month, with roughly five times the usage limits of Pro.

As an indie developer who increasingly runs multiple Antigravity agents in parallel, I had a vague feeling of hitting the Pro ceiling toward the end of each month. But a vague feeling is a poor basis for an extra $80 a month. That money matters when you fund your own tools.

The exact quota numbers are not published anywhere. So the only way to know was to measure my own environment. I recorded fourteen days of consumption data before deciding anything.

This article documents the measurement harness, the numbers it produced, and the arithmetic that turns wait time into a break-even judgment.

The June 2026 Plan Lineup — What Is Public and What Is Not

First, the groundwork. As of June 2026, three options matter to individual developers.

AI Pro ($20/month): the standard plan, including Antigravity agent usage
AI Ultra ($100/month): the mid-tier added in June, with roughly 5x the Pro limits
Top-tier Ultra ($200/month): reduced from $250; limits are effectively a non-issue

Here is the catch: the "5x" multiplier is public, but the Pro baseline itself is not. Is it request count? Token volume? Are models weighted differently? Nobody outside Google knows. This opacity is precisely what forces plan decisions to run on gut feeling.

What is not published can still be observed from the outside. Agent execution history lives in local logs. That became my starting point.

Building a Small Measurement Harness

The first thing I built was a script that aggregates agent session logs by day.

In one sentence, the problem this code solves: it records, in a consistent daily format, when I ran agents, at what parallelism, and at what time of day I hit a rate limit.

Antigravity stores per-session execution history as JSONL files on the local machine. The location varies by setup, so the path is passed as an argument.

#!/usr/bin/env python3
"""agent_quota_tracker.py: daily aggregation of agent execution logs"""
import json
import re
import sys
from collections import defaultdict
from pathlib import Path

# Patterns that indicate a rate limit (extend as you observe new ones)
RATE_LIMIT_PATTERNS = [
    re.compile(r"rate.?limit", re.IGNORECASE),
    re.compile(r"quota.+exceeded", re.IGNORECASE),
    re.compile(r"resource.?exhausted", re.IGNORECASE),
]

def is_rate_limited(text: str) -> bool:
    return any(p.search(text) for p in RATE_LIMIT_PATTERNS)

def collect(log_dir: Path) -> dict:
    daily = defaultdict(lambda: {
        "sessions": 0,
        "agent_runs": 0,
        "max_parallel": 0,
        "rate_limit_hits": [],
    })
    # (start, end) pairs per day, pooled across all session files so that
    # runs from different sessions count toward the same day's parallelism
    runs_by_day = defaultdict(list)
    for jsonl in sorted(log_dir.rglob("*.jsonl")):
        for line in jsonl.read_text(encoding="utf-8").splitlines():
            try:
                ev = json.loads(line)
            except json.JSONDecodeError:
                continue  # skip blank or truncated lines
            if not isinstance(ev, dict):
                continue
            ts = str(ev.get("timestamp") or "")
            day = ts[:10] if ts else "unknown"
            kind = ev.get("type", "")
            if kind == "session_start":
                daily[day]["sessions"] += 1
            elif kind == "agent_run":
                daily[day]["agent_runs"] += 1
                start = ev.get("started_at", ts)
                end = ev.get("ended_at", ts)
                runs_by_day[day].append((start, end))
            elif kind == "error" and is_rate_limited(str(ev.get("message", ""))):
                daily[day]["rate_limit_hits"].append(ts[11:16])
    # Treat the max number of overlapping runs as the day's parallelism.
    # Timestamps are compared as strings, which assumes one uniform ISO 8601 format.
    for day, runs in runs_by_day.items():
        overlaps = [
            # A run is active at s1 if it started by then and has not ended yet;
            # the s2 == s1 case makes every run count itself, even without an end time
            sum(1 for s2, e2 in runs if s2 <= s1 and (s1 < e2 or s2 == s1))
            for s1, _ in runs
        ]
        daily[day]["max_parallel"] = max(overlaps)
    return daily

def main():
    if len(sys.argv) < 2:
        print("usage: agent_quota_tracker.py <log_dir>")
        sys.exit(1)
    daily = collect(Path(sys.argv[1]))
    for day in sorted(daily):
        s = daily[day]
        hits = ",".join(s["rate_limit_hits"]) or "-"
        print(f"{day}  runs={s['agent_runs']:3d}  "
              f"parallel_max={s['max_parallel']}  limited_at={hits}")

if __name__ == "__main__":
    main()

Two design decisions are worth explaining.

First, rate-limit detection works by pattern-matching error messages. Since the internal quota mechanics are invisible, the moment you hit the wall is the only fact observable from the outside. I built around that fact.
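To make the pattern matching concrete, here is a minimal sketch that runs the same three patterns over a few sample messages. The messages are invented for illustration and are not taken from real Antigravity logs, so check them against the wording your own logs actually contain.

```python
import re

# Same three patterns as in the harness above
RATE_LIMIT_PATTERNS = [
    re.compile(r"rate.?limit", re.IGNORECASE),
    re.compile(r"quota.+exceeded", re.IGNORECASE),
    re.compile(r"resource.?exhausted", re.IGNORECASE),
]

def is_rate_limited(text: str) -> bool:
    return any(p.search(text) for p in RATE_LIMIT_PATTERNS)

# Hypothetical messages (assumptions, not real log output)
samples = [
    "429 RESOURCE_EXHAUSTED: try again later",
    "Rate limit reached for this model",
    "Quota has been exceeded for today",
    "Connection reset by peer",
]
for msg in samples:
    print(f"{is_rate_limited(msg)!s:5}  {msg}")
```

The first three samples match and the last does not. One gap worth knowing about: `quota.+exceeded` requires at least one character between the two words, so a compact form such as "QuotaExceeded" would slip through unless you add a pattern for it.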

Second, parallelism is estimated from overlapping execution windows rather than counted directly. As you will see below, consumption pace correlated more strongly with parallelism than with raw run count.
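The overlap estimate can also be written as a sweep over start and end events, which is easier to reason about in isolation. The sketch below is an alternative formulation, not the harness's own code. It assumes every run has both a start and an end timestamp in a format `datetime.fromisoformat` accepts, and the sample runs are made up.

```python
from datetime import datetime

def max_parallel(runs: list[tuple[str, str]]) -> int:
    """Peak number of simultaneously active runs, given (start, end) ISO strings."""
    events = []
    for start, end in runs:
        events.append((datetime.fromisoformat(start), 1))   # run begins
        events.append((datetime.fromisoformat(end), -1))    # run ends
    # Sort by time; at equal times, ends (-1) sort before starts (+1),
    # so back-to-back runs are not counted as overlapping.
    events.sort(key=lambda e: (e[0], e[1]))
    current = peak = 0
    for _, delta in events:
        current += delta
        peak = max(peak, current)
    return peak

# Hypothetical runs: three overlap between 09:20 and 09:25
runs = [
    ("2026-06-01T09:00:00", "2026-06-01T09:30:00"),
    ("2026-06-01T09:10:00", "2026-06-01T09:40:00"),
    ("2026-06-01T09:20:00", "2026-06-01T09:25:00"),
    ("2026-06-01T09:40:00", "2026-06-01T10:00:00"),
]
print(max_parallel(runs))  # 3
```

Parsing into `datetime` objects avoids relying on string ordering, at the cost of failing loudly on timestamps it cannot parse. Runs with zero duration contribute nothing to the peak in this version.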

Register the script with cron or Antigravity 2.0's scheduled execution, run it nightly, and the record accumulates on its own.
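As written, the harness only prints to the terminal, so a nightly run needs somewhere to keep its results. One possible approach is to append each day's row to a CSV file. The sketch below assumes a dictionary shaped like the return value of `collect()`; the function name `append_daily` and the column layout are my own choices for illustration, not part of the original script.

```python
import csv
from pathlib import Path

FIELDS = ["day", "agent_runs", "max_parallel", "rate_limit_hits"]

def append_daily(csv_path: Path, daily: dict) -> int:
    """Append days not yet recorded in csv_path; return the number of rows added."""
    seen = set()
    if csv_path.exists():
        with csv_path.open(newline="", encoding="utf-8") as f:
            seen = {row["day"] for row in csv.DictReader(f)}
    new_days = [d for d in sorted(daily) if d not in seen]
    write_header = not csv_path.exists()
    with csv_path.open("a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        if write_header:
            writer.writeheader()
        for day in new_days:
            s = daily[day]
            writer.writerow({
                "day": day,
                "agent_runs": s["agent_runs"],
                "max_parallel": s["max_parallel"],
                # Join times with ";" so they stay in a single CSV cell
                "rate_limit_hits": ";".join(s["rate_limit_hits"]),
            })
    return len(new_days)
```

Because a day is written only once, this works best when the job runs after midnight and the day being recorded is already complete. A run during the working day would freeze that day's partial numbers.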

Fourteen Days of Data — How Parallelism Relates to Rate Limits

From late May into early June, I stayed on the Pro plan and recorded fourteen days — ten working days. These are observations from my environment; a different workload will produce different numbers, and I want to be upfront about that.

The headline findings:

Days I hit the limit: 7 out of 10 working days
Average time of hitting the limit: around 4 p.m. (work typically starts at 9 a.m.)
Average wait after hitting the limit: 47 minutes per day (measured until reset)
Consumption pace at 3 parallel agents: roughly 2.6x serial execution

That last number was the biggest takeaway. Running three agents in parallel does not triple consumption — presumably because agents spend stretches waiting on each other. Put another way, the "discount" from parallelism is only about 13%. You get 3x the work for 2.6x the consumption.
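The 13% figure follows directly from the two measured numbers. As a quick check, assuming (as the paragraph above does) that three agents deliver three times the work of one:

```python
def parallelism_discount(agents: int, consumption_ratio: float) -> float:
    """Fraction of quota saved per unit of work, relative to serial execution.

    Assumes `agents` parallel agents produce `agents` times the work.
    """
    cost_per_unit_work = consumption_ratio / agents
    return 1 - cost_per_unit_work

discount = parallelism_discount(agents=3, consumption_ratio=2.6)
print(f"Consumption per unit of work: {2.6 / 3:.3f}x serial")
print(f"Parallelism discount: {discount:.1%}")
```

Each unit of work costs about 0.867x what it would cost serially, which is a discount of roughly 13.3%.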

Meanwhile, on days I worked mostly serially (parallelism of 1–2), I did not hit the limit once during the measurement period. For my usage, the Pro ceiling sits at a level that serial work never reaches but that habitual 3-way parallelism exhausts by late afternoon.

For context, the work I delegate to agents is unglamorous: dependency updates across long-running apps, refactoring passes, release-note preparation for store reviews — the steady maintenance load of indie development. Even that modest workload produced these numbers at three agents.

Converting Wait Time into Money — the Break-Even Formula

With data in hand, the $80 difference becomes an equation.

The idea is simple: if monthly wait hours × the value of your time exceeds the monthly price difference, Ultra pays for itself.

# break_even.py: judging the Pro -> Ultra break-even point
PRICE_DIFF_USD = 80          # Ultra(100) - Pro(20)
WORK_DAYS = 20               # working days per month

wait_minutes_per_day = 47    # replace with your measured value
hourly_value_usd = 20        # your estimated hourly value

monthly_wait_h = wait_minutes_per_day * WORK_DAYS / 60
loss_usd = monthly_wait_h * hourly_value_usd

print(f"Monthly wait time: {monthly_wait_h:.1f} hours")
print(f"Wait time as monetary loss: ${loss_usd:,.0f}")
print(f"Plan price difference: ${PRICE_DIFF_USD}")
print("Verdict:", "Ultra pays off" if loss_usd > PRICE_DIFF_USD else "Stay on Pro")

Plugging in my measurements — 47 minutes of waiting per day, my time valued at $20 an hour — gives about 15.7 hours of monthly wait, or roughly $313 of loss against an $80 difference. The verdict came back clearly: Ultra pays off.
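The same equation can be solved the other way around, for the daily wait at which the two plans cost the same. This is a small sketch using the constants from the script above. The function name is my own, and the extra hourly values are example inputs rather than measurements.

```python
def break_even_wait_minutes(price_diff_usd: float,
                            hourly_value_usd: float,
                            work_days: int = 20) -> float:
    """Daily wait (in minutes) at which monthly loss equals the price difference."""
    return price_diff_usd * 60 / (hourly_value_usd * work_days)

for hourly in (10, 20, 40):  # example hourly values in USD
    minutes = break_even_wait_minutes(80, hourly)
    print(f"${hourly}/h -> break-even at {minutes:.0f} min of waiting per day")
```

At $20 an hour and 20 working days, the threshold is 12 minutes of waiting per day. The measured 47 minutes sits well above that, which is why the verdict is not close.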

One honest caveat: not all wait time is loss. If hitting the limit means you smoothly switch to review, design, or documentation work, your real loss might be less than half. I re-ran my numbers with a 0.5 coefficient, and Ultra still cleared the bar.

If you struggle to context-switch during forced waits, or your workflow involves watching agents finish before issuing the next instruction, keep the coefficient at 1.0 — that is closer to your reality.
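The coefficient fits into the calculation as one extra multiplier. The sketch below re-runs the numbers from this section at a few coefficient values. The parameter name `loss_coefficient` is my own label for the idea, and 0.25 is included only to show where the verdict changes.

```python
PRICE_DIFF_USD = 80  # Ultra(100) - Pro(20)

def monthly_loss_usd(wait_minutes_per_day: float,
                     hourly_value_usd: float,
                     work_days: int = 20,
                     loss_coefficient: float = 1.0) -> float:
    """Monthly cost of waiting; loss_coefficient is the share of wait time truly lost."""
    monthly_wait_h = wait_minutes_per_day * work_days / 60
    return monthly_wait_h * hourly_value_usd * loss_coefficient

for coeff in (1.0, 0.5, 0.25):
    loss = monthly_loss_usd(47, 20, loss_coefficient=coeff)
    verdict = "Ultra pays off" if loss > PRICE_DIFF_USD else "Stay on Pro"
    print(f"coefficient {coeff:.2f}: loss ${loss:,.0f} -> {verdict}")
```

With these inputs the loss is about $313 at 1.0 and $157 at 0.5, both above the $80 difference. The verdict flips only when the coefficient falls to roughly 0.26, that is, when about three quarters of the wait time is put to good use.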

A Decision Framework by Usage Pattern — and My Own Call

The measurements condense into three patterns.

Mostly serial (1–2 parallel): Pro is enough. In my observations, serial days never hit the limit. The $80 belongs elsewhere in your budget
Habitually 3+ parallel, unable to repurpose wait time: you have likely crossed the break-even point. Run the formula with your own measured values to confirm
Unsure: just install the measurement harness and record two weeks. Deciding after you have numbers costs you at most two weeks of the price difference — about $40

I decided to move to Ultra. What settled it was not the arithmetic itself but seeing the cost of interruption in numbers: every late-afternoon limit pushed unfinished work to the next morning, and the time spent picking up a broken thread the next day is not included in those 47 minutes.

I have kept measuring after the migration. If days start appearing where I hit the Ultra ceiling, I will treat that as a signal to redesign the workload itself. For techniques that reduce consumption rather than raise the ceiling, see practical strategies for AI credit optimization.

Your Next Action

If you do one thing today, make it this: point the measurement script at your own log directory and check whether yesterday's limited_at column is empty.

If it is empty, your current usage still has headroom on Pro. If timestamps appear, two weeks of recording is worth your while. The plan decision can comfortably wait until then.
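If you would rather not read the output by eye, the check can be scripted. The sketch below parses lines in the format the harness prints (`limited_at=` followed by comma-separated times, or `-` when empty). The sample report is invented for illustration.

```python
from datetime import date, timedelta

def limited_times(report_line: str) -> list[str]:
    """Extract the limited_at times from one line of harness output."""
    _, sep, value = report_line.partition("limited_at=")
    value = value.strip()
    if not sep or value in ("", "-"):
        return []
    return value.split(",")

def hits_for_day(report: str, day: str):
    """Return the list of limit times for `day`, or None if the day is missing."""
    for line in report.splitlines():
        if line.startswith(day):
            return limited_times(line)
    return None

# Hypothetical harness output (not real measurements)
sample_report = """\
2026-06-10  runs=  9  parallel_max=2  limited_at=-
2026-06-11  runs= 14  parallel_max=3  limited_at=15:58,16:40
"""
print(hits_for_day(sample_report, "2026-06-10"))  # []
print(hits_for_day(sample_report, "2026-06-11"))  # ['15:58', '16:40']

# In real use, look up yesterday's date instead of a fixed one
yesterday = (date.today() - timedelta(days=1)).isoformat()
```

An empty list means headroom on Pro for that day, and `None` means the harness recorded nothing for it.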

I hope this helps anyone weighing the same choice.
