ANTIGRAVITY LABJP
Articles/AI Tools
AI Tools/2026-05-05Beginner

VSCode 1.118: The Quiet Upgrade That Matters for AI-Assisted Development

VSCode 1.118 (April 2026) prepares for GitHub Copilot's shift to pay-as-you-go pricing with practical improvements: prompt caching via KV cache, better agent context management, and cleaner extension API permissions.

VSCodeGitHub Copilot4pay-as-you-goprompt caching2AI development toolsupdate2

Reading through the VSCode 1.118 release notes, I kept thinking "this is quietly significant." On the surface it's about adapting to Copilot's pricing change. But underneath, there are architectural-level efficiency improvements that directly affect how AI-assisted coding works day to day.

GitHub Copilot Moves to Pay-As-You-Go in June 2026

The context first: starting June 1, 2026, GitHub Copilot adds a consumption-based pricing option alongside existing monthly plans. VSCode 1.118 was released at the end of April with these changes built in ahead of time.

Pay-as-you-go means heavy users might pay more on high-usage months, while lighter users can pay less. The flat monthly subscription isn't going away — it's becoming one option among several.

Prompt Caching (KV Cache) Integration

The change with the most direct cost impact is deeper use of prompt caching.

VSCode 1.118 stores frequently referenced context in a KV cache when using Anthropic models (Claude). When the cache hits, cost drops to roughly one-tenth of a normal request.

This pays off when you're referencing the same files repeatedly, or when long system prompts are sent across multiple turns. Copilot users backed by Claude get this benefit automatically — no configuration required.

Improved Agent Context Management

When custom agent skills call multiple tools or load large reference documents, the chat context can balloon quickly. This caused a well-known degradation: the AI's responses becoming less coherent mid-session as the context fills with redundant tool results.

VSCode 1.118 compresses already-processed tool results before storing them in context. If you've experienced "the AI went weird in the middle of a long session," context pollution was likely part of the cause. This update reduces that.

Cleaner Extension API Permissions

The extension API got updated alongside the Copilot pricing changes, clarifying the permission model for third-party extensions that access Copilot's context.

This makes it easier for tools like database connectors and API testing extensions to integrate properly with Copilot's agent features — rather than working around ambiguous access boundaries.

What the Pay-As-You-Go Shift Means in Practice

Some usage patterns will cost more, some less, once consumption billing is active.

Patterns that may cost more:

  • Keeping Copilot open all day with constant real-time completions
  • Sending large code blocks for re-generation repeatedly

Patterns that may cost less:

  • Asking questions in ways that hit the prompt cache
  • Batching related work into fewer, longer conversations rather than many short ones

The shift from flat monthly pricing to consumption billing is a useful forcing function for thinking about efficiency. Being intentional about when to call the AI and how to structure conversations can make a real difference.

Looking back

VSCode 1.118 isn't about headline features. It's about the AI coding infrastructure maturing — prompt caching, context management, and pricing model adaptation all in one release. For developers who rely on Copilot daily, these are improvements you'll feel without necessarily noticing them explicitly.

Before June's pay-as-you-go transition, it's worth reviewing your own usage patterns once. Small adjustments in how you interact with Copilot can meaningfully change what you end up paying.

Share

Thank You for Reading

Antigravity Lab is ad-free, supported entirely by members like you. We publish practical guides daily with implementation code, benchmarks, and production-ready patterns. If you've found it useful, we'd love to have you on board.

  • Copy-paste ready implementation code
  • New advanced guides published daily
  • $5/mo or $10 for lifetime access
View Membership →

If you found this article helpful, a small tip ($1.50) would mean a lot to us. Your support helps keep this site ad-free and covers server and hosting costs.

Related Articles

AI Tools2026-03-18
AI Coding Agent Power Rankings 2026 — How Claude Code Became #1 in Just 8 Months
In-depth analysis of the 2026 AI coding tool landscape. Claude Code, Windsurf, Cursor, and GitHub Copilot compared with benchmarks, adoption data, and practical recommendations.
AI Tools2026-06-20
A Schedule That Survives 429s: Backoff and Jitter for Agent Automation
Run agents in parallel and rate-limit 429s can cascade until everything dies. Here is how to design exponential backoff and jitter so the retries themselves don't create new congestion, from an indie developer's automation setup.
AI Tools2026-06-18
Using the v2.1.4 Quota Screen for a Weekly Reckoning: Reading Used and Remaining to Run an Indie Budget
How to turn the used/remaining display in the reworked Antigravity v2.1.4 quota screen into a weekly reckoning instead of a gut feeling. Baseline recording, burn-rate math, and allocation across multiple projects, written as an indie-dev operating routine.
📚RECOMMENDED BOOKS
Build a Large Language Model (From Scratch)
Sebastian Raschka
LLM Dev
Prompt Engineering for LLMs
Berryman & Ziegler
Prompting
AI Engineering
Chip Huyen
AI Eng
* Contains affiliate links
See all →