← All blog posts 8-10 min readgoogle

Gemini Managed Agents API Production Workflow in 2026: The Operator Checklist

What you'll learn
  • Identify the correct invocation pattern for Managed Agents API and migrate off the legacy outputs schema before June 8
  • Decide when to use Interactions API versus generateContent based on stability and SLA requirements
  • Configure multi-agent workflows in Antigravity 2.0 using dynamic subagents and scheduled tasks

_Last verified: 2026-05-30 — Google I/O 2026 Managed Agents launch_

The Gemini Managed Agents API (model: antigravity-preview-05-2026) gives operators a single API call to deploy stateful agents in Google-hosted Linux sandboxes. The workflow: define agent behavior in AGENTS.md and SKILL.md files, invoke via the Interactions API using client.interactions.create(), and set background=True for long-running tasks. One hard constraint before you start: the API is public preview with no SLA, and the Interactions API has a breaking schema change removing the outputs field on June 8, 2026.

Most post-I/O coverage treats this as a GA launch. It isn't. The model identifier — antigravity-preview-05-2026 — has "preview" in the name. Google's own documentation explicitly recommends generateContent for production stability. That gap between the keynote narrative and the docs is where operators get hurt. This checklist is for the builder who's already decided to evaluate Managed Agents and needs the honest production constraints, not the press-release summary.


TL;DR: 10-Point Production Checklist

Copy this to your team's runbook before your first deployment.

  1. Accept preview-tier risk — no API-level SLA; do not build SLA-committed products on this yet
  2. Migrate off `outputs` schema before June 8, 2026 — legacy field removed permanently; use steps schema now
  3. Add `Api-Revision: 2026-05-20` header — opts into new schema before forced cutover
  4. Migrate from Gemini CLI to Antigravity CLI before June 18, 2026 — Gemini CLI deprecated
  5. Define agent behavior in `AGENTS.md` + `SKILL.md` — versionable, config-file-first authoring
  6. Use `background=True` for tasks over 30 seconds — offloads inference loop server-side; requires status polling
  7. Pin Antigravity SDK version — SDK is in preview; breaking changes expected
  8. Use `generateContent` for any SLA-committed workload — official Google guidance is unambiguous
  9. Enterprise path requires private-preview access — no self-serve enterprise managed agents as of 2026-05-30
  10. Monitor `ai.google.dev/gemini-api/docs/changelog` weekly — active development; subscribe to the breaking-changes feed

What Changed at Google I/O 2026: Three APIs, Three Stability Tiers

``mermaid flowchart TD subgraph config["Agent Configuration"] F["AGENTS.md + SKILL.md\nVersioned behavior files"] end subgraph api["Gemini Managed Agents API ⚠️ Preview"] B["antigravity-preview-05-2026\nHeader: Api-Revision: 2026-05-20"] end subgraph exec["Execution"] C{background=True?} D["Antigravity Linux Sandbox\nServer-side inference loop\nsteps schema — not outputs"] E["Synchronous Response"] G["Poll /interactions/{id}/status"] end A["Your Code\nclient.interactions.create()"] --> B F --> B B --> C C -->|"Tasks > 30 s"| D C -->|"Short tasks"| E D --> G `` Alt: Flowchart showing the Gemini Managed Agents API invocation path — from AGENTS.md configuration through the preview API to synchronous or background Antigravity sandbox execution, with the steps schema replacing the deprecated outputs field.

Google I/O 2026 shipped three interconnected surfaces. Each has a different stability tier, and conflating them is the most common operator mistake.

Managed Agents API — Public Preview

The Gemini API changelog for May 19, 2026 reads: "Launched the Managed Agents in the Gemini API in public preview. This enables developers to build and deploy autonomous, stateful agents that run in secure, isolated Google-hosted Linux sandbox environments."

The model identifier is antigravity-preview-05-2026. As the Google Cloud blog describes it: "builders can now instantly spin up custom agents that reason, call tools, and execute code inside secure, Google-hosted remote environments using a single API call." The invocation surface is the Interactions API, not generateContent. The Managed Agents blog confirms: "You package your instructions, skills, and tools, POST them, and Gemini builds and runs the agent."

For enterprise customers, there is a separate path via the Gemini Enterprise Agent Platform — but it remains in private preview. No self-serve enterprise access exists as of this writing.

Stability tier: public preview. No SLA. No guaranteed schema stability.

Interactions API — Early Beta with Active Breaking Changes

The Interactions API overview is unusually candid: "The Interactions API is currently in an early beta stage... breaking changes may occur. For production workloads, you should continue to use the standard generateContent API."

The May 2026 breaking change moves the response schema from outputs to steps and restructures image generation config. Timeline:

DateEvent
May 6, 2026Breaking change announced
May 26, 2026New steps schema becomes default
June 8, 2026Legacy outputs field removed permanently

Any code still accessing interaction.outputs on June 8 silently breaks. The fix: set Api-Revision: 2026-05-20 header and migrate to interaction.steps.

The Interactions API does offer structural advantages. Community measurements show approximately 79% fewer input tokens for multi-turn sessions versus generateContent, because conversation history is managed server-side. But this is an architectural observation, not an SLA commitment. The official Interactions API blog is direct: "For standard production workloads, generateContent remains the primary path and will continue to be developed and maintained."

Stability tier: early beta. Breaking change cadence is active (~20 days' notice). No published latency or cost SLAs.

Antigravity 2.0 — Stable Desktop App, Preview SDK

Antigravity 2.0 launched May 19, 2026 at v2.0.1 and is now at v2.0.6 (May 22, 2026). It is a standalone desktop application — not an update to Antigravity IDE. As Sundar Pichai framed it at I/O: "Antigravity is expanding beyond the coding environment, turning it into a platform to develop and manage cohorts of autonomous AI agents." The two products coexist; v2.0.6 re-added IDE integration as an optional link.

The Antigravity SDK is in preview. The Antigravity CLI fully replaces Gemini CLI (deprecated June 18, 2026) and shares the same agent harness — any future harness improvements apply to both.

Stability tier: desktop app stable; SDK preview; CLI stable but migrate before June 18.


Should You Migrate to the Interactions API?

flowchart TD
    A[New agent workload] --> B{Need SLA guarantee?}
    B -->|Yes| C[Use generateContent\nstable, maintained, no breaking changes]
    B -->|No| D{Multi-turn sessions\nover 5 turns?}
    D -->|No| C
    D -->|Yes| E{Can tolerate beta\nbreaking changes?}
    E -->|No| C
    E -->|Yes| F{Task runtime\nover 30 seconds?}
    F -->|No| G[Interactions API\nclient.interactions.create\nno background flag]
    F -->|Yes| H[Interactions API\nbackground=True\npoll for status]
    G --> I[Pin Api-Revision: 2026-05-20\nMigrate outputs to steps before June 8]
    H --> I

When Interactions API makes sense: - Internal tooling, demos, or early-access products where preview-tier breakage is acceptable - Long-running agent tasks where background=True server-side offloading reduces client infrastructure overhead - Multi-turn agentic sessions where server-side history management materially reduces token costs

When to stay on `generateContent`: - Any product with a latency or uptime SLA committed to customers - Compliance environments requiring audit trails of stable API schema versions - Production batch workloads where a broken schema means silent data loss


Antigravity 2.0 Multi-Agent Routing Patterns

Antigravity 2.0 supports three routing primitives, documented in the I/O developer highlights blog: "dynamic subagents for parallelized workflows, scheduled tasks for background automation and ecosystem integrations across Google AI Studio, Android and Firebase."

Dynamic Subagents (Parallelized Workflows)

Each subagent specializes in a task domain and runs in parallel within a shared project context. A coordination agent merges outputs. Define specializations in AGENTS.md:

```markdown # AGENTS.md

database-agent Handles all database queries and schema migrations. Skills: [sql, postgres, migration]

frontend-agent Handles React component generation and styling. Skills: [react, typescript, css]

coordinator Merges outputs from database-agent and frontend-agent into a coherent PR. ```

Each agent gets its own SKILL.md defining tool access. The coordination layer invokes specializations via client.interactions.create() with the agent= parameter pointing to each role.

Scheduled Tasks (Background Automation)

Scheduled tasks work like cron jobs managed by Google's infrastructure. Define: task name, project, schedule expression, and prompt. Agents inherit project IAM permissions and appear in the project timeline.

```python # Antigravity SDK (preview) — pin version before using from antigravity import AntigravityClient

client = AntigravityClient() task = client.scheduled_tasks.create( name="daily-dependency-audit", project="my-project", schedule="0 9 *", # daily at 09:00 prompt="Audit package.json for outdated dependencies. File issues for any with CVEs." ) ```

Antigravity CLI Migration (Deadline: June 18, 2026)

```bash # Before — Gemini CLI, deprecated June 18, 2026 gemini run --model gemini-2.5-pro "audit this file"

# After — Antigravity CLI, same harness, future-proofed antigravity run "audit this file" ```

Any CI/CD pipeline, script, or developer workflow using gemini CLI commands must migrate before the removal date. The Antigravity CLI shares the same underlying harness, so the migration is mechanical — syntax change, not behavioral change.


Production Go-Live Checklist (Detailed)

Authentication and Quotas

  • [ ] API key with Gemini API access enabled (Google AI Studio or Vertex endpoint)
  • [ ] Confirm the correct API endpoint — consumer API (ai.google.dev) vs. Vertex AI have different quota limits
  • [ ] For enterprise, confirm private-preview access to Gemini Enterprise Agent Platform before starting development
  • [ ] Request quota increases before launch; preview-tier quotas are lower than GA equivalents

Interactions API Migration (June 8 Deadline)

  • [ ] Set Api-Revision: 2026-05-20 header on all Interactions API requests
  • [ ] Replace all interaction.outputs references with interaction.steps
  • [ ] Move image generation config from generation_config.image_config to top-level response_format
  • [ ] Run integration tests against the new schema before June 8

```python from google import genai

client = genai.Client()

# Correct invocation pattern (post-May 26 schema) interaction = client.interactions.create( agent="antigravity-preview-05-2026", input="Your task description here.", background=True, # offload to server for tasks > 30s headers={"Api-Revision": "2026-05-20"} )

# Poll for completion — background tasks require polling, no push notification import time while interaction.status == "running": time.sleep(5) interaction = client.interactions.get(interaction.id)

# Access output via steps, not outputs (outputs removed June 8) result = interaction.steps[-1].output ```

Observability

  • [ ] Log interaction.id for every request — it's the only handle for debugging server-side execution
  • [ ] Set up alerting on Interactions API error rates — no SLA means no automatic Google alerting on degraded service
  • [ ] Capture interaction.steps sequence for multi-step tasks; this is your audit trail for agent behavior

Fallback Behavior

  • [ ] Define a generateContent fallback for every Managed Agents invocation in any flow with user-facing SLA
  • [ ] Set request timeouts and retry budgets — background tasks can run indefinitely if an agent loops; implement a max-wait ceiling in your polling logic
  • [ ] Test failure modes explicitly: sandbox timeout, Interactions API 503, schema mismatch

When NOT to Use Managed Agents

If your workload requires a latency or uptime SLA, Managed Agents is not the right surface today. No SLA exists. generateContent is the only stable path for committed production guarantees.

If you're in a compliance-gated environment, the active breaking-change cadence on the Interactions API is a blocker. The May 2026 schema change arrived with approximately 20 days' notice. Compliance teams requiring audit trails of exact API request/response formats should wait for GA.

If your tasks are short and stateless, generateContent is cheaper, faster, and more stable. The Interactions API's structural advantages — server-side history, background execution, managed sandboxes — only pay off at multi-turn and long-running scale. A single-turn completion under five seconds has nothing to gain from the preview beta.

If you need enterprise access, confirm the private-preview gate before committing engineering time. The Gemini Enterprise Agent Platform path is not self-serve as of 2026-05-30. Building against the consumer API and planning to migrate later adds integration risk.

The Academy's position: Managed Agents API is a viable operator surface post-I/O 2026 for teams with appropriate preview tolerance. The Interactions API beta is production-ready for internal and non-SLA workloads with the documented caveats applied. Neither is a drop-in replacement for generateContent in SLA-backed deployments — yet.


What to Watch in the Next 90 Days

Based on Google's public development velocity (Managed Agents blog):

  • Interactions API GA — the breaking-change cycle and SDK refinement suggest GA is 3–6 months out; watch the changelog for the announcement
  • Antigravity SDK stability — likely exits preview alongside Interactions API GA
  • Enterprise path self-serve — expect a waitlist-to-open expansion of the Gemini Enterprise Agent Platform private preview
  • June 18 Gemini CLI removal — hard date; migrate now to avoid CI pipeline breakage

Knowledge Check

Which component should you use for production workloads with SLA commitments today?

A) Managed Agents API (antigravity-preview-05-2026) B) Interactions API with background=True C) generateContent API D) Antigravity 2.0 SDK (preview)

<details> <summary>Answer</summary>

C — `generateContent` API. Google's own guidance states: "For production workloads, you should continue to use the standard generateContent API. It remains the recommended path for stable deployments." The Managed Agents API and Interactions API are in public preview and early beta respectively, with no published SLAs and active breaking changes. Source: ai.google.dev/gemini-api/docs/interactions-overview.

</details>


Ready to go deeper on production agent deployment patterns? The Academy's How to build production Gemini Enterprise agents with routing, lifecycle, and governance in 8 chapters course covers the full Gemini Enterprise Agent Platform — from ADK setup through multi-agent orchestration, quota management, and cost modeling — with runnable examples at every step.

References

  1. ai.google.dev
  2. blog.google
  3. cloud.google.com
  4. ai.google.dev
  5. ai.google.dev
  6. blog.google
  7. antigravity.google
  8. blog.google
  9. blog.google
Next up
community 6-8 min read

The workflow harness is the moat: cross-CLI context persistence with Codex, Claude Code, Cursor, and Gemini in 2026

Continue reading