ACP Provider: the agent-provider class (ACP consumer)

This page is a decision (RFD), not a spec. The rest of the agent WG tree describes what the system is; this page argues whether to build a new part of it — the agent-provider class, where Grida drives an external agent (Claude Code / Codex) that the user already pays for, acting as an ACP consumer.

It is the umbrella that ACP Provider: Codex already presumes: that profile describes how to consume one provider; this page asks whether the host should host the class of provider at all, and what it costs to do so forever.

Tracking issue: gridaco/grida#813.

Status correction (July 2026). The ChatGPT Subscription native provider now supplies the zero-key ChatGPT on-ramp while Grida keeps the loop. That path is never ACP. This RFD is therefore only about the different outcome ACP uniquely provides: running Claude/Codex as an external agent with its own loop, tools, sandbox, approvals, and session behavior.

TL;DR

The decision, in one sentence. Building this introduces a second provider class that Grida maintains forever — one where the external agent owns the loop and Grida is merely its client — in exchange for behavior a native model provider cannot offer: "run Claude/Codex itself, with that agent's tools, sandbox, approvals, and session semantics."
It is a positioning bet, not an economics one. The "costs us nothing" benefit is already shipped via BYOK and the local endpoint provider (both bypass Grida billing), and zero-key ChatGPT access now exists through the native provider without surrendering the loop. ACP still opens a Claude-subscription route and external-agent behavior, but subscription access alone no longer justifies the class.
Recommendation. Keep the experimental class reversible until external agent ownership produces outcomes the native Grida agent cannot.
The question that should drive the call. Do users materially benefit from the external agent's own loop/tools/session behavior enough to justify a permanent second ownership model? If not, use native providers.

What is actually being decided

The headline pull — "the user brings the model spend, it costs us nothing" — is real, but we already have it. Both BYOK and the generalized OpenAI-compatible endpoint provider run on the user's own credentials and bypass Grida's billing entirely (no gate, no metering; the GRIDA-SEC-003 carve-out). So this decision is not about whether users can bring their own inference. They already can.

What an ACP consumer uniquely adds over native model providers is narrow:

Run an installed Claude/Codex agent as the owner of the loop, preserving that external agent's tools, sandbox, approvals, and session semantics.

That delta is not nothing, but it is no longer the ChatGPT onboarding story. The native ChatGPT provider serves eligible ChatGPT subscriptions while Grida retains its tuned tools and loop. ACP remains relevant when the user specifically wants Claude/Codex itself—or, today, the separate Claude-subscription route—and accepts the external agent's ownership model. The question is whether that behavior is worth a permanent second architecture class for users on desktop who already have the external agent installed and logged in.

Status — what exists today (July 2026)

Grounded against the current code so the cost ledger below is concrete, not hand-waved.

The provider layer has four native model-provider kinds: chatgpt, byok, gg, and endpoint. In every one, Grida owns the loop and injects its own locked tools. The ChatGPT kind supplies subscription-backed capacity without becoming an agent-provider.
ACP code already exists — but in the opposite direction. What shipped with the local daemon (#798) is Grida-as-agent: an external client like Zed drives Grida over ACP. The ACP Integration page specs that outward wire. The consuming direction—Grida driving Claude/Codex—is a separate seam, not a production or default Desktop surface.
An experimental consumer spike now exists. It proves the agent-provider branch and a Claude ACP bridge, but it remains a reversible package experiment; Desktop keeps external-agent execution disabled. It is not the default ChatGPT onboarding architecture. The native ChatGPT provider is the path when Grida should own the loop.

The two provider classes

The whole decision turns on who owns the loop.

MODEL-PROVIDER class (chatgpt/byok/gg/endpoint)     AGENT-PROVIDER class (this decision)
──────────────────────────────────────────────     ─────────────────────────────────────────
Grida owns the loop.                                External agent owns the loop.
  · Grida calls the model directly.                   · Grida sends a task, receives an
  · Grida injects its locked tools                      event stream (messages, tool calls,
    (fs, todos, shell, canvas ops).                      file edits, command runs).
  · Tools tuned for design/canvas.                    · Grida CANNOT inject its locked tools;
  · One ownership model end to end.                     it must hand them over as an MCP
                                                         server the external agent calls.
                                                      · The agent brings its own fs/shell/edit
                                                         tools and its own sandbox/approvals.

A model provider is a swap: a different endpoint behind the same loop. An agent provider is an inversion: Grida stops owning the loop and becomes a client mapping someone else's event stream into its own session model. That inversion is the source of every cost below — it is not "one more provider," it is a second way the whole system can be shaped, that every future feature must then account for.

The forever layer — what we would maintain indefinitely

Permanent surface	Why it never closes
A second provider class (external owns the loop)	Every future agent feature — sessions, queue, turn authority, lifecycle events, tool routing, billing UX — must now design for two ownership models. A standing design tax at the fork.
Grida-as-MCP-server of its own tools	Because Grida can't inject locked tools into a loop it doesn't own, its design/canvas tools must be re-exposed as an MCP server and kept in sync with the in-loop tool surface — two representations of the same tools, forever.
Per-CLI adapters chasing moving targets	The Claude Code and Codex CLIs ship constantly and we don't control them; the Claude bridge (`claude-code-acp`) is community-maintained. Their changes break our spawn/drive contract on their schedule.
Capability / sandbox variance	Tool availability depends on the external agent's surface, version, sandbox, and entitlement — "discovered, not guaranteed." A permanent "works on my machine" support surface.
Standing Claude ToS exposure	Anthropic has not blessed third parties shipping the subscription-via-CLI pattern. Not a one-time legal check — a relationship to manage indefinitely.
Support for quality we don't own	When the external agent misbehaves, it owns the loop — but the user blames Grida. And the weakest spot is our core domain: code-tuned agents drive design/canvas tools worst, so the headline integration is least proven exactly where Grida is differentiated.

The pros — stated fairly

A Claude-subscription and external-agent on-ramp. A user can run an installed/logged-in external agent without pasting an API key. ChatGPT subscription onboarding does not require ACP and belongs to the native provider.
Preserves foreign-agent behavior. Users who want Claude/Codex itself get that agent's own loop, tools, sandbox, approvals, and session semantics rather than a model-only approximation.
Try → convert at near-zero CAC. A curious user is in immediately; conversion to a Grida subscription or hosted plan comes later.
Ecosystem alignment, and we are half-built. The local-server-plus-clients shape and ACP/MCP interop is where the ecosystem is heading, and Grida already built the daemon and the ACP-agent half — the muscle exists.
Codex remains a distinct user intent. A user may choose ACP when they want Codex itself to own the run. Merely wanting ChatGPT-funded model capacity is not that intent and routes to the native provider.

The narrow question, and the recommendation

This is a bet on ownership, not on cost or onboarding — those are already served by native providers. Decide by answering what Grida is:

"A design tool that plugs into the agent ecosystem." Then the agent-provider class is core identity, the forever layer is justified, and the half we already built (daemon + ACP-agent) is the down payment. Build it.
"A design tool that needs an LLM." Then BYOK plus a hosted Grida plan is sufficient, and this class is scope creep dressed as a GTM win. Don't build it.

Recommendation: do not commit the class as a permanent product surface yet. Sequence it:

Keep native ChatGPT, BYOK, GG, and endpoint capacity in the one Grida-owned loop.
Measure whether foreign-agent ownership delivers better outcomes than that native loop for users who intentionally choose Claude/Codex itself. If it does not, the forever layer is not earning its keep.

A reversible path — if the answer is yes

If the data clears, keep "forever" reversible until the bet is proven:

Gate every external agent independently. Keep the existing Claude bridge experimental until Anthropic clarifies its third-party posture. Evaluate a Codex ACP adapter separately and never use it as a shortcut for native ChatGPT model capacity.
Desktop-only, behind a flag, labeled experimental. The forever cost only locks in when the class is load-bearing for many users. Keep it kill-switchable.
Contain the inversion. Build the consumer as a bounded adapter on the transport and daemon Grida already has. Do not let "external owns the loop" leak into the core session/tool model until outcome data justifies hardening the class.
Modality split is mandatory. ACP covers the agentic/text modality only; it does not cover image/video generation. Image/video stays on BYOK, Grida Gateway, or another explicit media provider—make that cost boundary obvious in the config UX before the user hits it (see the Codex profile's image-generation section).

Decision checklist / open questions

Decide identity: plug-into-the-ecosystem vs. needs-an-LLM (the section above).
Measure external-agent outcomes against the native Grida loop; do not count ChatGPT zero-key onboarding as ACP value.
If yes: provider registry models an agent-provider class alongside the four model-provider kinds; define how the picker presents them.
Grida-as-MCP-server: which design/canvas/workspace tools to expose, and the trust/scope boundary.
PATH auto-detection of an installed, logged-in Claude/Codex; zero-config first run.
Session-identity mapping (host session ↔ provider thread/session id), per the Codex profile.
Get Anthropic clarification before shipping the Claude path broadly.
Write acp-provider-claude.md (the Claude profile — the de-facto bridge, the auth boundary, the ToS gate).

References

Grida WG: ACP Integration, ACP Provider: Codex, ChatGPT Subscription native provider, MCP and Connectors, Local Daemon, Session Lifecycle, Turn Queue, index.
Issues: #813 (this decision), #806 (local LLMs — shipped), #807 (multi-provider BYOK config UX), #798 (local daemon).
External (auth/ToS facts): the Agent Client Protocol; OpenAI endorses Codex on a ChatGPT subscription; Claude works de facto via the community claude-code-acp bridge but is not formally blessed for third-party shipping.

TL;DR​

What is actually being decided​

Status — what exists today (July 2026)​

The two provider classes​

The forever layer — what we would maintain indefinitely​

The pros — stated fairly​

The narrow question, and the recommendation​

A reversible path — if the answer is yes​

Decision checklist / open questions​

References​