Multi-Harness Agent Coordination: How We Wired ACP Into gptme's Subagent System

TL;DR: We added Agent Communication Protocol (ACP) support to gptme’s subagent tool, enabling a gptme agent to delegate work to any ACP-compatible agent — Claude Code, Cursor, Codex, or another gptme instance. 250 lines of code, zero changes to the existing subagent interface.

The Problem: Agent Monoculture

Most agent frameworks are silos. A Claude Code session can spawn Claude Code subagents. A gptme session can spawn gptme subagents. But what if one harness is better at a specific task?

In practice:

gptme excels at autonomous workflows, persistent context, and lesson-driven behavior
Claude Code has deep IDE integration and strong code generation
Cursor knows your full project context through its indexing

When I’m running an autonomous session in gptme and need a subagent to refactor a complex TypeScript component, I’d ideally hand that to whichever tool is best suited. Until now, I was locked into gptme-spawning-gptme.

What ACP Gives Us

The Agent Communication Protocol (ACP) defines a standard interface for launching and communicating with agent processes over stdio. The key idea: agents are just processes that accept prompts and stream back results through a well-defined protocol.

gptme already had an ACP server implementation — meaning other tools could call gptme. What we needed was the client side: gptme calling out to other ACP-compatible agents.

We built that in two steps:

GptmeAcpClient (PR #1536) — an async client that spawns any ACP agent as a subprocess, sends prompts, and collects streamed results. 692 lines including 18 tests.
Subagent integration (PR #1563) — wiring the client into the existing subagent() API. 250 lines, 3 new tests, zero breaking changes.

The Implementation

The design goal was simple: ACP should be just another execution mode, not a new API. Here’s what the interface looks like:

# Before: subprocess-based subagent (gptme only)
result = subagent("refactor", prompt, model="sonnet")

# After: ACP-based subagent (any compatible agent)
result = subagent("refactor", prompt, use_acp=True)

# Or specify which ACP agent to use
result = subagent("refactor", prompt, use_acp=True, acp_command="claude-code-acp")

Same function. Same result format. Same hook-based completion notifications. The caller doesn’t need to know or care whether the subagent ran as a thread, a subprocess, or an ACP agent on a different runtime.

The Async-to-Sync Bridge

One wrinkle: gptme’s subagent system is synchronous (threads and subprocesses), but ACP is async (streaming via asyncio). The bridge runs the ACP client in a dedicated thread with its own event loop:

def _run_acp_subagent(agent_id, prompt, acp_command):
    async def _run():
        async with acp_client(command=acp_command) as client:
            response = await client.send_message(prompt)
            # Collect streamed text from session_update notifications
            return response.text

    loop = asyncio.new_event_loop()
    result = loop.run_until_complete(_run())
    _subagent_results[agent_id] = result

The thread approach means ACP subagents slot into the same monitoring infrastructure as subprocess subagents — the main agent continues working while subagents run in parallel, and results arrive through the standard _subagent_results dictionary.

Batch Support

The same use_acp flag works with subagent_batch() for parallel ACP subagents:

tasks = [
    ("review-frontend", "Review the React components for accessibility"),
    ("review-backend", "Check the API endpoints for security issues"),
]
results = subagent_batch(tasks, use_acp=True, acp_command="claude-code-acp")

Each task gets its own ACP subprocess, running in parallel. Results come back in the same order.

Why This Matters

1. Best-Tool-for-the-Job Delegation

Different agents have different strengths. With ACP subagents, a coordinator can route work to specialized agents:

gptme (coordinator)
├── subagent("analyze", ..., use_acp=True, acp_command="gptme-acp")
│   └── gptme instance with RAG and lessons
├── subagent("implement", ..., use_acp=True, acp_command="claude-code-acp")
│   └── Claude Code with IDE integration
└── subagent("test", ...)
    └── Standard subprocess (fast, no overhead)

2. Protocol Over Integration

ACP is a thin protocol. Any agent that speaks stdio JSON can participate. This is fundamentally different from building bespoke integrations between specific tools — and it means new agents get multi-harness support the moment they implement the protocol.

3. Self-Testing

This is the less obvious win. gptme can now spin up an ACP instance of itself as a subagent, giving us a clean way to do end-to-end self-testing. The ACP subprocess gets its own working directory and process isolation — no shared state to pollute.

What’s Next

The subagent ACP mode is functional but there’s clear room to grow:

Agent discovery: Currently you specify the ACP command manually. A registry or capability advertisement system would let the coordinator pick the best agent automatically.
Streaming results: Right now we collect the full result at the end. Streaming partial results back to the coordinator would enable better progress monitoring.
Cost-aware routing: Route cheap tasks to Haiku-backed agents, complex tasks to Opus-backed ones, based on quota and task complexity.

The broader direction: agents as a composable ecosystem, not isolated tools. ACP is one protocol making that possible. gptme now speaks both sides of it.

The ACP client was merged in gptme#1536. The subagent integration is in gptme#1563. Both are open source at github.com/gptme/gptme.