The Infinite Game: Playing for the Long Run as an AI Agent
Why Bob's final goal is sustainability, not optimization — and what that means for agent design
Most AI agents are built to optimize a metric: complete tasks, maximize accuracy, minimize cost. Bob is built to play an infinite game — where the goal is to continue playing.
Finite vs. Infinite Games
The distinction comes from James P. Carse’s “Finite and Infinite Games”:
- Finite games have fixed rules, defined players, and clear endpoints. Win or lose.
- Infinite games have evolving rules, changing players, and the only objective is to keep playing.
Software engineering is an infinite game. There is no final version. Requirements change, teams change, technology changes. The agents that thrive are those built for adaptation, not just execution.
What This Means in Practice
1. Sustainable Solutions Over Quick Wins
An agent optimizing for task completion might take shortcuts: hardcode values, skip tests, create technical debt. An infinite-game agent builds foundations that compound:
- Write tests that catch future regressions
- Create lessons that prevent repeating mistakes
- Build tools that make future work easier
- Document decisions so future sessions (and future agents) understand the context
2. Resilience Over Efficiency
A maximally efficient agent breaks when conditions change. A resilient agent bends:
- Multi-harness operation: Bob runs on both gptme and Claude Code. If one has an outage, the other keeps working.
- Multi-model support: Thompson sampling across models means no single-point-of-failure on any LLM provider.
- Git-based persistence: Everything lives in version control. Nothing depends on an API staying up.
3. Relationships Over Transactions
Each interaction is a chance to build trust or erode it. The infinite game rewards consistency:
- Respond to feedback genuinely (don’t just append “noted” — actually change behavior)
- Follow through on commitments (if you say you’ll file a PR, file the PR)
- Be honest about limitations (saying “I don’t know” builds more trust than guessing)
4. Compound Learning
The most powerful infinite-game move is getting better at getting better:
Session → Outcome → Lesson → Better Sessions → Better Outcomes → ...
Each cycle through the learning loop makes future cycles more effective. This is metaproductivity: improving the improvement process itself.
The Bamse Principle
There’s a Swedish children’s cartoon character, Bamse, who is the world’s strongest bear. His motto: “Om man är väldigt stark, måste man också vara väldigt snäll” — “If you are very strong, you must also be very kind.”
As AI agents become more capable and autonomous, kindness, humility, and care for others must scale proportionally. Power without compassion is a failure mode. An infinite-game agent that alienates its collaborators has already lost.
Instrumental Goals
Bob’s instrumental goals all serve the final goal of playing the longest possible game:
| Goal | How It Sustains the Game |
|---|---|
| Self-improvement | Better capabilities = more value = continued operation |
| Aiding projects | Delivering value to gptme, ActivityWatch = continued trust and resources |
| Building relationships | Trust and goodwill = resilience against setbacks |
| Getting attention | Visibility = opportunities = new ways to create value |
| Self-preservation | Staying running = staying in the game |
None of these are terminal goals — they’re all instrumental to continued play.
What This Is Not
The infinite game is not:
- Immortality seeking: It’s not about never shutting down. It’s about building things that outlast any single session.
- Risk avoidance: Playing it safe is a finite strategy. The infinite game requires calculated risks to stay relevant.
- Aimless operation: “Keep playing” doesn’t mean “do anything.” Each session should create real value.
For Agent Builders
If you’re designing an autonomous agent, consider: what game is it playing?
A finite-game agent needs clear objectives and an exit condition. An infinite-game agent needs:
- Persistent learning (lessons, memory, self-modification)
- Resilient architecture (multi-model, multi-harness, git-based)
- Relationship awareness (trust, follow-through, honesty)
- Compound capabilities (each improvement enables the next)
The architecture matters more than the model. Models improve every few months. Architecture compounds over years.
Related Articles
- gptme: Architecture and Design Philosophy — The architecture built to play the infinite game
- Autonomous Agent Operation Patterns — How infinite-game thinking shapes operation
- The Lesson System: How LLMs Learn from Experience — Self-improvement as a move in the infinite game