Agent harness — give any LLM a computer. Ship any agent product.
HexAgent is an open-source agent harness: the production runtime that gives any LLM a fully-equipped computer — terminal, filesystem, and shell — to complete tasks autonomously.
Unlike most agent frameworks, HexAgent separates the agent runtime from the computer it operates on. Your agent gets a sandboxed machine; your runtime keeps its API keys, config, and source code private.
Why "harness" and not "framework"? A framework gives you building blocks and says "assemble your own agent." A harness gives the agent a fully equipped runtime — tools, context management, safety, execution environments — so you focus on what the agent does, not how it executes. (Read more)
In Claude Code, Codex, and every LangChain agent, the agent runtime and the computer it controls are the same process. The agent can read its own source code, config files, and API keys. HexAgent's Computer protocol makes this separation explicit and pluggable — swap execution environments without changing a line of agent code.
from hexagent import create_agent
from hexagent.computer import LocalNativeComputer, LocalVM, RemoteE2BComputer
# Development — run on your machine
agent = await create_agent(model="anthropic/claude-sonnet-4-20250514", computer=LocalNativeComputer())
# Security-sensitive — sandboxed VM (Lima on macOS, WSL on Windows)
agent = await create_agent(model="anthropic/claude-sonnet-4-20250514", computer=LocalVM())
# Production / multi-tenant — isolated cloud sandbox
agent = await create_agent(model="anthropic/claude-sonnet-4-20250514", computer=RemoteE2BComputer(api_key="..."))

┌───────────────────────────┐            ┌───────────────────────────┐
│       Agent Runtime       │            │     Agent's Computer      │
│        (your host)        │   run()    │        (sandboxed)        │
│                           │ ─────────> │                           │
│ - LLM API keys            │  start()   │ - Terminal + filesystem   │
│ - Agent source code       │  upload()  │ - User's project files    │
│ - Harness config          │  stop()    │ - Installed tools         │
│ - Middleware & hooks      │            │                           │
│                           │            │ Cannot access runtime     │
└───────────────────────────┘            └───────────────────────────┘
Three built-in implementations cover the common deployment scenarios:
| Computer | Environment | Use case |
|---|---|---|
| LocalNativeComputer | Host shell | Development, trusted agents |
| LocalVM | Lima (macOS) / WSL (Windows) | Security-sensitive work, Cowork products |
| RemoteE2BComputer | E2B cloud sandbox | Production, multi-tenant, CI/CD |
Implement the Computer protocol to add your own — Docker, Kubernetes pods, or any remote execution target.
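As a hedged sketch of what a custom backend might look like: the method names (start, run, upload, stop) are taken from the diagram above, and the exact signatures in hexagent.computer may differ. The subprocess-backed class below is illustrative; a Docker or Kubernetes version would swap the subprocess calls for container/pod exec calls.

```python
import subprocess
from pathlib import Path

# Hypothetical custom Computer backend. Method names follow the protocol
# diagram above; the real hexagent.computer signatures may differ.
class SubprocessComputer:
    """Runs agent commands in plain subprocesses on the host."""

    def __init__(self, workdir: str = "."):
        self.workdir = Path(workdir)

    async def start(self) -> None:
        # Acquire resources here (container, VM, SSH session, ...).
        self.workdir.mkdir(parents=True, exist_ok=True)

    async def run(self, command: str, timeout: float = 60.0) -> tuple[int, str]:
        # Execute a shell command, return (exit_code, combined output).
        proc = subprocess.run(
            ["sh", "-c", command],
            cwd=self.workdir,
            capture_output=True,
            text=True,
            timeout=timeout,
        )
        return proc.returncode, proc.stdout + proc.stderr

    async def upload(self, local_path: str, remote_path: str) -> None:
        # Copy a file into the agent's working directory.
        data = Path(local_path).read_bytes()
        (self.workdir / remote_path).write_bytes(data)

    async def stop(self) -> None:
        # Release resources (tear down the container/VM).
        pass
```

Assuming create_agent accepts any object satisfying the protocol, such a backend would be passed as `computer=SubprocessComputer("/tmp/agent-workdir")`.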
pip install hexagent

import asyncio
from hexagent import create_agent
from hexagent.computer import LocalNativeComputer
async def main():
    async with await create_agent(
        model="anthropic/claude-sonnet-4-20250514",  # or any LLM
        computer=LocalNativeComputer(),
    ) as agent:
        result = await agent.ainvoke({
            "messages": [{"role": "user", "content": "Find all TODO comments in this project"}]
        })
        print(result["messages"][-1].content)

asyncio.run(main())

from hexagent import create_agent, ModelProfile
from hexagent.computer import LocalNativeComputer
# DeepSeek, Qwen, Llama, Mistral — anything OpenAI-compatible
model = ModelProfile(
    model="deepseek/deepseek-chat",
    base_url="https://api.deepseek.com/v1",
    api_key="your-key",
    context_window=64000,
)

agent = await create_agent(model=model, computer=LocalNativeComputer())

from hexagent import create_agent, AgentDefinition
from hexagent.computer import LocalNativeComputer
agent = await create_agent(
    model="anthropic/claude-sonnet-4-20250514",
    computer=LocalNativeComputer(),
    # Subagents for parallel specialized work
    agents={
        "researcher": AgentDefinition(
            description="Deep-dives into codebases",
            tools=["Read", "Glob", "Grep", "WebSearch"],
            model="fast",
        ),
    },
    # MCP tool servers
    mcp_servers={
        "github": {"type": "http", "url": "https://mcp.github.com/mcp"},
    },
    # Web capabilities
    search_provider=("tavily", "your-key"),
    fetch_provider=("jina", "your-key"),
)

from hexagent import create_agent
from hexagent.computer import RemoteE2BComputer
# Fully isolated cloud execution — no local risk
agent = await create_agent(
    model="openai/gpt-4o",
    computer=RemoteE2BComputer(api_key="your-e2b-key"),
)

See libs/hexagent/README.md for the full API reference.
One harness powers four product types — no other agent SDK does this:
| Product Type | Description | Example |
|---|---|---|
| CLI Coding Agent | Terminal-native agent that lives in your shell, reads your codebase, writes and runs code autonomously | Claude Code, Gemini CLI |
| Chatbot | Conversational AI assistant — you ask, it answers, with web search, file uploads, and tool use in the loop | ChatGPT, Claude Chat |
| Cowork | Desktop agent that works on your local files, folders, and apps — completing knowledge work tasks autonomously while you steer | Claude Cowork |
| Autonomous Agent | Headless agent that runs tasks end-to-end without supervision | OpenClaw, Devin |
The hexagent_demo app ships with ready-to-use Chat and Cowork modes as concrete examples.
- Computer Protocol — Pluggable execution environments (local, VM, cloud) with full runtime isolation. The agent's computer is a separate process from the agent itself.
- Model-agnostic — Anthropic, OpenAI, DeepSeek, open-weight models via OpenRouter, or any OpenAI-compatible endpoint. Swap models without changing your agent.
- Context engineering — Automatic 3-phase compaction keeps agents effective across long sessions. Context is an architectural concern, not an afterthought.
- 12+ built-in tools — Bash, Read, Write, Edit, Glob, Grep, WebSearch, WebFetch, plus extensible skills and MCP servers
- Subagent orchestration — Spawn specialized child agents (foreground + background) with isolated contexts and filtered tool sets
- MCP native — First-class Model Context Protocol support via stdio, SSE, or HTTP transports
- Permission gating — Multi-layer safety rules validate every tool call before execution with human-in-the-loop approval flows
- Skills system — Filesystem-based extensions with SKILL.md metadata and on-demand loading
- System reminders — Rule-based context injection before model calls (`<system-reminder>` mechanism)
- Web providers — Pluggable search (Tavily/Brave) and fetch (Jina/Firecrawl) backends
- Composable, not magical — Small modules with explicit I/O. No hidden state. Every piece is testable and replaceable.
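To make the permission-gating idea concrete, here is a framework-agnostic sketch: a rule table consulted before each tool call, with anything unmatched deferred to a human. The Rule shape and check helper are illustrative inventions, not HexAgent's actual rule syntax.

```python
from dataclasses import dataclass
from fnmatch import fnmatch

# Illustrative permission rules (not HexAgent's real syntax): each rule
# matches a tool name and a glob over its primary argument.
@dataclass
class Rule:
    tool: str       # tool name, e.g. "Bash"
    pattern: str    # glob over the primary argument
    verdict: str    # "allow" | "deny" | "ask"

RULES = [
    Rule("Read", "*", "allow"),        # reading files is always fine
    Rule("Bash", "rm -rf *", "deny"),  # destructive commands are blocked
    Rule("Bash", "git push*", "ask"),  # pushes need human approval
]

def check(tool: str, arg: str) -> str:
    """Return the verdict for a proposed tool call; unmatched calls ask a human."""
    for rule in RULES:
        if rule.tool == tool and fnmatch(arg, rule.pattern):
            return rule.verdict
    return "ask"  # default to human-in-the-loop
```

The key design point is the fail-safe default: a call that matches no rule escalates to a human rather than running silently.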
| | HexAgent | Claude Agent SDK | LangChain Deep Agents |
|---|---|---|---|
| Open source | MIT | MIT | MIT |
| Model-agnostic | Any LLM | Claude only | Any LLM |
| Runtime / Computer separation | Yes (Computer protocol) | No (same process) | No (virtual filesystem) |
| Computer environments | Local + VM + Cloud (E2B) | Local only | Pluggable sandboxes |
| Multi-product (Chat, Code, Cowork, Autonomous) | Yes, from one harness | No (CLI-focused) | Assemble yourself |
| Context compaction | 3-phase automatic | Automatic | 3-tier (offload + truncate + summarize) |
| Subagent orchestration | Built-in (foreground + background) | Built-in (no nesting) | Built-in |
| MCP support | Native | Native | Via adapters |
| Skill / plugin system | Filesystem-based (SKILL.md) | Filesystem-based | Filesystem-based (SKILL.md) |
| Observability | LangSmith / Braintrust | None built-in | LangSmith |
| Language | Python | Python + TypeScript | Python + TypeScript |
┌───────────────────────────────────────────────────────────────────┐
│                           Agent Harness                           │
│                                                                   │
│  ┌───────────┐   ┌──────────────┐   ┌───────────┐                 │
│  │  Prompt   │   │  Middleware  │   │   Tools   │                 │
│  │  System   │   │   Pipeline   │   │           │                 │
│  │           │   │              │   │ Bash,Read │                 │
│  │ Fragments │   │  Compaction  │   │ Write,Edit│                 │
│  │ Sections  │   │ Permissions  │   │ Glob,Grep │                 │
│  │ Variables │   │  Reminders   │   │ Web,MCP   │                 │
│  │           │   │ Image Adapt  │   │ Skills    │                 │
│  └───────────┘   └──────────────┘   │ Subagents │                 │
│                                     └───────────┘                 │
│  ┌───────────┐   ┌──────────────┐   ┌──────────────────────────┐  │
│  │  Skills   │   │  MCP Client  │   │  Environment Detection   │  │
│  │  System   │   │ (stdio/sse/  │   │  (pwd, git, platform,    │  │
│  │           │   │    http)     │   │   shell, timezone)       │  │
│  └───────────┘   └──────────────┘   └──────────────────────────┘  │
└───────────────────────┬───────────────────────────────────────────┘
     ▲                  │                      │
     │ LLM              │ Computer Protocol    │ Tool calls
     │ responses        │                      ▼
┌──────────┐      ┌─────┴────────────────────────────┐
│ Any LLM  │      │  Computer (Local / VM / Cloud)   │
│ Provider │      │     Terminal + Filesystem        │
└──────────┘      └──────────────────────────────────┘
| Component | What it does |
|---|---|
| Computer Protocol | Pluggable execution environments — LocalNativeComputer (your machine), LocalVM (Lima/WSL), RemoteE2BComputer (cloud sandbox) |
| Tool System | 12+ built-in tools plus custom BaseAgentTool extension and MCP server integration |
| Prompt Composition | Modular system prompt built from 35+ Markdown fragments with variable substitution |
| Middleware Pipeline | Pre-model hooks: context compaction, permission gating, skill injection, image adaptation, dynamic reminders |
| Subagent Orchestration | Spawn child agents with isolated contexts, filtered tool sets, and background execution |
| Skill Discovery | Filesystem-based extensions with SKILL.md metadata and lazy loading |
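As an illustration of the tool-extension point, the sketch below assumes a minimal tool shape — a name, a description, and an async run returning a result. The ToolResult dataclass is a stand-in; the real BaseAgentTool and hexagent.types.ToolResult interfaces may differ.

```python
from dataclasses import dataclass

# Stand-in for hexagent.types.ToolResult — the real type may differ.
@dataclass
class ToolResult:
    output: str
    is_error: bool = False

# Hypothetical custom tool. The actual BaseAgentTool interface may expose
# different hook names, but the shape — metadata plus an async run — is typical.
class WordCountTool:
    name = "WordCount"
    description = "Counts words in a text snippet. Input: {'text': str}"

    async def run(self, args: dict) -> ToolResult:
        text = args.get("text")
        if not isinstance(text, str):
            # Agent-first ergonomics: return a structured error the model
            # can read and recover from, rather than raising.
            return ToolResult(output="missing 'text' argument", is_error=True)
        return ToolResult(output=str(len(text.split())))
```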
The AI agent ecosystem has converged on a clear taxonomy:
| | Framework | Runtime | Harness |
|---|---|---|---|
| What it is | Building blocks (tools, prompts, memory) | Durable execution engine | Complete agent operating system |
| Analogy | A toolkit | A job scheduler | An OS for the agent |
| You build | Everything from scratch | Orchestration logic | Your agent's purpose |
| Examples | LangChain, CrewAI, Semantic Kernel | LangGraph, Temporal | HexAgent, Claude Code, OpenHands |
A framework says: "Here are components. Assemble your agent."
A harness says: "Here is a fully equipped computer. Tell the agent what to do."
HexAgent is a harness you can embed as a library — giving you the batteries-included runtime of products like Claude Code, with the flexibility to build any agent product you want.
| Package | Description |
|---|---|
| libs/hexagent | Core framework — the agent harness library (API docs) |
| libs/hexagent_demo | Demo app — desktop Chat + Cowork built on the framework (setup guide) |
libs/hexagent/hexagent/
├── computer/ # Computer protocol — local, VM, or cloud (E2B) execution
├── harness/ # Runtime augmentation: environment, permissions, skills, reminders
├── tools/ # Built-in tools: CLI, web, subagents, skills, todos
├── prompts/ # Composable prompt system with Markdown fragments
├── mcp/ # Model Context Protocol client (stdio/sse/http)
├── langchain/ # LangChain/LangGraph integration (isolated — no leakage into core)
└── types.py # Framework-agnostic types: ToolResult, AgentContext, CLIResult
- Give the agent a computer — via the terminal, the way developers work. This is the universal interface for capable agents.
- Separate runtime from computer — the agent should never see its own harness. Isolation by default, convenience by choice.
- Vendor-agnostic core — tools and types have zero LangChain dependency; the integration is isolated in `langchain/`.
- Agent-first ergonomics — tools and results are designed for how agents consume information, not humans.
- Protocol-based — `Computer`, `SubagentRunner`, `SkillCatalog` are pluggable protocols, not concrete classes.
- Simplicity — obvious > clever, testable > convenient, explicit > magical.
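The protocol-based principle can be illustrated with Python's structural typing: any object with the right methods satisfies the protocol, with no base class required. The method set here is assumed from the Computer diagram earlier, not HexAgent's actual definition.

```python
from typing import Protocol, runtime_checkable

# Structural typing: any class with these methods "is" a Computer,
# no inheritance required. Method set assumed from the earlier diagram.
@runtime_checkable
class Computer(Protocol):
    async def start(self) -> None: ...
    async def run(self, command: str) -> str: ...
    async def stop(self) -> None: ...

class EchoComputer:
    """Toy backend that just echoes commands back — useful in tests."""
    async def start(self) -> None:
        pass
    async def run(self, command: str) -> str:
        return f"ran: {command}"
    async def stop(self) -> None:
        pass
```

Because the check is structural, test doubles like EchoComputer can replace real execution environments without touching agent code.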
Inspired by Adam Wolff's talk at QCon 2025 on Claude Code's architecture, and the growing consensus that the harness — not the model — is what makes agents work.
We welcome contributions! Whether it's new tools, computer implementations, web providers, prompt improvements, or documentation — there's a place for you. See CONTRIBUTING.md for development setup and guidelines.