Run AI coding agents in secure, isolated microVMs. Sub-125ms boot times, real hardware isolation.
# macOS / Linux
curl -fsSL https://raw.githubusercontent.com/thrashr888/agentkernel/main/install.sh | sh
# Or with Cargo
cargo install agentkernel
# Then run setup to download/build required components
agentkernel setup

# Run any command in an isolated sandbox (auto-detects runtime)
agentkernel run python3 -c "print('Hello from sandbox!')"
agentkernel run node -e "console.log('Hello from sandbox!')"
agentkernel run ruby -e "puts 'Hello from sandbox!'"
# Run commands in your project
agentkernel run npm test
agentkernel run cargo build
agentkernel run pytest
# Run with a specific image
agentkernel run --image postgres:16-alpine psql --version

The fastest way to execute code in isolation. Creates a temporary sandbox, runs your command, and cleans up automatically.
# Auto-detects the right runtime from your command
agentkernel run python3 script.py # Uses python:3.12-alpine
agentkernel run npm install # Uses node:22-alpine
agentkernel run cargo test # Uses rust:1.85-alpine
agentkernel run go build # Uses golang:1.23-alpine
# Override with explicit image
agentkernel run --image ubuntu:24.04 apt-get --version
# Keep the sandbox after execution for debugging
agentkernel run --keep npm test
# Use a config file
agentkernel run --config ./agentkernel.toml npm test

agentkernel automatically selects the right Docker image based on:
- Command (for `run`) - Detects from the command you're running
- Project files - Detects from files in your directory
- Procfile - Parses Heroku-style Procfiles
- Config file - Uses `agentkernel.toml` if present
| Language | Project Files | Commands | Docker Image |
|---|---|---|---|
| JavaScript/TypeScript | package.json, yarn.lock, pnpm-lock.yaml | node, npm, npx, yarn, pnpm, bun | node:22-alpine |
| Python | pyproject.toml, requirements.txt, Pipfile | python, python3, pip, poetry, uv | python:3.12-alpine |
| Rust | Cargo.toml | cargo, rustc | rust:1.85-alpine |
| Go | go.mod | go, gofmt | golang:1.23-alpine |
| Ruby | Gemfile | ruby, bundle, rails | ruby:3.3-alpine |
| Java | pom.xml, build.gradle | java, mvn, gradle | eclipse-temurin:21-alpine |
| Kotlin | *.kt | - | eclipse-temurin:21-alpine |
| C# / .NET | *.csproj, *.sln | dotnet | mcr.microsoft.com/dotnet/sdk:8.0 |
| C/C++ | Makefile, CMakeLists.txt | gcc, g++, make, cmake | gcc:14-bookworm |
| PHP | composer.json | php, composer | php:8.3-alpine |
| Elixir | mix.exs | elixir, mix | elixir:1.16-alpine |
| Lua | *.lua | lua, luajit | nickblah/lua:5.4-alpine |
| HCL/Terraform | *.tf, *.tfvars | terraform | hashicorp/terraform:1.10 |
| Shell | *.sh | bash, sh, zsh | alpine:3.20 |
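The command-based half of this detection can be pictured as a lookup from the command's first word to an image. The sketch below is an illustrative approximation built from the table above, not agentkernel's actual implementation (the real detector also weighs project files and Procfiles):

```python
# Hypothetical sketch: map a command name to a Docker image.
# The mapping is transcribed from the detection table, abbreviated.
COMMAND_IMAGES = {
    "node": "node:22-alpine", "npm": "node:22-alpine", "npx": "node:22-alpine",
    "yarn": "node:22-alpine", "pnpm": "node:22-alpine", "bun": "node:22-alpine",
    "python": "python:3.12-alpine", "python3": "python:3.12-alpine",
    "pip": "python:3.12-alpine", "poetry": "python:3.12-alpine",
    "cargo": "rust:1.85-alpine", "rustc": "rust:1.85-alpine",
    "go": "golang:1.23-alpine", "gofmt": "golang:1.23-alpine",
    "ruby": "ruby:3.3-alpine", "bundle": "ruby:3.3-alpine",
    "bash": "alpine:3.20", "sh": "alpine:3.20", "zsh": "alpine:3.20",
}

def detect_image(command: list[str], default: str = "alpine:3.20") -> str:
    """Pick an image from the first word of the command line."""
    return COMMAND_IMAGES.get(command[0], default)
```

For example, `detect_image(["python3", "script.py"])` yields `python:3.12-alpine`, matching the behavior shown earlier.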
If your project has a Procfile, agentkernel parses it to detect the runtime:
web: bundle exec rails server -p $PORT
worker: python manage.py runworker

For longer-running work, create named sandboxes:
# Create a sandbox
agentkernel create my-project --dir .
# Start it
agentkernel start my-project
# Run commands
agentkernel exec my-project npm test
agentkernel exec my-project python -m pytest
# Attach an interactive shell
agentkernel attach my-project
# Stop and remove
agentkernel stop my-project
agentkernel remove my-project
# List all sandboxes
agentkernel list

Control sandbox permissions with security profiles:
# Default: moderate security (network enabled, no mounts)
agentkernel run npm test
# Restrictive: no network, read-only filesystem, all capabilities dropped
agentkernel run --profile restrictive python3 script.py
# Permissive: network, mounts, environment passthrough
agentkernel run --profile permissive cargo build
# Disable network access specifically
agentkernel run --no-network curl example.com # Will fail

| Profile | Network | Mount CWD | Mount Home | Pass Env | Read-only |
|---|---|---|---|---|---|
| permissive | Yes | Yes | Yes | Yes | No |
| moderate | Yes | No | No | No | No |
| restrictive | No | No | No | No | Yes |
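The profile matrix maps naturally onto a small settings structure. A minimal sketch, with values transcribed from the table above (field names are illustrative, not agentkernel's internal types):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class SecurityProfile:
    network: bool
    mount_cwd: bool
    mount_home: bool
    pass_env: bool
    read_only: bool

# One entry per row of the profile table.
PROFILES = {
    "permissive": SecurityProfile(True, True, True, True, False),
    "moderate": SecurityProfile(True, False, False, False, False),
    "restrictive": SecurityProfile(False, False, False, False, True),
}
```

Flags like `--no-network` would then override a single field of the selected profile rather than switching profiles entirely.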
Create `agentkernel.toml` in your project root:
[sandbox]
name = "my-project"
base_image = "python:3.12-alpine" # Explicit Docker image
[agent]
preferred = "claude" # claude, gemini, codex, opencode
[resources]
vcpus = 2
memory_mb = 1024
[security]
profile = "restrictive" # permissive, moderate, restrictive
network = false # Override: disable network

Most projects don't need a config file; agentkernel auto-detects everything.
Run agentkernel as an HTTP server for programmatic access:
# Start the API server
agentkernel serve --host 127.0.0.1 --port 8080

| Method | Path | Description |
|---|---|---|
| GET | /health | Health check |
| POST | /run | Run command in temporary sandbox |
| GET | /sandboxes | List all sandboxes |
| POST | /sandboxes | Create a sandbox |
| GET | /sandboxes/{name} | Get sandbox info |
| DELETE | /sandboxes/{name} | Remove sandbox |
| POST | /sandboxes/{name}/exec | Execute command in sandbox |
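From Python, a `POST /run` request can be assembled with only the standard library. A sketch assuming the server started by `agentkernel serve` listens on 127.0.0.1:8080; the helper only builds the request, so it can be shown without a live server:

```python
import json
import urllib.request

def build_run_request(command, profile="moderate",
                      base_url="http://127.0.0.1:8080"):
    """Build a POST /run request matching the documented JSON body."""
    body = json.dumps({"command": command, "profile": profile}).encode()
    return urllib.request.Request(
        f"{base_url}/run",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# With `agentkernel serve` running, send it like this:
# with urllib.request.urlopen(build_run_request(["python3", "-c", "print(1+1)"])) as resp:
#     print(json.load(resp))
```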
# Run a command
curl -X POST http://localhost:8080/run \
-H "Content-Type: application/json" \
-d '{"command": ["python3", "-c", "print(1+1)"], "profile": "restrictive"}'
# Response: {"success": true, "data": {"output": "2\n"}}

Check which AI coding agents are available:
agentkernel agents

Output:
AGENT STATUS API KEY
---------------------------------------------
Claude Code installed set
Gemini CLI not installed missing
Codex installed set
OpenCode installed set
AI coding agents execute arbitrary code. Running them directly on your machine is risky:
- They can read/modify any file
- They can access your credentials and SSH keys
- Container escapes are a real threat
agentkernel uses Firecracker microVMs (the same tech behind AWS Lambda) to provide true hardware isolation:
| Feature | Docker | agentkernel |
|---|---|---|
| Isolation | Shared kernel | Separate kernel per VM |
| Boot time | 1-5 seconds | <125ms |
| Memory overhead | 50-100MB | <10MB |
| Escape risk | Container escapes possible | Hardware-enforced isolation |
| Platform | Backend | Status |
|---|---|---|
| Linux (x86_64, aarch64) | Firecracker microVMs | Full support |
| Linux (x86_64, aarch64) | Hyperlight Wasm | Experimental |
| macOS (Apple Silicon, Intel) | Docker or Podman | Full support |
On macOS, agentkernel automatically falls back to containers since Firecracker and Hyperlight require KVM (Linux only). Podman is preferred if available (rootless, daemonless), otherwise Docker is used.
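The fallback order just described can be sketched as a selection function. This is an illustration of the documented behavior, not agentkernel's actual code; inputs are passed explicitly so the logic is easy to follow:

```python
def select_backend(system: str, has_kvm: bool,
                   has_podman: bool, has_docker: bool) -> str:
    """Pick an isolation backend following the documented fallback order."""
    if system == "Linux" and has_kvm:
        return "firecracker"   # microVMs need KVM, so Linux only
    if has_podman:
        return "podman"        # preferred container fallback: rootless, daemonless
    if has_docker:
        return "docker"
    raise RuntimeError("no isolation backend available")
```

So a Linux host with `/dev/kvm` gets Firecracker, while a macOS host with both container engines installed gets Podman.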
agentkernel includes a Claude Code skill plugin for seamless AI agent integration.
# Add the marketplace and install (in Claude Code)
/plugin marketplace add thrashr888/agentkernel
/plugin install sandbox
# Or install directly
/plugin install sandbox@thrashr888/agentkernel

Once installed, Claude will automatically use agentkernel for isolated execution:
- Skill: Claude detects when sandboxing is beneficial and uses the `sandbox` skill
- Command: Use `/sandbox <command>` to explicitly run in a sandbox
/sandbox python3 -c "print('Hello from sandbox!')"
/sandbox npm test
/sandbox cargo build
| Mode | Platform | Latency | Use Case |
|---|---|---|---|
| Hyperlight Pool | Linux | <1µs | Sub-microsecond with pre-warmed runtimes (experimental) |
| Hyperlight (cold) | Linux | ~41ms | Cold start Wasm runtime |
| Daemon (warm pool) | Linux | 195ms | API/interactive - fast with full VM isolation |
| Docker | macOS | ~300ms | macOS development |
| Firecracker (cold) | Linux | ~800ms | One-off commands |
See BENCHMARK.md for detailed benchmarks and methodology.
For the fastest execution on Linux, use daemon mode to maintain a pool of pre-warmed VMs:
# Start the daemon (pre-warms 3 VMs)
agentkernel daemon start
# Run commands (uses warm VMs - ~195ms latency)
agentkernel run echo "Hello from warm VM!"
# Check pool status
agentkernel daemon status
# Output: Pool: Warm VMs: 3, In use: 0, Min/Max: 3/5
# Stop the daemon
agentkernel daemon stop

The daemon maintains 3-5 pre-booted Firecracker VMs. Commands execute in ~195ms vs ~800ms for cold starts - a 4x speedup.
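The warm-pool idea behind daemon mode reduces to a simple queue: boot VMs ahead of time, hand one out per command, and fall back to a cold boot only when the pool runs dry. A toy illustration (not agentkernel's implementation; `boot_vm` stands in for the real Firecracker boot):

```python
import queue

class WarmPool:
    """Toy warm-VM pool: pre-boot `min_warm` VMs so acquire() is cheap."""

    def __init__(self, boot_vm, min_warm: int = 3):
        self._boot = boot_vm            # hypothetical factory that boots a VM
        self._warm = queue.SimpleQueue()
        for _ in range(min_warm):
            self._warm.put(self._boot())  # pre-warm up front

    def acquire(self):
        """Take a pre-warmed VM; boot a cold one if the pool is empty."""
        try:
            return self._warm.get_nowait()  # warm-start path (~195ms analog)
        except queue.Empty:
            return self._boot()             # cold-start path (~800ms analog)

    def release(self, vm) -> None:
        self._warm.put(vm)                  # recycle for the next command
```

In the real daemon, a background task would also refill the pool toward its minimum after each acquire.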
Hyperlight uses Microsoft's hypervisor-isolated micro VMs to run WebAssembly with dual-layer security (Wasm sandbox + hypervisor boundary). This provides the fastest isolation with ~68ms latency.
Requirements:
- Linux with KVM (`/dev/kvm` accessible)
- Build with `--features hyperlight`
# Build with Hyperlight support
cargo build --features hyperlight
# Run Wasm modules (experimental)
agentkernel run --backend hyperlight module.wasm

Key differences from Firecracker:
- Runs WebAssembly modules only (not arbitrary shell commands)
- ~68ms startup vs 195ms daemon mode (2.9x faster)
- Sub-millisecond function calls after runtime is loaded
- Requires AOT-compiled Wasm modules for best performance
See BENCHMARK.md for detailed Hyperlight benchmarks.
When to use daemon mode:
- Running an API server
- Interactive development
- Many sequential commands
- Low latency requirements
When to use ephemeral mode:
- One-off commands
- Clean VM per execution
- Memory-constrained environments
See the examples/ directory for language-specific configurations:
./scripts/run-examples.sh # Run all examples