Kubernetes-native AI agent orchestration.
Getting Started ยท Architecture ยท API Reference ยท Documentation
Orka turns your Kubernetes cluster into an AI-powered task execution platform. Spin up swarms of AI agents that write code, review PRs, research topics, or run containers โ each as an isolated Kubernetes Job with full scheduling, retries, and observability. A coordinator agent dynamically decomposes complex tasks, spawns specialist agents to work in parallel, and synthesizes their results โ no manual orchestration graphs required.
One helm install, one LLM secret, and you're chatting with an orchestrator that handles the rest.
No API keys on developer machines โ LLM credentials live in Kubernetes Secrets, managed by your platform team. Developers connect via ServiceAccount tokens โ no risk of leaked keys in dotfiles, shell history, or laptops.
Centralized control โ One place to set model policies, rate limits, and allowed providers across every team. Swap models or providers without touching developer configs.
Every agent action is auditable โ Tasks run as Kubernetes Jobs with full logs, Prometheus metrics, and result storage. Know exactly what every agent did, when, and at what cost.
Isolated execution โ Each agent runs in its own Pod with a hardened security context: non-root, read-only rootfs, all capabilities dropped, seccomp enforced. Agents can't escape their sandbox.
Scale with your cluster โ Priority scheduling, retry policies, concurrency limits, and cron-based execution โ all handled by the Kubernetes control plane you already operate.
Parallel code review โ Spawn a swarm of review agents โ security, performance, test coverage, accessibility, whatever you need. Each reviews independently and in parallel, then the coordinator synthesizes findings into a single report.
Autonomous dev workflows โ A coordinator agent dynamically breaks down a feature request, delegates implementation to specialist agents (backend, frontend, tests), and opens a PR with the combined result โ no predefined workflow graphs.
Research with competing hypotheses โ Multiple agents investigate different theories in parallel, challenge each other's findings, and converge on the strongest explanation. The adversarial structure avoids the anchoring bias of sequential investigation.
Scheduled operations โ Cron-based agents that run daily security scans, dependency audits, or report generation โ all with retry policies and webhook notifications.
Use your favorite AI client โ Connect Continue, Cursor, or any OpenAI-compatible client to Orka's API. Your cluster manages the LLM credentials โ developers just code.
CI/CD integration โ Trigger agent tasks from GitHub Actions, monitor progress via the REST API, and gate deployments on agent analysis.
- ๐ค AI Agents โ Anthropic, OpenAI, or Azure OpenAI with tools, skills, and session persistence
- ๐ ๏ธ Agent Runtimes โ Delegate repo-backed coding tasks to Codex CLI, Claude Code CLI, or GitHub Copilot CLI
- ๐ Autonomous Task Loops โ Coordinators can iterate on long-running goals until complete, canceled, or at an iteration limit
- ๐ Multi-Agent Coordination โ Coordinators delegate to specialists with depth and concurrency controls
- ๐ฌ Interactive Chat โ Agentic orchestrator with SSE streaming that creates and manages agents and tasks for you
- ๐ง Durable Memory โ Namespace-scoped recall, transcript search, and reviewable memory proposals that can be applied
- ๐ก๏ธ Repository Security Scanning โ Scheduled and incremental repository scans with threat models, validated findings, patch generation, and remediation PRs
- ๐งฐ Agent Sandbox Workspaces โ Experimental durable, reusable coding workspaces through
agent-sandbox - ๐ฅ๏ธ Web Dashboard โ Built-in React UI embedded in the controller binary โ zero extra deployments
- ๐ฆ Declarative CRDs โ Task, Agent, Tool, Provider, and Skill custom resources for GitOps workflows
- โฐ Scheduled Tasks โ Cron-based recurring execution with concurrency policies
- ๐ REST & OpenAI-Compatible API โ Full CRUD +
/openai/v1/chat/completionsendpoint for Continue, Cursor, and any OpenAI-compatible client - ๐ Kubernetes, OIDC & Kontxt TxToken Auth โ ServiceAccount tokens by default, with optional OIDC and scoped
kontxttransaction-token flows - ๐ฎ Anthropic-Compatible API โ
/anthropic/v1/messagesendpoint for Claude Code and other Anthropic-native clients - ๐ Observability โ Prometheus metrics, structured logging, health probes
- ๐ Hardened by Default โ Non-root containers, read-only rootfs, ServiceAccount token auth
helm install orka charts/orka \
--namespace orka-system \
--create-namespacekubectl create secret generic anthropic-secret \
--from-literal=api-key=your-api-key
kubectl apply -f - <<EOF
apiVersion: core.orka.ai/v1alpha1
kind: Provider
metadata:
name: anthropic
spec:
type: anthropic
secretRef:
name: anthropic-secret
key: api-key
defaultModel: claude-sonnet-4-20250514
EOFUse the built-in dashboard, or connect any OpenAI-compatible client:
kubectl port-forward -n orka-system svc/orka-api 8080:8080
# Open the web dashboard
open http://localhost:8080The built-in orchestrator creates agents, runs tasks, monitors progress, and returns results โ all from natural language. See the OpenAI Compatibility and Anthropic Compatibility docs for proxy setup with your preferred client.
| Getting Started | Installation, quick start, CLI setup |
| Architecture | System design, components, and data flow |
| Configuration | CRD reference, Helm values, controller flags, metrics |
| Agent Runtimes | Codex CLI, Claude Code CLI, and Copilot CLI runtimes |
| Agent Sandbox | Experimental upstream agent-sandbox workspace execution for agent runtimes |
| Interactive Chat | Chat endpoint, tools, and SSE streaming |
| Multi-Agent Coordination | Coordinator agents and task delegation |
| Autonomous Tasks | Long-running coordinator loops with persisted plan state |
| Memory | Durable memory, proposals, transcript search, and validation |
| API Reference | REST API endpoints and usage examples |
| OpenAI Compatibility | OpenAI-compatible chat completions API |
| Anthropic Compatibility | Anthropic-compatible Messages API |
| Web Dashboard | Frontend architecture and pages |
| Security | Security model and hardening |
| Kontxt Quickstart | Use OIDC identity to call Orka without long-lived tokens |
| Kontxt TxToken Integration | TxToken verification, authorization, TTS, and audit guidance |
| Repository Security Scanning | Repository scan workflow, threat models, findings, and remediation |
| Development | Building, testing, and contributing |
| Testing | Test structure, patterns, and commands |