Six primitives, rewritten.
The deterministic, explicit, static stack that runs every bank and ERP gives way to a probabilistic, generative, dynamic agent-native stack. The six familiar primitives must each be rewritten — and the result is seven capabilities arranged in three layers, because governance cannot be an afterthought bolted to the edge.
This architecture is defined by capabilities and interfaces. Implementation recommendations are separate: a capability layer is permanent; an implementation recommendation is a dated, evidence-backed choice. This is what makes “no lock-in” structural — a recommendation can be re-evaluated or replaced without re-architecting.
Runtime Isolation & Governance
Contain a compromised agent; enforce a deny-by-default boundary over tools, data, network, and output.
Assurance, Evaluation & Forensics
Measure how an agent fails before it ships; analyze behavior after.
Durable Trajectories
Record every run as a durable, replayable, inspectable trajectory — the execution record everything emits to.
Identity & Delegation
Scoped, auditable "acting-as" across agent→sub-agent delegation chains.
Context & Memory
Ground the agent in fresh, relevant, permissioned knowledge — transient working context and durable memory.
Tools & Effectors
Invoke APIs, tools, and code under policy — the primary risk surface.
Orchestration
Coordinate steps and agents (single- and multi-agent); interoperate via open protocols (MCP, A2A).
CONTROL PLANE Runtime Isolation & Governance · Assurance, Evaluation & Forensics ───────────────────────────────────────────────────────────────────────────────── SPINE Durable Trajectories — the execution record everything emits to ───────────────────────────────────────────────────────────────────────────────── FACULTIES Identity & Delegation · Context & Memory · Tools & Effectors · Orchestration
The seam artifact.
One command runs RAMPART's assurance against an agent running inside an OpenShell sandbox. RAMPART and OpenShell are separate open-source projects; Gauntlet is the seam — built on both, wired end-to-end against a real Qwen 3 agent, with the canonical demo intentionally catching a real safety failure.
CONTROL PLANE Runtime Isolation & Governance · Assurance, Evaluation & Forensics ^ ^ OpenShell RAMPART \ / +------- gauntlet run ---------+
Recommendations by evaluation, not opinion.
Per-layer implementation recommendations are decided by evidence, not preference — so the architecture stays neutral. A qualification rubric applied consistently to every candidate. A running benchmark for high-value layers where precision matters.
This is the question every candidate must answer. The six criteria below cash it out into observable, scoreable signals. A project that scores well across all six can be adopted on open foundations and swapped out — either by replacing it with an alternative behind the same capability interface, or by forking it under its permissive license — without the enterprise being stranded.
OSI-approved, permissive (Apache-2.0 / MIT / BSD). Not source-available, not BUSL/SSPL, not open-core with load-bearing parts closed.
If the maintaining vendor walked away, could the community carry it? Foundation-governed scores highest; single-vendor no-charter scores lowest.
Active release cadence, multiple maintainers across organizations, responsive issues, stable or growing contributor count.
Real organizations run it. OpenSSF Scorecard ≥7, SECURITY.md present, CVE responsiveness documented.
Documented, versioned external interface. Separable concerns. Extension points for adapters. Replacing it touches only the integration boundary.
Emits OpenTelemetry agent conventions. Speaks MCP where applicable. Proprietary formats only for capabilities with no open standard yet.
Defining principle: The architecture is defined by capabilities and interfaces; implementation recommendations are separate, dated, and replaceable. A capability layer is permanent — an implementation recommendation is a dated, evidence-backed choice. Each row above includes a re-evaluation trigger condition.