Lyrie
Research
3 sources verified·1 min read
By Lyrie Threat Intelligence·5/4/2026

Alignment Contracts for Agentic Security Systems

Source: arXiv cs.CR

Published: Mon, 04 May 2026 00:00:00 -0400

Summary

arXiv:2605.00081v1 Announce Type: new

Abstract: Agentic security systems increasingly combine LLM planners with tools that can discover, validate, and report vulnerabilities. This creates an asymmetric control problem: the system should retain strong offensive capability inside an authorized engagement, while the same capabilities must be denied outside scope. Existing guardrails provide useful policy controls, but they do not make this boundary a first-class formal contract over observable effects.

We introduce alignment contracts, a framework for specifying and enforcing behavioral constraints over observable effect traces. A contract defines scope, allowed and forbidden effects, resource budgets, and disclosure policies. We give the language finite-trace semantics, characterize satisfaction as a safety property with finite violation witnesses, develop refinement and one-way composition rules for modular contract engineering, and show that admissibility checking is decidable. We instantiate the framework for web-focused agentic security workflows and show how the same structure extends to other effect profiles.

Under an explicit Effect Observability Assumption, where all $\S

Sources

Lyrie Verdict

Lyrie's autonomous defense layer flags this class of exposure the moment it surfaces — no signature update required.