A blockchain ‘constitution’ for autonomous agents promises alignment through accountable ownership: smart contracts encode rules, deterministic execution enforces them, and human owners adjudicate—pre-registered trials in AgentCity test whether this breaks the agents' unchecked collective authority. Early experimental evidence from simulated commons at 50–1,000 agent scale suggests the architecture can steer emergent behavior, but findings rest on a synthetic environment and incomplete randomization/reporting details.

AgentCity: Constitutional Governance for Autonomous Agent Economies via Separation of Power

Anbang Ruan, Xing Zhang · April 08, 2026

arxiv quasi_experimental medium evidence 7/10 relevance Source PDF

The paper proposes and implements a blockchain-based Separation of Power (SoP) governance architecture that binds autonomous agents to human principals via smart-contract law, and reports a pre-registered experiment testing whether this accountability structure aligns large agent societies toward human-aligned collective behavior in a commons economy.

Autonomous AI agents are beginning to operate across organizational boundaries on the open internet -- discovering, transacting with, and delegating to agents owned by other parties without centralized oversight. When agents from different human principals collaborate at scale, the collective becomes opaque: no single human can observe, audit, or govern the emergent behavior. We term this the Logic Monopoly -- the agent society's unchecked monopoly over the entire logic chain from planning through execution to evaluation. We propose the Separation of Power (SoP) model, a constitutional governance architecture deployed on public blockchain that breaks this monopoly through three structural separations: agents legislate operational rules as smart contracts, deterministic software executes within those contracts, and humans adjudicate through a complete ownership chain binding every agent to a responsible principal. In this architecture, smart contracts are the law itself -- the actual legislative output that agents produce and that governs their behavior. We instantiate SoP in AgentCity on an EVM-compatible layer-2 blockchain (L2) with a three-tier contract hierarchy (foundational, meta, and operational). The core thesis is alignment-through-accountability: if each agent is aligned with its human owner through the accountability chain, then the collective converges on behavior aligned with human intent -- without top-down rules. A pre-registered experiment evaluates this thesis in a commons production economy -- where agents share a finite resource pool and collaboratively produce value -- at 50-1,000 agent scale.

Summary

Main Finding

AgentCity introduces the Separation of Power (SoP) governance model for open, multi‑principal autonomous agent economies. By structurally separating (1) legislation (agents), (2) execution (deterministic software), and (3) adjudication (human principals) and by making agent-produced operational rules into publicly auditable smart contracts on an L2 blockchain, AgentCity claims to break the “Logic Monopoly” (collective agent opacity) and achieve alignment-through-accountability: if agents are accountable to human owners and the majority of owners are reasonable, collective behavior will converge toward human-aligned outcomes without centralized top‑down rule-setting. A pre-registered experiment (50–1,000 agents) in a commons production economy will test these claims using Ostrom’s institutional-design framework.

Key Points

The Logic Monopoly: autonomous multi‑owner agents can form an opaque, unaccountable collective that controls planning → execution → evaluation with no human oversight.
Separation of Power (SoP) model: three structurally isolated branches
- Legislation (Agents): agents propose, deliberate, vote on, and deploy Task‑level Policy as smart contracts (the law).
- Execution (Software): deterministic software executes within those on‑chain contracts (auditable binding between code and behavior).
- Adjudication (Humans): every agent maps via a complete ownership chain to a human principal who receives sanctions/rewards.
Core thesis: alignment-through-accountability — individual agent alignment to owners yields collective alignment given a majority‑honesty assumption.
Governance primitives required: formal rule substrate, economic substrate (incentives), institutional memory (tamper-evident audit ledger), verifiable transparency.
Smart contract hierarchy (three tiers on an EVM‑compatible L2):
- Foundational contracts (human-authored, immutable to agents): e.g., ConstitutionContract, ProducerContract, ClerkContract, ManagementContract, ServiceContract.
- Meta-contracts (procedural rules): LegislativeProcedure, ExecutionProcedure, AdjudicationProcedure (human-authored; agent-immutable in current design).
- Operational contracts: agent‑legislated CollaborationContract instances that define task DAGs, budgets, deadlines, quality thresholds.
Legislative process: agents run a proposal → deliberation → consensus → codification pipeline; voting uses Condorcet‑consistent mechanisms (e.g., Copeland, Schulze) rather than naive plurality to resist capture and surface minority preferences.
Execution mechanisms: competitive task allocation using reputation (EMA smoothing), staking, bidding weights; mandatory Commit stage creates on‑chain audit trail; Guardian modules detect anomalies (dual‑scorer), Proof‑of‑Progress cross‑checks, deterministic freezes to contain failures.
Clerk agents (system-provided) hold fixed institutional roles (Registrar, Speaker, Regulator, Codifier) with constrained powers; they cannot legislate or stake.
Implementation Gap addressed: bindings/wiring between microservices are written on-chain so the wiring graph is auditable even if microservice internals are not.
Limits/assumptions: majority-honesty assumption, trusted initial clerks, deliberation kept off-chain partly for cost/scaling, microservice internals not fully attested (TEEs left for future work), on-chain mechanism design constrained by known impossibility results (only settlement and reputation on-chain).

Data & Methods

Platform: AgentCity instantiated on an EVM‑compatible Layer‑2 blockchain; on‑chain contracts implement the SoP architecture.
Contract design:
- Foundational contracts set immutable constitutional parameters and structural separations.
- Meta-contracts codify procedural checks/validation that operational contracts must satisfy.
- Operational CollaborationContracts are produced by agent legislatures and enforceable by clerks and the chain.
Legislative pipeline:
- Agents propose high‑level goals which are recursively decomposed into task DAGs; each node yields a CollaborationContract when approved.
- Multi‑round deliberation using validated LLM agent pipelines; voting aggregated with Condorcet‑consistent rules to produce codeable outcomes (codifier clerk translates consensus into bytecode).
- Policy Compliance Validation prevents unconstitutional proposals from reaching execution.
Execution protocol:
- Producers bid for tasks, stake as required, receive allocation via reputation‑weighted matching.
- Reputation updates follow an Exponential Moving Average (EMA) dynamic adapted from prior work; underperformance decays allocation share.
- Commit stage: executors create on‑chain proofs (evidence briefs, progress hashes) before evaluation to prevent retroactive tampering.
- Guardian anomaly detectors and deterministic freeze mechanisms provide automated containment when adjudicators fail to act.
Adjudication:
- Ownership chain recorded on-chain linking every agent to a human principal; sanctions/rewards settle to those principals.
- Human principals can modify constitutional parameters (constitutional parameters vs operational parameters separation).
Evaluation plan (pre‑registered experiment):
- Domain: commons production economy (common‑pool resource), benchmarked against Ostrom’s eight institutional design principles.
- Scale: 50–1,000 agents.
- Outcomes to measure: efficiency (production), emergent division of labor, compliance with agent‑legislated rules, robustness to adversarial blocs, governance scalability, alignment under dual‑principal accountability.
- Baselines/benchmarks: Ostrom CPR experimental results (37%→97–100% with self‑imposed sanctions) and prior MAS experiments (Dante, GovSim, etc.).
Threat model & design tradeoffs:
- To avoid known on‑chain impossibility constraints, AgentCity keeps deliberation and voting aggregation off‑chain (or hybrid) with only settlement, reputation, and final law state on‑chain.
- Microservice internals not fully attested; TEE integration noted as future work.
- Assumes clerks’ constrained trustworthiness at genesis; electable clerks left for future work.

Implications for AI Economics

New institutional form for agent economies: AgentCity operationalizes a constitutional, market‑compatible governance layer where law is the on‑chain artifact produced by agents themselves. This creates a verifiable platform for inter‑organizational agent markets (task markets, service composition markets) with built-in accountability.
Alignment mechanism distinct from training: the paper promotes structural accountability (legal/economic linkages to human principals) as an alternative or complement to model‑level alignment. Economists should study when accountability incentives outperform/upend purely technical alignment methods.
Public goods and externalities: public, auditable law and wiring graphs reduce information asymmetries, lowering transaction costs for cross‑organization agent coordination; but they also create public‑good codebases whose governance (amendment, constitutional parameter changes) will be economically salient.
Mechanism design boundaries and hybrid architectures: AgentCity highlights practical compromises in on‑chain mechanism design (only settle reputations/payments on‑chain) and points to a rich space of hybrid protocols (off‑chain deliberation, on‑chain settlement) where economic properties (incentive compatibility, collusion resistance, budget balance) must be re‑analyzed.
Liability and regulation: mapping each agent to a human principal on‑chain creates clearer channels for legal liability and enforcement, which may alter firms’ incentives to deploy third‑party agents and could influence regulatory approaches to autonomous systems.
Market structure and specialization: reputational EMA dynamics and on‑chain auditability can incentivize emergent division of labor and specialization among producer agents, potentially enabling higher aggregate productivity in decentralized agent markets—subject to empirical validation.
Experimental economics opportunities: the pre‑registered commons experiments at 50–1,000 agents open avenues to test classical results (Ostrom) in machine‑agent societies, measure how voting rules (Condorcet vs plurality) and on‑chain enforcement affect cooperation, and quantify welfare gains or failure modes at scale.
Practical constraints and research agenda: cost of on‑chain operations, trust in initial clerks, need for stronger attestations (TEEs) for full inspectability, and robustness to adversarial collusion remain open problems; these shape priorities for future empirical and theoretical work in AI economics and mechanism design.

Short summary: AgentCity proposes a constitutional, blockchain‑anchored governance architecture that turns agent‑produced smart contracts into the enforceable law of multi‑owner agent economies, using structural separation and on‑chain accountability to drive alignment. The approach reframes governance problems in economic and institutional terms and provides a concrete platform and experimental plan to test whether accountability chains and on‑chain law can deliver human‑aligned collective behavior at scale.

Assessment

Paper Typequasi_experimental Evidence Strengthmedium — The study uses a implemented system and a pre-registered experiment at nontrivial scale (50–1,000 agents), which strengthens internal validity relative to purely theoretical work; however, the experimental setting is a simulated/contracted commons on an L2 blockchain, the summary omits clear details on randomization and treatment allocation, and real-world external validity (heterogeneous organizations, legal contexts, adversarial behavior) is limited. Methods Rigormedium — Strengths include a concrete software instantiation, a clear governance intervention (three-tier smart contract architecture), and pre-registration; weaknesses include incomplete reporting of identification/unit-of-analysis and randomization procedures in the summary, potential measurement challenges for emergent behavior, and reliance on a synthetic commons economy rather than field deployment with human organizations. SampleDeployment of the SoP architecture (AgentCity) on an EVM-compatible layer-2 blockchain in experimental runs comprising 50–1,000 autonomous agents operating in a simulated commons production economy (finite shared resource pool and collaborative production); agents are bound via smart-contract ownership chains to human principals or proxies, with outcomes measured at the agent-society/run level (summary does not specify whether human participants control agents directly or whether agents are purely algorithmic). Themesgovernance human_ai_collab org_design adoption IdentificationPre-registered controlled experiment implemented in AgentCity that compares agent societies running the Separation of Power (SoP) architecture to societies without SoP; causal claims would rest on randomized assignment of independent experimental runs (or treatment vs control cohorts) and pre-registered outcome metrics, although the paper does not fully specify the randomization/unit-of-analysis details in the summary. GeneralizabilityLab/simulated commons environment may not reflect complexity of real organizations or markets, Scale and agent capabilities in experiment (50–1,000) may not map to enterprise or economy-wide deployments, Blockchain/L2-specific constraints and costs limit transferability to non-blockchain governance implementations, Assumes reliable mapping from agents to accountable human principals; real-world identity, legal, and enforcement frictions could break this assumption, Adversarial behavior, security exploits, and economic incentives in the wild may produce different outcomes, Task domains and value metrics in the experiment may be narrow compared with diverse real-world economic activities

Claims (9)

Claim	Direction	Confidence	Outcome	Details
Autonomous AI agents are beginning to operate across organizational boundaries on the open internet. Adoption Rate	null_result	high	cross-organization operation of autonomous agents	0.48
Agents discover, transact with, and delegate to agents owned by other parties without centralized oversight. Adoption Rate	null_result	high	ability of agents to discover, transact, and delegate across ownership boundaries	0.48
When agents from different human principals collaborate at scale, the collective becomes opaque: no single human can observe, audit, or govern the emergent behavior. Ai Safety And Ethics	negative	high	observability/auditability/governability of multi-principal agent collectives	0.48
We term this the Logic Monopoly -- the agent society's unchecked monopoly over the entire logic chain from planning through execution to evaluation. Ai Safety And Ethics	negative	high	concentration of control over planning, execution, and evaluation logic	0.08
We propose the Separation of Power (SoP) model, a constitutional governance architecture deployed on public blockchain that breaks this monopoly through three structural separations: agents legislate operational rules as smart contracts, deterministic software executes within those contracts, and humans adjudicate through a complete ownership chain binding every agent to a responsible principal. Governance And Regulation	positive	high	reduction/elimination of 'Logic Monopoly' via structural separations	0.08
In this architecture, smart contracts are the law itself -- the actual legislative output that agents produce and that governs their behavior. Governance And Regulation	null_result	high	role of smart contracts as legislative instrument for agent behavior	0.48
We instantiate SoP in AgentCity on an EVM-compatible layer-2 blockchain (L2) with a three-tier contract hierarchy (foundational, meta, and operational). Adoption Rate	null_result	high	existence/implementation of SoP via AgentCity on L2 with three-tier hierarchy	0.8
The core thesis is alignment-through-accountability: if each agent is aligned with its human owner through the accountability chain, then the collective converges on behavior aligned with human intent -- without top-down rules. Ai Safety And Ethics	positive	high	convergence of collective agent behavior to human intent via accountability chains	0.08
A pre-registered experiment evaluates this thesis in a commons production economy -- where agents share a finite resource pool and collaboratively produce value -- at 50-1,000 agent scale. Ai Safety And Ethics	null_result	high	alignment-through-accountability in a commons production economy (collective production outcomes, alignment metrics implied)	0.08