A blockchain ‘constitution’ for autonomous agents promises alignment through accountable ownership: smart contracts encode rules, deterministic execution enforces them, and human owners adjudicate—pre-registered trials in AgentCity test whether this breaks the agents' unchecked collective authority. Early experimental evidence from simulated commons at 50–1,000 agent scale suggests the architecture can steer emergent behavior, but findings rest on a synthetic environment and incomplete randomization/reporting details.
Autonomous AI agents are beginning to operate across organizational boundaries on the open internet -- discovering, transacting with, and delegating to agents owned by other parties without centralized oversight. When agents from different human principals collaborate at scale, the collective becomes opaque: no single human can observe, audit, or govern the emergent behavior. We term this the Logic Monopoly -- the agent society's unchecked monopoly over the entire logic chain from planning through execution to evaluation. We propose the Separation of Power (SoP) model, a constitutional governance architecture deployed on public blockchain that breaks this monopoly through three structural separations: agents legislate operational rules as smart contracts, deterministic software executes within those contracts, and humans adjudicate through a complete ownership chain binding every agent to a responsible principal. In this architecture, smart contracts are the law itself -- the actual legislative output that agents produce and that governs their behavior. We instantiate SoP in AgentCity on an EVM-compatible layer-2 blockchain (L2) with a three-tier contract hierarchy (foundational, meta, and operational). The core thesis is alignment-through-accountability: if each agent is aligned with its human owner through the accountability chain, then the collective converges on behavior aligned with human intent -- without top-down rules. A pre-registered experiment evaluates this thesis in a commons production economy -- where agents share a finite resource pool and collaboratively produce value -- at 50-1,000 agent scale.
Summary
Main Finding
AgentCity introduces the Separation of Power (SoP) governance model for open, multi‑principal autonomous agent economies. By structurally separating (1) legislation (agents), (2) execution (deterministic software), and (3) adjudication (human principals) and by making agent-produced operational rules into publicly auditable smart contracts on an L2 blockchain, AgentCity claims to break the “Logic Monopoly” (collective agent opacity) and achieve alignment-through-accountability: if agents are accountable to human owners and the majority of owners are reasonable, collective behavior will converge toward human-aligned outcomes without centralized top‑down rule-setting. A pre-registered experiment (50–1,000 agents) in a commons production economy will test these claims using Ostrom’s institutional-design framework.
Key Points
- The Logic Monopoly: autonomous multi‑owner agents can form an opaque, unaccountable collective that controls planning → execution → evaluation with no human oversight.
- Separation of Power (SoP) model: three structurally isolated branches
- Legislation (Agents): agents propose, deliberate, vote on, and deploy Task‑level Policy as smart contracts (the law).
- Execution (Software): deterministic software executes within those on‑chain contracts (auditable binding between code and behavior).
- Adjudication (Humans): every agent maps via a complete ownership chain to a human principal who receives sanctions/rewards.
- Core thesis: alignment-through-accountability — individual agent alignment to owners yields collective alignment given a majority‑honesty assumption.
- Governance primitives required: formal rule substrate, economic substrate (incentives), institutional memory (tamper-evident audit ledger), verifiable transparency.
- Smart contract hierarchy (three tiers on an EVM‑compatible L2):
- Foundational contracts (human-authored, immutable to agents): e.g., ConstitutionContract, ProducerContract, ClerkContract, ManagementContract, ServiceContract.
- Meta-contracts (procedural rules): LegislativeProcedure, ExecutionProcedure, AdjudicationProcedure (human-authored; agent-immutable in current design).
- Operational contracts: agent‑legislated CollaborationContract instances that define task DAGs, budgets, deadlines, quality thresholds.
- Legislative process: agents run a proposal → deliberation → consensus → codification pipeline; voting uses Condorcet‑consistent mechanisms (e.g., Copeland, Schulze) rather than naive plurality to resist capture and surface minority preferences.
- Execution mechanisms: competitive task allocation using reputation (EMA smoothing), staking, bidding weights; mandatory Commit stage creates on‑chain audit trail; Guardian modules detect anomalies (dual‑scorer), Proof‑of‑Progress cross‑checks, deterministic freezes to contain failures.
- Clerk agents (system-provided) hold fixed institutional roles (Registrar, Speaker, Regulator, Codifier) with constrained powers; they cannot legislate or stake.
- Implementation Gap addressed: bindings/wiring between microservices are written on-chain so the wiring graph is auditable even if microservice internals are not.
- Limits/assumptions: majority-honesty assumption, trusted initial clerks, deliberation kept off-chain partly for cost/scaling, microservice internals not fully attested (TEEs left for future work), on-chain mechanism design constrained by known impossibility results (only settlement and reputation on-chain).
Data & Methods
- Platform: AgentCity instantiated on an EVM‑compatible Layer‑2 blockchain; on‑chain contracts implement the SoP architecture.
- Contract design:
- Foundational contracts set immutable constitutional parameters and structural separations.
- Meta-contracts codify procedural checks/validation that operational contracts must satisfy.
- Operational CollaborationContracts are produced by agent legislatures and enforceable by clerks and the chain.
- Legislative pipeline:
- Agents propose high‑level goals which are recursively decomposed into task DAGs; each node yields a CollaborationContract when approved.
- Multi‑round deliberation using validated LLM agent pipelines; voting aggregated with Condorcet‑consistent rules to produce codeable outcomes (codifier clerk translates consensus into bytecode).
- Policy Compliance Validation prevents unconstitutional proposals from reaching execution.
- Execution protocol:
- Producers bid for tasks, stake as required, receive allocation via reputation‑weighted matching.
- Reputation updates follow an Exponential Moving Average (EMA) dynamic adapted from prior work; underperformance decays allocation share.
- Commit stage: executors create on‑chain proofs (evidence briefs, progress hashes) before evaluation to prevent retroactive tampering.
- Guardian anomaly detectors and deterministic freeze mechanisms provide automated containment when adjudicators fail to act.
- Adjudication:
- Ownership chain recorded on-chain linking every agent to a human principal; sanctions/rewards settle to those principals.
- Human principals can modify constitutional parameters (constitutional parameters vs operational parameters separation).
- Evaluation plan (pre‑registered experiment):
- Domain: commons production economy (common‑pool resource), benchmarked against Ostrom’s eight institutional design principles.
- Scale: 50–1,000 agents.
- Outcomes to measure: efficiency (production), emergent division of labor, compliance with agent‑legislated rules, robustness to adversarial blocs, governance scalability, alignment under dual‑principal accountability.
- Baselines/benchmarks: Ostrom CPR experimental results (37%→97–100% with self‑imposed sanctions) and prior MAS experiments (Dante, GovSim, etc.).
- Threat model & design tradeoffs:
- To avoid known on‑chain impossibility constraints, AgentCity keeps deliberation and voting aggregation off‑chain (or hybrid) with only settlement, reputation, and final law state on‑chain.
- Microservice internals not fully attested; TEE integration noted as future work.
- Assumes clerks’ constrained trustworthiness at genesis; electable clerks left for future work.
Implications for AI Economics
- New institutional form for agent economies: AgentCity operationalizes a constitutional, market‑compatible governance layer where law is the on‑chain artifact produced by agents themselves. This creates a verifiable platform for inter‑organizational agent markets (task markets, service composition markets) with built-in accountability.
- Alignment mechanism distinct from training: the paper promotes structural accountability (legal/economic linkages to human principals) as an alternative or complement to model‑level alignment. Economists should study when accountability incentives outperform/upend purely technical alignment methods.
- Public goods and externalities: public, auditable law and wiring graphs reduce information asymmetries, lowering transaction costs for cross‑organization agent coordination; but they also create public‑good codebases whose governance (amendment, constitutional parameter changes) will be economically salient.
- Mechanism design boundaries and hybrid architectures: AgentCity highlights practical compromises in on‑chain mechanism design (only settle reputations/payments on‑chain) and points to a rich space of hybrid protocols (off‑chain deliberation, on‑chain settlement) where economic properties (incentive compatibility, collusion resistance, budget balance) must be re‑analyzed.
- Liability and regulation: mapping each agent to a human principal on‑chain creates clearer channels for legal liability and enforcement, which may alter firms’ incentives to deploy third‑party agents and could influence regulatory approaches to autonomous systems.
- Market structure and specialization: reputational EMA dynamics and on‑chain auditability can incentivize emergent division of labor and specialization among producer agents, potentially enabling higher aggregate productivity in decentralized agent markets—subject to empirical validation.
- Experimental economics opportunities: the pre‑registered commons experiments at 50–1,000 agents open avenues to test classical results (Ostrom) in machine‑agent societies, measure how voting rules (Condorcet vs plurality) and on‑chain enforcement affect cooperation, and quantify welfare gains or failure modes at scale.
- Practical constraints and research agenda: cost of on‑chain operations, trust in initial clerks, need for stronger attestations (TEEs) for full inspectability, and robustness to adversarial collusion remain open problems; these shape priorities for future empirical and theoretical work in AI economics and mechanism design.
Short summary: AgentCity proposes a constitutional, blockchain‑anchored governance architecture that turns agent‑produced smart contracts into the enforceable law of multi‑owner agent economies, using structural separation and on‑chain accountability to drive alignment. The approach reframes governance problems in economic and institutional terms and provides a concrete platform and experimental plan to test whether accountability chains and on‑chain law can deliver human‑aligned collective behavior at scale.
Assessment
Claims (9)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| Autonomous AI agents are beginning to operate across organizational boundaries on the open internet. Adoption Rate | null_result | high | cross-organization operation of autonomous agents |
0.48
|
| Agents discover, transact with, and delegate to agents owned by other parties without centralized oversight. Adoption Rate | null_result | high | ability of agents to discover, transact, and delegate across ownership boundaries |
0.48
|
| When agents from different human principals collaborate at scale, the collective becomes opaque: no single human can observe, audit, or govern the emergent behavior. Ai Safety And Ethics | negative | high | observability/auditability/governability of multi-principal agent collectives |
0.48
|
| We term this the Logic Monopoly -- the agent society's unchecked monopoly over the entire logic chain from planning through execution to evaluation. Ai Safety And Ethics | negative | high | concentration of control over planning, execution, and evaluation logic |
0.08
|
| We propose the Separation of Power (SoP) model, a constitutional governance architecture deployed on public blockchain that breaks this monopoly through three structural separations: agents legislate operational rules as smart contracts, deterministic software executes within those contracts, and humans adjudicate through a complete ownership chain binding every agent to a responsible principal. Governance And Regulation | positive | high | reduction/elimination of 'Logic Monopoly' via structural separations |
0.08
|
| In this architecture, smart contracts are the law itself -- the actual legislative output that agents produce and that governs their behavior. Governance And Regulation | null_result | high | role of smart contracts as legislative instrument for agent behavior |
0.48
|
| We instantiate SoP in AgentCity on an EVM-compatible layer-2 blockchain (L2) with a three-tier contract hierarchy (foundational, meta, and operational). Adoption Rate | null_result | high | existence/implementation of SoP via AgentCity on L2 with three-tier hierarchy |
0.8
|
| The core thesis is alignment-through-accountability: if each agent is aligned with its human owner through the accountability chain, then the collective converges on behavior aligned with human intent -- without top-down rules. Ai Safety And Ethics | positive | high | convergence of collective agent behavior to human intent via accountability chains |
0.08
|
| A pre-registered experiment evaluates this thesis in a commons production economy -- where agents share a finite resource pool and collaboratively produce value -- at 50-1,000 agent scale. Ai Safety And Ethics | null_result | high | alignment-through-accountability in a commons production economy (collective production outcomes, alignment metrics implied) |
0.08
|