A data-driven, model-free traffic-light controller cuts citywide travel time and CO2 in a large Zurich simulation without bespoke traffic models. The approach substitutes sensor data for costly modeling—speeding deployment and lowering operational costs—though real-world gains hinge on representative data, safety checks and online adaptation.

Data-driven generalized perimeter control: Zürich case study

Alessio Rimoldi, Carlo Cenedese, Alberto Padoan, Florian Dörfler, John Lygeros · March 17, 2026

arxiv descriptive medium evidence 7/10 relevance Source PDF

A model-free DeePC traffic-light controller learned from measured trajectories reduces total travel time and CO2 emissions in a very large, high-fidelity microscopic simulation of Zürich relative to fixed-time and standard adaptive baselines.

Urban traffic congestion is a key challenge for the development of modern cities, requiring advanced control techniques to optimize existing infrastructures usage. Despite the extensive availability of data, modeling such complex systems remains an expensive and time consuming step when designing model-based control approaches. On the other hand, machine learning approaches require simulations to bootstrap models, or are unable to deal with the sparse nature of traffic data and enforce hard constraints. We propose a novel formulation of traffic dynamics based on behavioral systems theory and apply data-enabled predictive control to steer traffic dynamics via dynamic traffic light control. A high-fidelity simulation of the city of Zürich, the largest closed-loop microscopic simulation of urban traffic in the literature to the best of our knowledge, is used to validate the performance of the proposed method in terms of total travel time and CO2 emissions.

Summary

Main Finding

A model-free traffic control approach — combining behavioral systems theory with data-enabled predictive control (DeePC) — can steer urban traffic via dynamic traffic-light control without building explicit traffic models, and delivers measurable improvements in system-level outcomes (reduced total travel time and CO2 emissions) when validated on a very large, high-fidelity microscopic simulation of Zürich.

Key Points

Methodological contribution: reframes urban traffic dynamics using behavioral systems theory so system evolution can be learned and predicted directly from measured input–output data (no explicit model identification).
Control approach: applies Data-Enabled Predictive Control (DeePC) to compute traffic-light actuation sequences that respect hard constraints (e.g., signal timing and safety) while optimizing performance.
Practical advantages: avoids the expensive, time-consuming model-building step of model-based control; better handles sparse or limited traffic measurements compared with many ML methods; enforces hard constraints explicitly (unlike many pure learning approaches).
Large-scale validation: tested on a high-fidelity microscopic closed-loop simulation of the city of Zürich — reported as the largest such closed-loop urban-traffic simulation in the literature — demonstrating improvements in aggregate metrics (total travel time and CO2 emissions).
Robustness considerations: method leverages measured system trajectories (Hankel/data matrices) and regularization to deal with noise and limited data; however, real-world deployment will still require representative data coverage and online adaptation.

Data & Methods

Theoretical basis: behavioral systems theory — represents system trajectories as combinations of measured input/output data, enabling prediction and control without parametric model identification.
Control algorithm: Data-Enabled Predictive Control (DeePC). DeePC builds predictive models from past input/output trajectories (Hankel matrices) and solves an optimization (typically quadratic or convex) over future input sequences subject to constraints and cost objectives.
Objectives and constraints: optimize traffic performance (aggregate travel time, emissions proxies) while enforcing operational constraints on traffic signals (phasing, minimum/maximum green times, safety).
Validation environment: a very large-scale, high-fidelity microscopic closed-loop simulator of Zürich (traffic demand, network topology, vehicle interactions modeled at microscopic level; controller is in-the-loop). Performance evaluated on system-level metrics such as total travel time and CO2 emissions.
Comparisons: method is presented against baseline approaches (e.g., fixed-time or standard adaptive schemes), with DeePC achieving superior system-level outcomes in the simulated experiments.
Data requirements: needs representative historical trajectories (inputs = signal settings; outputs = traffic measurements such as flows, speeds, queue lengths). Method copes with sparse sensing better than many ML baselines, but quality/coverage of data still matters.

Implications for AI Economics

Value of data infrastructure: demonstrates how investment in sensing and data-collection infrastructure (real-time traffic sensors, connected vehicles) can substitute for costly modeling work, accelerating deployment of advanced control and increasing returns on sensor data.
Cost savings and externalities: reductions in travel time and CO2 point to measurable economic benefits—lower fuel consumption, productivity gains, reduced pollution-related health costs—strengthening the business case for data-driven traffic control.
Market opportunities and business models: enables scalable “control-as-a-service” offerings for cities or traffic authorities that use city data to deliver operational improvements without bespoke model development; creates a market for curated urban traffic datasets and real-time data platforms.
Public policy and regulation: motivates policies for safe data sharing and standardization (to produce representative training trajectories) and regulation/validation frameworks to certify data-driven controllers that directly affect public infrastructure and safety.
Distributional effects and welfare: system-level improvements do not necessarily imply uniform benefits—spatial or temporal distribution of gains may favor some routes or neighborhoods; planners must consider equity and potential unintended reallocations of congestion.
Complementarities and rebound effects: improved throughput and lower travel costs can increase trip demand (rebound), so gains in congestion/emissions may be partially offset unless paired with demand-management measures (pricing, transit improvements).
Labor and organizational impacts: reduces reliance on lengthy modeling efforts by traffic engineers, shifting the skill mix toward data engineering, controller integration, and ongoing monitoring; may change procurement and vendor relationships.
Robustness and deployment risk: economic value depends on representativeness of training data and ability to adapt to structural changes (new infrastructure, demand shocks); investing in continual data collection, online adaptation, and safety verification increases deployment costs but lowers operational risk.

If you want, I can (a) produce a brief technical sketch of the DeePC optimization used here (objective, constraints, Hankel construction), (b) outline a small economic cost–benefit calculation framework to evaluate city deployment, or (c) list practical steps for a city to pilot this approach. Which would be most useful?

Assessment

Paper Typedescriptive Evidence Strengthmedium — Findings are supported by a rigorous theoretical foundation (behavioral systems theory and DeePC) and extensive, large-scale high-fidelity simulation experiments showing consistent system-level gains, but evidence is limited to simulation rather than field deployment or randomized/observational real-world evaluation, leaving external validity and operational risks untested. Methods Rigorhigh — The work uses a sound control-theoretic basis (Hankel/data matrices, DeePC), enforces operational constraints in optimization, incorporates regularization for noise/limited data, and performs large-scale closed-loop comparison to baselines in a state-of-the-art microscopic simulator, though it stops short of real-world pilots and online adaptation studies. SampleTraining data consist of measured input–output traffic trajectories (inputs: signal actuation sequences; outputs: flows, speeds, queue lengths) used to construct Hankel/data matrices; validation is performed in a very large-scale, high-fidelity microscopic closed-loop simulation of the city of Zürich with realistic demand, network topology, and vehicle interactions, comparing DeePC-controlled runs to fixed-time and standard adaptive controllers on aggregate metrics (total travel time, CO2 emissions). Themesproductivity adoption innovation governance org_design IdentificationNo econometric causal identification; causal claims are evaluated via closed-loop simulation experiments where the DeePC controller is deployed in-the-loop on a high-fidelity microscopic simulator of Zürich and compared against baseline controllers (fixed-time and standard adaptive schemes), with counterfactual outcomes measured by running alternative controller policies on the same simulated demand realizations. GeneralizabilityResults are from simulation rather than field deployment—real-world sensor noise, failures, latency, and unmodeled behaviors may reduce effectiveness., Performance depends on representativeness and coverage of historical trajectories; cities with different network topologies, demand patterns, or signal conventions may not see similar gains., Structural changes (new infrastructure, major demand shocks, mode shifts) could violate the stationarity assumptions implicit in using historical trajectories unless online adaptation is implemented., Regulatory, safety, and deployment constraints in real traffic control (certification, fail-safes, vendor lock-in) are not fully captured in simulation., Computational and communication constraints for real-time city-scale deployment (optimization solve times, distributed control) may limit transferability.

Claims (17)

Claim	Direction	Confidence	Outcome	Details
A model-free traffic control approach (DeePC) can steer urban traffic via dynamic traffic-light control without building explicit traffic models. Organizational Efficiency	positive	high	ability to generate feasible control (traffic-light) actuation sequences and close the control loop without an explicit parametric traffic model	0.18
Applying DeePC yields measurable improvements in system-level outcomes (reduced total travel time and CO2 emissions) in a very large, high-fidelity microscopic simulation of Zürich. Consumer Welfare	positive	high	total travel time; CO2 emissions	0.18
Reframing urban traffic dynamics with behavioral systems theory allows system evolution to be learned and predicted directly from measured input–output data (no explicit model identification). Decision Quality	positive	high	predictive capability from measured I/O trajectories (ability to forecast future outputs given past I/O data and candidate inputs)	0.18
DeePC can compute traffic-light actuation sequences that respect hard operational and safety constraints (e.g., phasing, minimum/maximum green times). Regulatory Compliance	positive	high	constraint satisfaction / feasibility of computed actuation sequences	0.18
The DeePC-based approach avoids the expensive, time-consuming model-building step required by model-based control methods. Organizational Efficiency	positive	medium	need for explicit parametric model identification (development time/effort proxy)	0.11
DeePC handles sparse or limited traffic measurements better than many machine-learning methods. Decision Quality	positive	medium	controller performance (e.g., travel time, emissions) under sparse sensing / limited measurements	0.11
Regularization and the use of measured Hankel/data matrices make the method more robust to measurement noise and limited data. Decision Quality	positive	medium	robustness to measurement noise and limited data (performance degradation metrics)	0.11
The method was validated on a very large, high-fidelity microscopic closed-loop simulator of Zürich; the paper reports this as the largest such closed-loop urban-traffic simulation in the literature. Research Productivity	positive	medium	scale of validation (city-scale microscopic closed-loop simulation)	0.11
DeePC outperforms baseline controllers (e.g., fixed-time and standard adaptive schemes) in the simulated experiments. Organizational Efficiency	positive	medium	system-level outcomes (total travel time, CO2 emissions) compared across controllers	0.11
Real-world deployment will require representative data coverage and online adaptation despite the method’s robustness mechanisms. Adoption Rate	negative	high	data representativeness and need for online adaptation (deployment readiness/risk)	0.18
Reductions in travel time and CO2 emissions translate into measurable economic benefits (lower fuel consumption, productivity gains, reduced pollution-related health costs). Fiscal And Macroeconomic	positive	speculative	economic proxies: fuel consumption, travel-time value (productivity), pollution-related health costs	0.02
System-level improvements from the controller do not imply uniform spatial/temporal benefits—distributional effects may favor certain routes or neighborhoods. Inequality	mixed	medium	spatial/temporal distribution of travel-time changes across network links or neighborhoods	0.11
Improved throughput and lower travel costs can induce additional travel demand (rebound), partially offsetting congestion/emissions gains unless paired with demand-management measures. Consumer Welfare	mixed	speculative	net congestion and emissions accounting for possible induced travel demand	0.02
Adopting this approach shifts required skills and organizational roles away from lengthy parametric modeling toward data engineering, controller integration, and monitoring. Skill Acquisition	positive	medium	changes in required skills/organizational roles (qualitative workforce composition proxy)	0.11
Explicit enforcement of signal constraints in DeePC provides a safety/operational advantage over many pure learning approaches that do not explicitly enforce hard constraints. Regulatory Compliance	positive	high	explicit constraint satisfaction and operational safety of signal timings	0.18
There is a market opportunity for scalable 'control-as-a-service' offerings and curated urban traffic datasets enabled by this data-driven control approach. Firm Revenue	positive	speculative	commercialization potential / emergence of data-driven service offerings (qualitative market outcome)	0.02
The economic value of deploying DeePC-based controllers depends critically on representativeness of training data and the costs of online adaptation and safety verification. Firm Revenue	mixed	medium	net economic value after accounting for data collection, adaptation, and verification costs	0.11