Generative AI is concentrating in Beijing’s urban core and has stalled wage growth there despite continued inflows of skilled workers, creating a ‘high‑skill trap’; a difference‑in‑differences analysis around ChatGPT’s release suggests the impact is causal.

Generative AI impacts on intra-urban inequality and skill premium in Beijing

Xiliu He, Haoxiang Zhao, Mingyi Ma, Edward Wen Chuan Lai, Koei Enomoto, Anni Hu, Jiatong Li, Lingyun Chu, Yuan Lai · May 25, 2026

arxiv quasi_experimental medium evidence 8/10 relevance Source PDF

Using 5 million Beijing job postings and a neighborhood-level GenAI Exposure Index from five LLMs, the paper finds that GenAI exposure is concentrated in core districts and—since ChatGPT's release—high-exposure neighborhoods experienced wage stagnation despite continued inflows of high-skilled workers, driven by task de-skilling and labor-market crowding, with a DID design lending causal support.

Generative artificial intelligence (GenAI) is the first automation wave to reach high-cognitive tasks at scale, yet its effects on intra-urban inequality remain largely unknown. Using 5 million job postings from Beijing (2018--2024), we construct a neighborhood-level GenAI Exposure Index by aggregating task-level assessments from five leading large language models. We examine the spatial, structural and causal mechanisms of this shock. We find that GenAI exposure is highly concentrated in the city's core districts, deepening the intra-urban AI divide. Since 2023, high-exposure neighborhoods have experienced wage stagnation even as they continue to attract high-skilled workers -- a "high-skill trap." This wage penalty is driven by task de-skilling and intensified labor-market crowding. A difference-in-differences design centered on ChatGPT's release supports a causal interpretation. These findings challenge the prevailing theory of skill-biased technological change and provide a basis for inclusive AI governance in global technology hubs.

Summary

Main Finding

Generative AI exposure in Beijing is spatially concentrated in the city’s high-tech cores and, after ChatGPT’s public release, produced a “high-skill trap”: neighborhoods with higher pre-existing GenAI exposure attracted more highly educated workers but experienced stagnant or falling wages. The wage penalty is causally associated with GenAI exposure and appears driven by task de-skilling (substitution) and labor-market crowding (oversupply).

Key Points

Data scope: 4,995,615 online job postings in Beijing (2018–2024), matched to neighborhood geography and other geospatial covariates.
Exposure measure: a neighborhood-level GenAI Exposure Index constructed by (1) mapping job-posting tasks to standard occupational tasks via retrieval-augmented generation (RAG) and (2) aggregating task-level assessments from five state-of-the-art LLMs to reduce model bias.
Spatial pattern: GenAI exposure is strongly concentrated in core “golden triangle” clusters (Zhongguancun, Financial Street, Guomao CBD); low-exposure neighborhoods persist on the periphery. LISA spatial autocorrelation shows persistent high–high clusters in the core and expanding low–low clusters in the periphery.
Empirical pattern: High-exposure neighborhoods show rising shares of highly educated workers but wage declines after 2022. Example: average monthly wages in high-exposure group fell after peaking in 2021 and reached ~13,673 CNY/month post-ChatGPT.
Causal evidence: Using 2018 GenAI exposure as pre-determined treatment and exploiting ChatGPT’s release (Nov 2022) in a difference-in-differences (DID) framework (analysis window 2020–2024, 2022 baseline), the authors find:
- Event-study: β2023 = −0.193 (p = 0.033), implying ~17.5% decline in wages for a 1 SD increase in pre-determined exposure in the first year post-shock; partial recovery in 2024 (β2024 = −0.103, p = 0.119).
- Baseline DID: interaction GenAI2018 × Post = −0.140 (p < 0.05); with concurrent-shock controls = −0.151 (p < 0.05) ≈ a 13.1% wage decline.
- Permutation (randomization) inference: permutation p = 0.004 (500 permutations), supporting that the effect is unlikely due to chance.
- Bartik shift-share IV (national AI job growth × local industry composition) gives a consistent reduced-form effect (−0.436 log points, p < 0.001) and a strong first-stage (F = 53.5), though the authors advise caution because the Bartik event-study fails a parallel-trends test.
Mechanisms:
- De-skilling: interaction between GenAI exposure and worker education is negative and large (interaction ≈ −1.286), indicating diminishing or negative returns to education in high-exposure settings—GenAI substitutes for some high-cognitive tasks.
- Crowding: interaction between GenAI exposure and a job-market-heat (competition) measure is negative and significant (≈ −1.619), showing intensified competition amplifies downward pressure on wages.
Robustness: controls for concurrent shocks (tech-sector regulation, COVID recovery, real-estate downturn), neighborhood and time fixed effects, clustered SEs, event-study pre-trend tests.

Data & Methods

Data: ~5 million job postings (2018–2024) for Beijing; neighborhood-level aggregation; supplementary geospatial covariates (population, nightlights, NDVI, POI density, land use).
Exposure construction:
- Retrieval-augmented generation (RAG) to map job-posting text to standardized occupational tasks.
- Five SOTA LLMs used to score task-level GenAI substitutability/capability; aggregated into a neighborhood GenAI Exposure Index.
Identification strategy:
- Pre-determined treatment: use 2018 exposure (pre-dating ChatGPT) to avoid reverse causality.
- DID: interact GenAI2018 with a Post indicator (2023–2024) in models with entity and time fixed effects; 2022 is the baseline year.
- Event-study: dynamic coefficients to test parallel trends (pre-treatment coefficients jointly indistinguishable from zero).
- Robustness: add controls interacting Post with measures of concurrent shocks; randomization inference (permutation); Bartik shift-share IV (national AI job growth × 2018 local industry shares).
Key statistics reported:
- Sample: 1,383 neighborhoods, 6,895 neighborhood-year observations (2020–2024).
- DID coefficients: −0.140 (baseline), −0.151 (with confounder controls); event-study β2023 = −0.193 (p = 0.033).
- Permutation p-value: 0.004.
- Bartik IV reduced-form: −0.436 (p < 0.001); first-stage F = 53.5.

Implications for AI Economics

Challenges classical SBTC: Rather than uniformly increasing the skill premium, GenAI can compress wages at the top by reducing the scarcity of high-cognitive tasks—introducing a “high-skill trap” where skill supply rises but returns to skill weaken.
Spatial dimension matters: AI’s impacts are uneven within cities—tech cores capture exposure and skilled inflows but also concentrated wage risks. Urban inequality and intra-city policy should account for this spatial lock-in.
Dual mechanisms: Policy responses must address both substitution (task de-skilling) and demand–supply mismatches (crowding). Upskilling alone may be insufficient if GenAI continues to substitute core high-cognitive tasks.
Policy prescriptions suggested by results:
- Targeted support in tech cores: wage insurance, portable benefits, and retraining focused on AI-complementary skills (meta-skills, coordination, supervision, creativity beyond current GenAI capabilities).
- Promote diffusion and absorptive capacity in peripheries: invest in digital infrastructure, local institutions, and industry diversification to enable selective spillovers rather than reinforce core–periphery divides.
- Labor-market monitoring: track occupational competition and wage trends at fine spatial scales to detect crowding effects early.
- Consider employment-side interventions (hiring subsidies, public-sector demand for AI-complementary roles) to absorb skilled labor displaced from compressed task markets.
Research implications: need for multi-city and cross-country comparable studies, worker-level longitudinal panels to follow career transitions, firm-level productivity analyses, and randomized/policy experiments to test interventions that can mitigate high-skill traps.

Limitations noted by authors (brief) - Single-city focus (Beijing)—results may differ in other institutional contexts. - Job-posting data reflect vacancies and posted wages, not necessarily realized wages or full employment outcomes. - Exposure scoring depends on LLM assessments and task mappings, which may evolve as models improve. - Bartik IV supportive but imperfect (parallel-trends concerns), so causal magnitudes should be interpreted with caution.

Overall, the paper provides high-resolution evidence that GenAI can deepen intra-urban inequality by locking exposure into city cores and producing a paradoxical combination of up-skilling in supply and wage compression—an outcome with important theory and policy implications for AI-era labor markets.

Assessment

Paper Typequasi_experimental Evidence Strengthmedium — The study leverages a large administrative-style dataset and a plausible temporal shock (ChatGPT release) with a DID design and a multi-LLM exposure measure, which supports causal claims; however, key threats remain (measurement error from LLM-derived exposure, job-postings as an imperfect proxy for realized employment and wages, potential violations of parallel trends and local spillovers, and the short post-treatment window), lowering confidence in strong causal inference. Methods Rigormedium — High rigor in data scale (5 million postings), multi-model aggregation to reduce single-LLM bias, and neighborhood-level analysis; but the approach depends on assumptions (parallel trends, no differential pre-trends, correct mapping from tasks to exposure), and the summary does not report robustness checks/ placebo tests, heterogeneous parallel-trend tests, or alternative wage/ employment outcome validations that would be needed to rate methods as high. SampleApproximately 5 million online job postings from Beijing spanning 2018–2024, aggregated to neighborhood (intra-urban) level; outcomes include posted wages and occupation/skill composition; GenAI exposure is computed by mapping tasks in each posting to automation susceptibility scores produced by five leading LLMs and aggregating to neighborhoods. Themesinequality labor_markets adoption IdentificationDifference-in-differences comparing neighborhood-level outcomes before and after the public release of ChatGPT (2022/2023), using a GenAI Exposure Index constructed by aggregating task-level automation susceptibility scores from five large language models applied to 5 million Beijing job postings (2018–2024); high- versus low-exposure neighborhoods serve as treatment and control groups. GeneralizabilitySingle-city study (Beijing) — results may not generalize to other cities, countries, or institutional contexts, Job postings data may not reflect actual hires, realized wages, informal employment, or full labor-market dynamics, LLM-derived task exposure may mismeasure real-world automation risk and may be sensitive to the selection of models and prompt design, Short post-treatment window (since 2023) limits inference about long-run effects and equilibrium adjustments, Potential policy, regulatory, or cultural differences (Chinese labor market and tech ecosystem) limit transferability to other national contexts

Claims (8)

Claim	Direction	Confidence	Outcome	Details
GenAI exposure is highly concentrated in the city's core districts, deepening the intra-urban AI divide. Inequality	negative	high	GenAI exposure concentration across neighborhoods / intra-urban AI divide	n=5000000 0.48
Since 2023, high-exposure neighborhoods have experienced wage stagnation even as they continue to attract high-skilled workers (a 'high-skill trap'). Wages	negative	high	wage levels / wage growth (stagnation)	n=5000000 0.48
The observed wage penalty in high-exposure neighborhoods is driven by task de-skilling and intensified labor-market crowding. Wages	negative	medium	wage penalty and its mechanisms (task de-skilling, labor-market crowding)	n=5000000 0.29
A difference-in-differences design centered on ChatGPT's release supports a causal interpretation of GenAI's local labor-market effects. Wages	positive	high	causal effect of GenAI exposure on neighborhood-level labor-market outcomes (e.g., wages)	n=5000000 0.48
These findings challenge the prevailing theory of skill-biased technological change. Skill Obsolescence	negative	high	validity of skill-biased technological change predictions (skill premium dynamics)	n=5000000 0.08
We construct a neighborhood-level GenAI Exposure Index by aggregating task-level assessments from five leading large language models. Adoption Rate	null_result	high	GenAI Exposure Index (measurement / adoption proxy)	n=5000000 0.24
The study uses 5 million job postings from Beijing covering 2018--2024 as its primary data source. Other	null_result	high	dataset size and temporal coverage	n=5000000 0.8
Generative AI (GenAI) is the first automation wave to reach high-cognitive tasks at scale. Other	null_result	medium	scope of automation reach (high-cognitive task automation at scale)	0.05