Climate AI is being built on Northern compute and data, producing systematically weaker and potentially misleading outputs for vulnerable, data-sparse regions; public investment in shared data, distributed compute, and co-produced evaluation is needed to prevent costly maladaptation and entrenched informational inequality.

The Rise of AI in Weather and Climate Information and its Impact on Global Inequality

Amirpasha Mozaffari, Amanda Duarte, Lina Teckentrup, Stefano Materia, Gina E. C. Charnley, Lluis Palma, Eulalia Baulenas Serra, Dragana Bojovic, Paula Checchia, Aude Carreric, Francisco Doblas-Reyes · March 05, 2026

arxiv review_meta low evidence 7/10 relevance Source PDF

Concentration of compute, data, and model development in the Global North is producing biased climate-AI inputs, processes, and outputs that risk worse forecasts and misleading adaptation guidance for data-poor, climate-vulnerable regions unless investment shifts to equitable data and compute infrastructure and co-produced evaluation metrics.

The rapid adoption of AI in Earth system science promises unprecedented speed and fidelity in the generation of climate information. However, this technological prowess rests on a fragile and unequal foundation: the current trajectory of AI development risks further automating and amplifying the North-South divide in the global climate information system. We outline the global asymmetry in High-Performance Computing and data infrastructure, demonstrating that the development of foundation models is almost exclusively concentrated in the Global North. Using three different domains, we show how this infrastructure inequality continues through models' inputs, processes and outputs. As an example, in weather and climate modelling, the reliance on historically biased data leads to systematic performance gaps that disproportionately affect the most vulnerable regions. In climate impact modelling, data sparsity and unrepresentative validation risk driving misleading interventions and maladaptation. Finally, in large language models, dependence on dominant textualised forms of climate knowledge risks reinforcing existing biases. We conclude that addressing these disparities demands revisiting the three phases, i.e. models Input, Process and Output. This involves (i) a perspective shift from model-centric to data-centric development, (ii) the establishment of a Climate Digital Public Infrastructure and human-centric evaluation metrics, and (iii) a move from producer-consumer dynamics toward knowledge co-production. This integration of diverse knowledge systems would truly democratise compute sovereignty and ensure that the AI revolution fosters genuine systemic resilience rather than exacerbating inequity.

Summary

Main Finding

The paper shows that rapid AI adoption in Earth system science is intensifying an existing North–South divide. High-performance compute (HPC), data infrastructure, and foundation-model development are concentrated in the Global North, and this infrastructural asymmetry propagates through models’ inputs, processes, and outputs. As a result, AI-driven climate information risks producing systematically worse performance and misleading recommendations for the most climate-vulnerable, data-sparse regions unless development shifts from a model-centric to a data- and equity-centric approach.

Key Points

Global asymmetry in compute and data:
- Foundation-model development and HPC capacity are overwhelmingly located in the Global North.
- This concentration creates path dependence: model design, datasets, and validation reflect Northern priorities and contexts.
Three-domain evidence of unequal impacts:
Weather & climate modelling: Historical and spatially biased observational data lead to systematic performance gaps in under-observed tropical and low-income regions, reducing forecast fidelity where adaptive capacity is lowest.
Climate impact modelling: Sparse, unrepresentative exposure and vulnerability data and inadequate validation produce high uncertainty and risk of misleading interventions and maladaptation in vulnerable locales.
Large language models (LLMs): Reliance on dominant, textualized climate knowledge risks foregrounding Northern epistemologies and sidelining local or indigenous knowledge, reinforcing biases in climate narratives and recommendations.
Framework of failure points: Inequities appear across three phases:
- Inputs: Data scarcity, biased sampling, and limited local data governance.
- Process: Model design, training resources, and development priorities shaped by Northern compute and funding.
- Output: Evaluation metrics and dissemination channels that do not reflect local utility or values.
Proposed remedies:
- Shift from model-centric to data-centric development (improve data quality, representativeness, and governance).
- Establish Climate Digital Public Infrastructure (CDPI): shared, interoperable data and compute resources, standards, and governance to democratize access.
- Adopt human-centric evaluation metrics that measure value for local users and socio-ecological resilience.
- Move toward knowledge co-production: involve local stakeholders, integrate diverse knowledge systems, and redistribute “compute sovereignty.”

Data & Methods

Evidence base:
- Descriptive mapping of global HPC and foundation-model development concentration (documenting geography of compute and model authorship).
- Comparative, domain-specific case studies illustrating how data scarcity and bias affect model performance and downstream decisions in weather/climate modelling, impact modelling, and LLMs.
- Conceptual analysis tracing how inequalities propagate through Inputs → Process → Outputs.
Methodological approach:
- Cross-disciplinary synthesis combining infrastructure mapping, literature review, targeted examples of model failures/biases, and normative design principles.
- Emphasis on illustrative empirical patterns (performance gaps, validation shortfalls) rather than a single quantitative model or randomized experiment.
Limitations noted by the authors:
- Granular quantitative estimates of the economic cost of these asymmetries are not provided.
- Empirical generalization across all climate-AI systems is constrained by heterogeneous data availability and proprietary models.

Implications for AI Economics

Market structure and concentration:
- Concentrated compute and model development create market power for Northern institutions and companies, likely leading to unequal pricing, control over standards, and capture of high-value climate services.
- Returns to scale in compute and data favor incumbents; without intervention, this can entrench inequality in the global climate-information market.
Public-good nature and market failure:
- High-quality, equitable climate information has characteristics of a public good (nonrival, nonexcludable at scale). Private incentives alone will underprovide geographically representative data and shared infrastructure.
- Negative externalities include maladaptation and inefficient investments triggered by biased or unrepresentative AI outputs.
Distributional and welfare impacts:
- Economically vulnerable regions face larger adaptation costs and lost benefits from inaccurate forecasts/impact models; this has implications for global welfare and climate justice.
- Misleading outputs can induce costly maladaptation, increasing expected social costs of climate change.
Policy and funding implications:
- Justification for coordinated public investment: build Climate Digital Public Infrastructure (shared data repositories, distributed compute credits, standards, and training) akin to physical public goods (observational networks, satellites).
- Rethink aid and climate finance: allocate funds explicitly to data collection, compute capacity, and co-production processes, not only to physical adaptation projects.
- Regulatory and incentive tools: open-data mandates, compute-sharing incentives, conditionality in R&D funding to require equitable validation and local engagement.
Research and evaluation priorities for economists:
- Quantify the returns to investments in CDPI versus private compute; model distributional effects and welfare gains from more equitable climate information.
- Estimate the economic cost of maladaptation caused by biased AI outputs and the value of human-centric evaluation metrics.
- Design incentive-compatible mechanisms to promote data sharing and co-production (e.g., payment for data public goods, compute-voucher schemes, or international compute pools).
- Analyze labor-market and knowledge-economy effects: capacity building, on-shoring of data science talent, and effects on local industries.
Governance and international coordination:
- Need for multilateral governance frameworks to support compute sovereignty, data standards, and equitable access—similar to global public-health or geospatial-data collaborations.
- Consider trade-offs between intellectual property, commercial incentives for innovation, and the social value of open shared infrastructures.

Overall, the paper highlights that without deliberate economic and governance interventions—investment in shared digital public goods, new evaluation metrics, and co-production practices—the AI revolution in climate science risks amplifying global inequality and producing economically costly misallocations. Economists can help by quantifying costs/benefits, designing funding and incentive mechanisms, and evaluating policy options to democratize climate-AI infrastructure.

Assessment

Paper Typereview_meta Evidence Strengthlow — The paper synthesizes descriptive mapping, literature, and illustrative case studies rather than providing systematic causal estimates; evidence is persuasive and multi-domain but largely descriptive, with limited quantitative validation and reliance on proprietary or unevenly available data. Methods Rigormedium — Cross-disciplinary synthesis combines infrastructure mapping, targeted case studies, and conceptual tracing, which is rigorous for a review; however, methods are not standardized or pre-registered, selection of examples is illustrative rather than systematic, and there are no counterfactual or econometric identification strategies. SampleData consist of descriptive maps of global high-performance compute and foundation-model development (geography of compute, authorship), a literature review, and targeted domain-specific case studies/examples from weather & climate modelling, climate impact modelling, and large language models; no single representative dataset or randomized experiment, and some evidence draws on proprietary or unevenly available sources. Themesinequality governance adoption productivity human_ai_collab GeneralizabilityFindings are based on illustrative case studies and descriptive mapping rather than representative sampling, risking selection bias, Proprietary models and opaque compute footprints limit completeness and replicability, Heterogeneity across climate-AI systems and regions means impacts may differ locally (not all 'Global South' contexts are the same), Rapidly changing compute and model diffusion could alter the geographic concentration over time, Economic cost estimates and quantified welfare impacts are not provided, limiting precise policy transfers

Claims (17)

Claim	Direction	Confidence	Outcome	Details
Foundation-model development and high-performance computing (HPC) capacity are overwhelmingly located in the Global North. Market Structure	negative	high	Geographic distribution of HPC capacity and foundation-model development (location of compute resources and model authorship)	0.12
The geographic concentration of compute and model development creates path dependence: model design, training datasets, and validation reflect Northern priorities and contexts. Inequality	negative	medium	Degree of alignment between model design/validation choices and Northern (vs. local) priorities and contexts	0.07
In weather and climate modelling, historically and spatially biased observational data produce systematic performance gaps in under-observed tropical and low-income regions, reducing forecast fidelity where adaptive capacity is lowest. Output Quality	negative	medium	Forecast fidelity/accuracy in under-observed tropical and low-income regions (model performance metrics)	0.07
In climate impact modelling, sparse and unrepresentative exposure and vulnerability data combined with inadequate validation generate high uncertainty and risk of misleading interventions and maladaptation in vulnerable locales. Decision Quality	negative	medium	Uncertainty in impact estimates and likelihood of misleading policy/intervention recommendations (validation quality; uncertainty measures)	0.07
Large language models (LLMs) that rely on dominant, textualized climate knowledge tend to foreground Northern epistemologies and marginalize local or indigenous knowledge, reinforcing biases in climate narratives and recommendations. Ai Safety And Ethics	negative	medium	Representation of local/indigenous knowledge in LLM outputs and bias in generated climate narratives/recommendations	0.07
Inequities in climate-AI systems appear across three development phases—Inputs, Process, and Outputs—creating multiple failure points where Global North advantages propagate into final products. Inequality	negative	high	Presence of inequities at each phase of the AI development lifecycle (data availability, development resources, evaluation/dissemination fit)	0.12
Rapid AI adoption without a shift from model-centric to data- and equity-centric development risks producing systematically worse performance and misleading recommendations for the most climate-vulnerable, data-sparse regions. Output Quality	negative	medium	Model performance and recommendation quality in climate-vulnerable, data-sparse regions	0.07
Shifting from a model-centric to a data-centric approach (improving data quality, representativeness, and governance) will mitigate the harms caused by current infrastructural asymmetries. Output Quality	positive	low	Improvements in data representativeness, model performance, and equity of outputs after implementing data-centric practices	0.04
Establishing Climate Digital Public Infrastructure (CDPI)—shared, interoperable data and compute resources, standards, and governance—can democratize access and reduce inequities in climate-AI. Adoption Rate	positive	low	Access to compute/data, interoperability, and distributional equity in climate-AI services	0.04
High-quality, equitable climate information displays public-good characteristics (nonrival, nonexcludable at scale), so private incentives alone will underprovide geographically representative data and shared infrastructure. Market Structure	negative	medium-high	Level of provision of geographically representative data/shared infrastructure under private vs. public provision	0.01
Concentration of compute and model development creates market power for Northern institutions and companies, likely leading to unequal pricing, control over standards, and capture of high-value climate services. Market Structure	negative	medium	Market power indicators (pricing, standard-setting control, market share in climate services)	0.07
Returns to scale in compute and data favor incumbents; without intervention this dynamic can entrench inequality in the global climate-information market. Market Structure	negative	medium	Degree to which compute/data scale advantages increase incumbents' market share and entrench inequality	0.07
Biased or unrepresentative AI outputs produce negative externalities, including maladaptation and inefficient investments in vulnerable regions. Fiscal And Macroeconomic	negative	medium	Incidence of maladaptation and associated economic inefficiencies attributable to biased AI outputs	0.07
The paper does not provide granular quantitative estimates of the economic cost of infrastructural asymmetries in climate-AI. Research Productivity	null_result	high	Absence of quantified economic cost estimates in the paper	0.12
Empirical generalization across all climate-AI systems is constrained by heterogeneous data availability and proprietary models, limiting the ability to produce universal quantitative claims. Research Productivity	null_result	high	Extent of empirical generalizability across climate-AI systems	0.12
Economists should prioritize research to quantify returns to investments in CDPI versus private compute, estimate economic costs of maladaptation from biased AI outputs, and design incentive-compatible mechanisms for data sharing and co-production. Research Productivity	positive	low	Feasibility and quantified returns of policy/research interventions (e.g., CDPI investments, incentive mechanisms)	0.04
Policy instruments such as open-data mandates, compute-sharing incentives, and conditionality in R&D funding can help ensure equitable validation and local engagement in climate-AI development. Governance And Regulation	positive	low	Adoption of policy instruments and subsequent changes in equity of validation practices and local engagement	0.04