A nationwide LLM-curated dataset finds that Spanish firms' public reporting of AI use varies sharply by region, industry and size, covering 112,814 firms across 2023 and 2025. The resource classifies whether AI is used internally or embedded in offerings and provides a reproducible baseline for studying diffusion and economic impacts.

AI adoption in Spain (2023–2025): A web-derived dataset based on LLMs

Ana Pastor-Merino, Xavier Martínez-Barbero, J. Domenech · Fetched March 26, 2026 · Data in Brief

semantic_scholar descriptive medium evidence 8/10 relevance Summary only summary available; pdf_status=pending DOI Source

The paper provides a reproducible, LLM-powered nationwide dataset mapping public evidence of AI adoption across 112,814 Spanish firms (225,628 firm-year observations) in 2023 and 2025, documenting regional, sectoral and size-related patterns and distinguishing internal use from product embedding.

This article introduces a nationwide dataset that maps how 112,814 Spanish firms communicate and implement artificial intelligence (AI) on their corporate websites in 2023 and 2025, resulting in 225,628 firm-year observations. Using a systemic pipeline based on large language models (LLMs), website text is segmented, semantically filtered, and evaluated with a structured rubric to identify explicit evidence of AI use in internal processes and in products or services. The dataset offers a detailed portrait of AI adoption across regions (NUTS 3), industries, and firm size categories. For each province–sector–size combination, it reports whether firms adopt AI, whether they apply it internally, whether it is embedded in their offerings, and how many firms have valid website content. This multi-dimensional structure enables users to explore territorial patterns, sectoral differences, and size-related disparities in the uptake of AI. By providing indicators for two benchmark years, the dataset supports the study of how AI adoption evolves across the Spanish business landscape. It offers a reproducible and scalable foundation for research on technological diffusion, regional digitalisation, and industry-level transformation, and can be readily extended to future years or adapted to other countries.

Summary

Main Finding

The paper provides a reproducible, scalable nationwide dataset that maps explicit, website-disclosed AI adoption for 112,814 Spanish firms in 2023 and 2025 (225,628 firm‑year observations). Using an LLM-based pipeline, it classifies whether firms communicate AI use at all, whether AI is applied internally, and whether AI is embedded in products or services, and aggregates these indicators across provinces (NUTS3), industries, and firm-size categories.

Key Points

Coverage: 112,814 distinct Spanish firms, two benchmark years (2023 and 2025), producing 225,628 firm‑year observations.
Indicators: for each firm-year the dataset records (a) presence of AI claims on the website, (b) explicit evidence of AI use in internal processes, (c) explicit evidence of AI embedded in products/services, and (d) whether website content was valid for evaluation.
Multi-dimensional aggregation: indicators are reported at province–sector–size cell level (NUTS3 × industry × firm-size), enabling territorial, sectoral and size-based comparisons.
Method: automated pipeline based on large language models that segments website text, semantically filters relevant content, and applies a structured rubric to identify explicit AI evidence.
Temporal comparability: two benchmark years allow analysis of changes in communicated/adopted AI between 2023 and 2025.
Reproducibility and scalability: pipeline design enables extension to future years or adaptation to other countries/languages.

Data & Methods

Data source: firm corporate websites scraped for text content in two snapshots (2023 and 2025).
Sample construction: 112,814 firms sampled and matched to website content; firm‑year observations total 225,628.
Processing pipeline:
- Text segmentation of web pages into coherent chunks.
- Semantic filtering to surface AI-related candidate passages.
- LLM-assisted evaluation using a structured rubric that flags explicit mentions/evidence of AI in (i) internal processes and (ii) products/services.
Output variables:
- Binary flags per firm-year: AI mentioned (yes/no), AI used internally (yes/no), AI embedded in offerings (yes/no), and content-validity indicator.
- Aggregated counts and shares by NUTS3 region, industry, and firm-size cell, including the number of valid website observations per cell.
Quality/limitations (implicit in method):
- Measures capture explicit, disclosed AI use on public websites — silent/adoptive uses not mentioned online will be missed.
- Dependent on website coverage and quality; potential selection bias toward firms that maintain informative websites.
- LLM-based classification introduces risk of false positives/negatives; reproducibility mitigates but does not eliminate model-driven error.
- Currently language- and web-text–based; non-public AI use (internal tools, proprietary systems) and multimodal evidence (e.g., demos, PDFs, code repositories) may be under‑captured.

Implications for AI Economics

Measurement advance: provides a fine-grained, reproducible indicator of firms’ communicated AI adoption that complements administrative, patent, and survey measures.
Spatial and sectoral diffusion: enables analysis of how AI adoption varies across regions (NUTS3), industries, and firm size—useful for mapping hotspots, digital divides, and localized spillovers.
Temporal dynamics: two-year benchmarks support preliminary assessments of adoption trajectories and short-run diffusion patterns; pipeline can generate further time points for dynamic studies.
Linking to outcomes: dataset can be merged with firm-level administrative data (employment, productivity, trade, R&D) to study causal relationships between AI adoption and economic outcomes, conditional on disclosure biases.
Policy targeting: informs regional and sector-specific digitalisation policies (training, infrastructure, subsidies) by identifying low-adoption cells and potential capacity gaps among small/medium firms.
Methodological template: demonstrates scalable LLM-based measurement that can be adapted cross-country, facilitating comparative studies of AI diffusion and policy effectiveness.
Research cautions: because the dataset measures publicly disclosed AI, researchers should account for disclosure bias when using it as a proxy for true adoption (e.g., via validation with surveys, admin data, or robustness checks).

Assessment

Paper Typedescriptive Evidence Strengthmedium — The dataset provides broad, systematic coverage (112,814 firms, 225,628 firm-year observations) and a reproducible LLM-driven coding rubric, which makes the indicators informative for observed public disclosure of AI; however, website mentions are an imperfect proxy for true adoption (non-disclosure, strategic boasting, or omission), and measurement error from LLM classification and scraping is plausible. Methods Rigormedium — The pipeline appears well-structured (segmentation, semantic filtering, rubric) and is scalable and reproducible, and the sample is large and stratified by region, industry and size; nevertheless, potential issues include LLM misclassification, rubric subjectivity, coverage bias toward firms with websites, language/SEO artifacts, and limited information here about validation against ground truth. SampleAll Spanish firms with accessible corporate websites in 2023 and 2025 yielding 112,814 unique firms and 225,628 firm-year observations; data are coded at province (NUTS3) × sector × firm-size levels and include indicators for (a) any AI adoption, (b) internal process use, (c) AI embedded in products/services, and (d) counts of firms with valid website content per cell. Themesadoption innovation IdentificationNot a causal study; identification of 'AI adoption' is achieved by an LLM-based text pipeline that scrapes corporate websites, segments site text, applies semantic filtering, and uses a structured rubric to code explicit evidence that a firm uses AI internally or embeds AI in products/services. GeneralizabilityRestricted to Spanish firms and to two benchmark years (2023, 2025); not directly representative of other countries or periods, Covers only firms that have public websites and sufficiently detailed web text; excludes firms without web presence or with non-textual disclosures, May undercount 'hidden' AI adoption (internal use not disclosed) and overcount firms that publicize AI strategically without substantive use, Likely biased toward larger, digitally visible firms and sectors with greater online marketing, Potential misclassification from LLMs, language variability, and rubric interpretation limits cross-context comparability

Claims (9)

Claim	Direction	Outcome	Confidence & Evidence	Details
The paper introduces a nationwide dataset that maps how 112,814 Spanish firms communicate and implement artificial intelligence (AI) on their corporate websites in 2023 and 2025. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength high	n=112814 0.3
The dataset results in 225,628 firm-year observations. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength high	n=225628 0.3
The paper uses a systemic pipeline based on large language models (LLMs) to segment website text, semantically filter it, and evaluate it with a structured rubric. Other	positive	other	Reading fidelity high Study strength medium	not reported 0.18
The pipeline identifies explicit evidence of AI use both in firms' internal processes and embedded in their products or services. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength medium	not reported 0.18
The dataset offers a detailed portrait of AI adoption across regions (NUTS 3), industries, and firm size categories. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength high	not reported 0.3
For each province–sector–size combination, the dataset reports whether firms adopt AI, whether they apply it internally, whether it is embedded in their offerings, and how many firms have valid website content. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength high	not reported 0.3
This multi-dimensional structure enables users to explore territorial patterns, sectoral differences, and size-related disparities in the uptake of AI. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength medium	not reported 0.18
By providing indicators for two benchmark years, the dataset supports the study of how AI adoption evolves across the Spanish business landscape. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength medium	not reported 0.18
The dataset provides a reproducible and scalable foundation for research on technological diffusion, regional digitalisation, and industry-level transformation, and can be readily extended to future years or adapted to other countries. Adoption Rate	positive	adoption_rate	Reading fidelity high Study strength speculative	not reported 0.03