Evidence (3492 claims)
Adoption
7395 claims
Productivity
6507 claims
Governance
5877 claims
Human-AI Collaboration
5157 claims
Innovation
3492 claims
Org Design
3470 claims
Labor Markets
3224 claims
Skills & Training
2608 claims
Inequality
1835 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 609 | 159 | 77 | 736 | 1615 |
| Governance & Regulation | 664 | 329 | 160 | 99 | 1273 |
| Organizational Efficiency | 624 | 143 | 105 | 70 | 949 |
| Technology Adoption Rate | 502 | 176 | 98 | 78 | 861 |
| Research Productivity | 348 | 109 | 48 | 322 | 836 |
| Output Quality | 391 | 120 | 44 | 40 | 595 |
| Firm Productivity | 385 | 46 | 85 | 17 | 539 |
| Decision Quality | 275 | 143 | 62 | 34 | 521 |
| AI Safety & Ethics | 183 | 241 | 59 | 30 | 517 |
| Market Structure | 152 | 154 | 109 | 20 | 440 |
| Task Allocation | 158 | 50 | 56 | 26 | 295 |
| Innovation Output | 178 | 23 | 38 | 17 | 257 |
| Skill Acquisition | 137 | 52 | 50 | 13 | 252 |
| Fiscal & Macroeconomic | 120 | 64 | 38 | 23 | 252 |
| Employment Level | 93 | 46 | 96 | 12 | 249 |
| Firm Revenue | 130 | 43 | 26 | 3 | 202 |
| Consumer Welfare | 99 | 51 | 40 | 11 | 201 |
| Inequality Measures | 36 | 105 | 40 | 6 | 187 |
| Task Completion Time | 134 | 18 | 6 | 5 | 163 |
| Worker Satisfaction | 79 | 54 | 16 | 11 | 160 |
| Error Rate | 64 | 78 | 8 | 1 | 151 |
| Regulatory Compliance | 69 | 64 | 14 | 3 | 150 |
| Training Effectiveness | 81 | 15 | 13 | 18 | 129 |
| Wages & Compensation | 70 | 25 | 22 | 6 | 123 |
| Team Performance | 74 | 16 | 21 | 9 | 121 |
| Automation Exposure | 41 | 48 | 19 | 9 | 120 |
| Job Displacement | 11 | 71 | 16 | 1 | 99 |
| Developer Productivity | 71 | 14 | 9 | 3 | 98 |
| Hiring & Recruitment | 49 | 7 | 8 | 3 | 67 |
| Social Protection | 26 | 14 | 8 | 2 | 50 |
| Creative Output | 26 | 14 | 6 | 2 | 49 |
| Skill Obsolescence | 5 | 37 | 5 | 1 | 48 |
| Labor Share of Income | 12 | 13 | 12 | — | 37 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Innovation
Remove filter
Realising DT value requires upfront investment in sensors, integration, standards, and skills; economic viability depends on contract structures and how gains are allocated between investors, owners, contractors, and operators.
Synthesis of cost/benefit discussions and case descriptions in the reviewed literature; policy and procurement examples referenced.
HCI has explored usable consent, but there is no systematic framework for consent in the AI era.
Literature synthesis and gap identification from workshop participants and solicited position papers; no systematic review or meta-analysis with counted studies reported in the summary.
Research priorities include causal studies on productivity gains from AI, firm‑level adoption dynamics, sectoral labor reallocation, long‑run general equilibrium effects, and heterogeneous impacts across regions and demographic groups.
Set of empirical research recommendations drawn from gaps identified in the literature review and limitations section; not an empirical claim but a prioritized research agenda based on secondary evidence.
Growth‑accounting frameworks and measurement approaches must be updated to capture AI/robotics as intangible and embodied capital, including quality improvements and spillovers.
Methodological argument grounded in literature on measurement challenges and examples of intangible capital; no new measurement exercise or empirical re‑estimation is provided in the paper.
Recommendation for research and modeling: economic models of AI markets should incorporate institutional regime types (centralized vs decentralized), enforcement uncertainty, and legitimacy effects as parameters affecting data access costs, R&D productivity, and market concentration.
Normative recommendation based on the comparative typology and inferred mechanisms from the document analysis; not empirically validated within the study.
Theoretical contribution: the paper extends modular coordination theory by treating openness–security trade‑offs as layered, adaptive institutional processes embedded in political regimes and 'legitimacy economies.'
Argumentative/theoretical development in the paper grounded in document analysis and literature on coordination and legitimacy.
The shift toward solo entry is particularly pronounced in categories that historically favored team-based ventures.
Category-level breakdowns within the Product Hunt dataset showing larger increases in solo-founder launches in categories with a historical bias toward team-based ventures.
For lenders and investors, wider VTech adoption can enhance valuation accuracy, portfolio transparency and collateral risk assessment, strengthening confidence in property markets and capital allocation.
Interpretation and implications drawn from interview data and theoretical synthesis; no quantitative measurement reported in the study.
Based on the findings, firms should invest in proprietary AI models and governments should promote open data initiatives.
Policy recommendations presented in the conclusion, motivated by empirical findings (inverted-U, homogenization trap, heterogeneity).
Smart manufacturing provides a practical pathway for enhancing economic performance while reducing environmental impact.
Framing/theoretical claim in the paper's introduction motivating the study; supported by cited literature rather than the paper's primary empirical DiD test.
Improvements in firms' resource allocation efficiency enhance their ability to adopt smart manufacturing technologies (mechanism).
Mechanism analysis within the study showing that gains in resource allocation efficiency at the firm level are associated with higher adoption of smart manufacturing after LCCP implementation.
City-level human capital upgrading lowers firms' costs of adopting smart manufacturing technologies, facilitating adoption (mechanism).
Mechanism analysis reported in the paper linking city-level human capital improvements to reduced firm-level adoption costs and increased adoption; likely based on city-level measures of human capital interacting with treatment in the DiD framework.
Generation-protocol variants show that crowding can be reduced through targeted design, making diversity collapse an actionable, development-time evaluation target for population-aware creative AI.
Experimental evidence in the paper demonstrating that modifying generation protocols (design choices) reduces crowding; abstract states results across protocol variants but does not provide quantitative effect sizes or sample counts.
Estimates stabilize with feasible model-only sample sizes.
Empirical/stability analysis reported in the paper (abstract claims convergence/stabilization of estimates with feasible numbers of model-only samples), but the abstract does not quantify what 'feasible' means or give sample counts.
Resource-based environmental taxation (the water resource tax reform) can play a role in promoting food security under rigid water constraints.
Interpretation and policy discussion based on the empirical results showing increased grain yield following the reform.
The reform improves water-use efficiency (a channel through which it raises agricultural productivity).
Mechanism analysis in the paper indicating strengthened water-use efficiency following the reform.
A DLM (Schema-1) eliminates the preprocessing pipelines that currently stand between raw tabular data and AI systems that consume it.
Claims based on model's native consumption of raw cell values and experimental demonstrations (design and reported evaluations suggest reduced need for preprocessing; specific operational workflow impacts not quantified in the abstract).
Schema-1 identifies the industry sector of any unseen dataset from raw cell values alone, reliably across any domain—a task no prior tabular model can perform.
Reported experiments demonstrating industry-sector identification from raw cell values on unseen datasets and cross-domain reliability (details of datasets, number of domains, and metrics not provided in the abstract).
Production agentic systems make many model calls per user request, and most of those calls are short, structured, and routine.
Contextual claim motivating the work; presented as an empirical generalization about production agent pipelines, but not quantified in the abstract.
Small and mid-sized open-weight models are already sufficient for much of the short-horizon, structured tool use work that dominates real agent pipelines.
Aggregate benchmark results across AgentFloor tiers showing high performance of smaller and mid-sized open-weight models on short-horizon structured tasks; supported by the 16,542 scored runs and model comparisons reported in the paper.
To our knowledge, this is the first demonstration of an AI agentic system autonomously identifying and experimentally validating a nontrivial, previously unreported physical mechanism.
Authors' novelty claim, supported by the reported autonomous proposal and experimental validation of the optical bilinear interaction in their study.
Qiushi Engine converts an abstract coherence-order theory into experimental observables, providing the first observation of this class of coherence-order structure.
Reported experimental procedure translating coherence-order theory into measurable observables and claiming the first observation of that class of structure; experimental data and analysis presented in the paper supporting the observation.
These simulations produce rich experiential learning signals, whose effectiveness is validated by significant improvements in agent performance on both in-domain and out-of-domain productivity evaluations.
Evaluation experiments reported in the paper claiming statistically/qualitatively significant improvements in agent performance on in-domain and out-of-domain productivity benchmarks after training on simulation-generated signals.
Claude 3.5 Sonnet aligns with a narrative funder profile, showing greater responsiveness to qualitative aspects of the pitch, somewhat higher funding levels, and strong cross-run reliability.
Comparative observations across the experiment: Claude 3.5 Sonnet was more responsive to qualitative information in pitch decks, tended to recommend higher funding levels, and demonstrated strong reliability across runs.
Agentic Architect is the first end-to-end open-source framework for agentic AI architecture exploration and optimization.
Authors' claim in the abstract and paper asserting novelty and open-source release. No independent verification provided in the abstract.
The central obstacle to agent self-improvement is not what to remember but how to use what has been remembered (which retrieval policy to apply, how to interpret prior outcomes, and when the current strategy itself must change).
Conceptual claim supported by authors' argumentation and by the experimental results (ablation showing gains from reflection/use mechanisms rather than added architectural complexity).
Differences in institutional quality, digital infrastructure, and absorptive capacity explain the disparity in technology impacts between GCC and non-GCC countries.
Exploratory/mediation or interaction analysis linking institutional quality, measures of digital infrastructure, and absorptive capacity to heterogeneity in estimated technology effects across countries in the panel.
The capital market evaluates AI investment as a future 'growth option' selectively in industries with strong data infrastructure, digital workforce readiness, and absorptive capacity.
Inference from heterogeneous positive Tobin's Q effect found in the ICT industry and null average effect across all firms; authors argue market valuation responds to industry-specific complementary assets and ecosystem conditions.
Developing and further developed countries only integrate with China, signaling China's expanding influence over the international AI research landscape.
Observed integration patterns in the publication-based collaboration and citation networks showing that (some) developing and further developed countries connect primarily with China rather than the US; comparison to randomized networks.
The calibration mapping suggests Google and OpenAI face conditions most conducive to foreclosure.
Outcomes of the paper's stylized calibration/comparative mapping across four providers (April 2026 data); authors' interpretation.
The proposed approach aligns machine learning with actuarial portfolio optimization by explicitly integrating profit-driven objectives and operational constraints, offering two practical and scalable solutions for risk-based decision-making in real-world insurance settings.
Conceptual claim supported by the combination of methodological design and empirical results presented in the paper (method descriptions + experimental validation).
The balanced ensemble provides the most favourable trade-off between predictive performance, robustness, interpretability, and computational efficiency, making it suitable for deployment in regulated insurance environments.
Authors' synthesis of experimental results (performance, robustness tests, interpretability considerations, and computational efficiency measurements) and discussion regarding regulatory deployment suitability.
Spatial heterogeneity: Eastern regions are driven by knowledge recombination opportunities.
Reported spatial heterogeneity findings indicating Eastern China’s diffusion is driven more by recombination/opportunity measures than by reliance on core hubs.
Spatial heterogeneity: Western regions rely heavily on core technological hubs.
Spatial analysis / heterogeneity results reported by region indicating Western China depends on core technological hubs as diffusion sources or anchors.
Heterogeneity analysis: market-driven enterprises heavily rely on high-value core technologies.
Reported heterogeneity results indicating enterprises (market-driven actors) concentrate on and depend upon core, high-value technologies within identified diffusion paths.
Heterogeneity analysis: universities bridge distant domains through knowledge diversity.
Stratified/heterogeneity analysis reported in the paper showing that university actors are associated with cross-domain bridging and higher measured knowledge diversity in the diffusion paths.
We demonstrate that, by modifying the agent's tools (FreeCAD and the assembly solver), we are able to create a strong verification signal which enables our system to build 3D assemblies with movable parts.
Claim of experimental demonstration: authors state they modified tools (FreeCAD and assembly solver) to create a verification signal enabling building of movable 3D assemblies. Implied evidence is demonstrations/experiments in the paper (details, sample sizes, benchmarks not included in excerpt).
This design decision allows AADvark to reason directly about assemblies with moving parts and can thereby achieve cross-cutting goals, including but not limited to mechanical movements.
Claim about functional consequence of the design choice (ability to reason about moving assemblies and achieve related goals); evidence implied to be from system behavior/demonstrations in the paper but not provided in the excerpt.
Data elements provide a unique mechanism that enables late‑entrant firms to catch up technologically.
Interpretation drawn from the observed stronger positive association between data factor utilization and AI patent output among low‑TFP (late‑entrant) firms in the panel analyses.
Exploitative innovation is associated with performance through incremental efficiency mechanisms.
Authors' interpretation of model results from the survey (104 managers) suggesting exploitative innovation improves performance via incremental efficiency, though specific mechanisms were not separately measured.
These findings establish a framework for evidence-based policy interventions to align the NIH AI portfolio with health equity goals and strategic research priorities.
Interpretive/concluding statement proposing that the reported portfolio analysis can inform policy interventions; framed as implication rather than an empirical result.
The positive effect of AIRC on productivity is mediated through improvements in reproducibility.
Structural equation modeling (SEM) reports mediation through reproducibility metrics in the OECD panel analysis.
The positive effect of AIRC on productivity is mediated through improvements in review efficiency.
Structural equation modeling (SEM) indicates mediation paths from AIRC to productivity via measures of review efficiency in the panel data.
Coordinated digital green development strategies are important to promote a more balanced and inclusive transition toward China’s dual-carbon goals.
Policy implication drawn from the study's empirical findings (AI reduces inequality while green innovation has not diffused), recommending coordinated digital and green development to achieve balanced outcomes.
Because other AI systems exhibit similar scaling-law economics, the mechanisms identified extend beyond computer vision, reinforcing that partial automation is often the economically rational long-run outcome, not merely a transitional phase.
Theoretical argument generalized from scaling-law evidence in the paper; no additional cross-domain empirical evidence reported in the summary.
We further provide initial evidence that this AI-for-AI paradigm can transfer beyond the AI stack through experiments in mathematics and biomedicine.
Reported preliminary experiments in mathematics and biomedicine intended to test transfer beyond the AI development stack.
To our knowledge, ASI-Evolve is the first unified framework to demonstrate AI-driven discovery across three central components of AI development: data, architectures, and learning algorithms.
Authors' claim of primacy based on reported experiments demonstrating AI-driven discovery in pretraining data curation, neural architecture design, and reinforcement learning algorithm design.
Intelligent manufacturing policies can generate economically meaningful benefits by improving firms’ sustainability performance and the credibility of ESG information, which are central to capital allocation and the effectiveness of green governance.
Synthesis/implication drawn from the empirical findings reported in the paper (positive effects on ESG ratings, reduced greenwashing, and lower ESG uncertainty).
The growth of digital platforms contributes to the decentralization of job creation.
Paper cites contemporary data on the growth of digital platforms as part of its analysis (no specific platform-level datasets or sample sizes cited in the abstract).
The paper's predictions are consistent with practitioner reports.
Authors claim qualitative consistency with practitioner reports (no systematic survey/sample size provided in the provided text).