Evidence (3492 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	609	159	77	736	1615
Governance & Regulation	664	329	160	99	1273
Organizational Efficiency	624	143	105	70	949
Technology Adoption Rate	502	176	98	78	861
Research Productivity	348	109	48	322	836
Output Quality	391	120	44	40	595
Firm Productivity	385	46	85	17	539
Decision Quality	275	143	62	34	521
AI Safety & Ethics	183	241	59	30	517
Market Structure	152	154	109	20	440
Task Allocation	158	50	56	26	295
Innovation Output	178	23	38	17	257
Skill Acquisition	137	52	50	13	252
Fiscal & Macroeconomic	120	64	38	23	252
Employment Level	93	46	96	12	249
Firm Revenue	130	43	26	3	202
Consumer Welfare	99	51	40	11	201
Inequality Measures	36	105	40	6	187
Task Completion Time	134	18	6	5	163
Worker Satisfaction	79	54	16	11	160
Error Rate	64	78	8	1	151
Regulatory Compliance	69	64	14	3	150
Training Effectiveness	81	15	13	18	129
Wages & Compensation	70	25	22	6	123
Team Performance	74	16	21	9	121
Automation Exposure	41	48	19	9	120
Job Displacement	11	71	16	1	99
Developer Productivity	71	14	9	3	98
Hiring & Recruitment	49	7	8	3	67
Social Protection	26	14	8	2	50
Creative Output	26	14	6	2	49
Skill Obsolescence	5	37	5	1	48
Labor Share of Income	12	13	12	—	37
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Innovation Remove filter

Realising DT value requires upfront investment in sensors, integration, standards, and skills; economic viability depends on contract structures and how gains are allocated between investors, owners, contractors, and operators.

Synthesis of cost/benefit discussions and case descriptions in the reviewed literature; policy and procurement examples referenced.

medium null result Digital Twins Across the Asset Lifecycle: Technical, Organis... investment requirements and determinants of economic viability

HCI has explored usable consent, but there is no systematic framework for consent in the AI era.

Literature synthesis and gap identification from workshop participants and solicited position papers; no systematic review or meta-analysis with counted studies reported in the summary.

medium null result Moving Beyond Clicks: Rethinking Consent and User Control in... existence of a systematic AI-era consent framework

Research priorities include causal studies on productivity gains from AI, firm‑level adoption dynamics, sectoral labor reallocation, long‑run general equilibrium effects, and heterogeneous impacts across regions and demographic groups.

Set of empirical research recommendations drawn from gaps identified in the literature review and limitations section; not an empirical claim but a prioritized research agenda based on secondary evidence.

medium null result AI and Robotics Redefine Output and Growth: The New Producti... knowledge gaps to be addressed (research outcomes)

Growth‑accounting frameworks and measurement approaches must be updated to capture AI/robotics as intangible and embodied capital, including quality improvements and spillovers.

Methodological argument grounded in literature on measurement challenges and examples of intangible capital; no new measurement exercise or empirical re‑estimation is provided in the paper.

medium null result AI and Robotics Redefine Output and Growth: The New Producti... measurement accuracy of productivity accounts, capture of intangible capital and...

Recommendation for research and modeling: economic models of AI markets should incorporate institutional regime types (centralized vs decentralized), enforcement uncertainty, and legitimacy effects as parameters affecting data access costs, R&D productivity, and market concentration.

Normative recommendation based on the comparative typology and inferred mechanisms from the document analysis; not empirically validated within the study.

medium null result Balancing openness and security in scientific data governanc... modeling parameters (regime type, enforcement uncertainty, legitimacy effects) a...

Theoretical contribution: the paper extends modular coordination theory by treating openness–security trade‑offs as layered, adaptive institutional processes embedded in political regimes and 'legitimacy economies.'

Argumentative/theoretical development in the paper grounded in document analysis and literature on coordination and legitimacy.

medium null result Balancing openness and security in scientific data governanc... theoretical framing / extension of modular coordination theory

The shift toward solo entry is particularly pronounced in categories that historically favored team-based ventures.

Category-level breakdowns within the Product Hunt dataset showing larger increases in solo-founder launches in categories with a historical bias toward team-based ventures.

medium positive Generative AI Fuels Solo Entrepreneurship, but Teams Still L... change in solo-founder share by category (relative increase)

For lenders and investors, wider VTech adoption can enhance valuation accuracy, portfolio transparency and collateral risk assessment, strengthening confidence in property markets and capital allocation.

Interpretation and implications drawn from interview data and theoretical synthesis; no quantitative measurement reported in the study.

medium positive Exploring barriers to valuation technology adoption in prope... valuation accuracy, portfolio transparency and collateral risk assessment

Based on the findings, firms should invest in proprietary AI models and governments should promote open data initiatives.

Policy recommendations presented in the conclusion, motivated by empirical findings (inverted-U, homogenization trap, heterogeneity).

medium positive The Inverted-U Relationship Between AI and Corporate Innovat... policy/recommendation implications (firm and government actions)

Smart manufacturing provides a practical pathway for enhancing economic performance while reducing environmental impact.

Framing/theoretical claim in the paper's introduction motivating the study; supported by cited literature rather than the paper's primary empirical DiD test.

medium positive Environmental policy synergies: unintended benefits for smar... economic performance and environmental impact in relation to smart manufacturing...

Improvements in firms' resource allocation efficiency enhance their ability to adopt smart manufacturing technologies (mechanism).

Mechanism analysis within the study showing that gains in resource allocation efficiency at the firm level are associated with higher adoption of smart manufacturing after LCCP implementation.

medium positive Environmental policy synergies: unintended benefits for smar... firms' resource allocation efficiency and subsequent adoption of smart manufactu...

City-level human capital upgrading lowers firms' costs of adopting smart manufacturing technologies, facilitating adoption (mechanism).

Mechanism analysis reported in the paper linking city-level human capital improvements to reduced firm-level adoption costs and increased adoption; likely based on city-level measures of human capital interacting with treatment in the DiD framework.

medium positive Environmental policy synergies: unintended benefits for smar... firms' cost of adopting smart manufacturing technologies (mediated by city-level...

Generation-protocol variants show that crowding can be reduced through targeted design, making diversity collapse an actionable, development-time evaluation target for population-aware creative AI.

Experimental evidence in the paper demonstrating that modifying generation protocols (design choices) reduces crowding; abstract states results across protocol variants but does not provide quantitative effect sizes or sample counts.

medium positive Ex Ante Evaluation of AI-Induced Idea Diversity Collapse change in crowding (Δ or ρ) under generation-protocol variants

Estimates stabilize with feasible model-only sample sizes.

Empirical/stability analysis reported in the paper (abstract claims convergence/stabilization of estimates with feasible numbers of model-only samples), but the abstract does not quantify what 'feasible' means or give sample counts.

medium positive Ex Ante Evaluation of AI-Induced Idea Diversity Collapse stability/convergence of crowding estimates as model-only sample size increases

Resource-based environmental taxation (the water resource tax reform) can play a role in promoting food security under rigid water constraints.

Interpretation and policy discussion based on the empirical results showing increased grain yield following the reform.

medium positive Can water resource tax reform increase grain yield?—Evidence... food security (via grain yield)

The reform improves water-use efficiency (a channel through which it raises agricultural productivity).

Mechanism analysis in the paper indicating strengthened water-use efficiency following the reform.

medium positive Can water resource tax reform increase grain yield?—Evidence... water-use efficiency

A DLM (Schema-1) eliminates the preprocessing pipelines that currently stand between raw tabular data and AI systems that consume it.

Claims based on model's native consumption of raw cell values and experimental demonstrations (design and reported evaluations suggest reduced need for preprocessing; specific operational workflow impacts not quantified in the abstract).

medium positive Data Language Models: A New Foundation Model Class for Tabul... presence/absence or reduction of preprocessing pipeline steps required before mo...

Schema-1 identifies the industry sector of any unseen dataset from raw cell values alone, reliably across any domain—a task no prior tabular model can perform.

Reported experiments demonstrating industry-sector identification from raw cell values on unseen datasets and cross-domain reliability (details of datasets, number of domains, and metrics not provided in the abstract).

medium positive Data Language Models: A New Foundation Model Class for Tabul... accuracy/reliability of industry-sector identification from raw tabular data

Production agentic systems make many model calls per user request, and most of those calls are short, structured, and routine.

Contextual claim motivating the work; presented as an empirical generalization about production agent pipelines, but not quantified in the abstract.

medium positive AgentFloor: How Far Up the tool use Ladder Can Small Open-We... distribution of model-call types in production agentic systems (short/structured...

Small and mid-sized open-weight models are already sufficient for much of the short-horizon, structured tool use work that dominates real agent pipelines.

Aggregate benchmark results across AgentFloor tiers showing high performance of smaller and mid-sized open-weight models on short-horizon structured tasks; supported by the 16,542 scored runs and model comparisons reported in the paper.

medium positive AgentFloor: How Far Up the tool use Ladder Can Small Open-We... ability to complete short-horizon, structured tool-use tasks on the AgentFloor b...

To our knowledge, this is the first demonstration of an AI agentic system autonomously identifying and experimentally validating a nontrivial, previously unreported physical mechanism.

Authors' novelty claim, supported by the reported autonomous proposal and experimental validation of the optical bilinear interaction in their study.

medium positive End-to-end autonomous scientific discovery on a real optical... novelty of AI-driven autonomous experimental discovery (identification + experim...

Qiushi Engine converts an abstract coherence-order theory into experimental observables, providing the first observation of this class of coherence-order structure.

Reported experimental procedure translating coherence-order theory into measurable observables and claiming the first observation of that class of structure; experimental data and analysis presented in the paper supporting the observation.

medium positive End-to-end autonomous scientific discovery on a real optical... observation of coherence-order structure predicted by theory

These simulations produce rich experiential learning signals, whose effectiveness is validated by significant improvements in agent performance on both in-domain and out-of-domain productivity evaluations.

Evaluation experiments reported in the paper claiming statistically/qualitatively significant improvements in agent performance on in-domain and out-of-domain productivity benchmarks after training on simulation-generated signals.

medium positive Synthetic Computers at Scale for Long-Horizon Productivity S... agent performance on productivity evaluations (in-domain and out-of-domain)

Claude 3.5 Sonnet aligns with a narrative funder profile, showing greater responsiveness to qualitative aspects of the pitch, somewhat higher funding levels, and strong cross-run reliability.

Comparative observations across the experiment: Claude 3.5 Sonnet was more responsive to qualitative information in pitch decks, tended to recommend higher funding levels, and demonstrated strong reliability across runs.

medium positive Algorithmic personalities and the myth of neutrality: financ... responsiveness to qualitative aspects; funding levels; reliability

Agentic Architect is the first end-to-end open-source framework for agentic AI architecture exploration and optimization.

Authors' claim in the abstract and paper asserting novelty and open-source release. No independent verification provided in the abstract.

medium positive Agentic Architect: An Agentic AI Framework for Architecture ... adoption_rate

The central obstacle to agent self-improvement is not what to remember but how to use what has been remembered (which retrieval policy to apply, how to interpret prior outcomes, and when the current strategy itself must change).

Conceptual claim supported by authors' argumentation and by the experimental results (ablation showing gains from reflection/use mechanisms rather than added architectural complexity).

medium positive AEL: Agent Evolving Learning for Open-Ended Environments bottleneck characterization for agent self-improvement

Differences in institutional quality, digital infrastructure, and absorptive capacity explain the disparity in technology impacts between GCC and non-GCC countries.

Exploratory/mediation or interaction analysis linking institutional quality, measures of digital infrastructure, and absorptive capacity to heterogeneity in estimated technology effects across countries in the panel.

medium positive Digital Transformation, AI Efficiency, and Sustainable Devel... heterogeneity in the effect of digital transformation/AI on sustainable developm...

The capital market evaluates AI investment as a future 'growth option' selectively in industries with strong data infrastructure, digital workforce readiness, and absorptive capacity.

Inference from heterogeneous positive Tobin's Q effect found in the ICT industry and null average effect across all firms; authors argue market valuation responds to industry-specific complementary assets and ecosystem conditions.

medium positive The Dynamic Causal Effects of Corporate AI Adoption on Profi... market valuation response to AI investment (interpreted as growth-option pricing...

Developing and further developed countries only integrate with China, signaling China's expanding influence over the international AI research landscape.

Observed integration patterns in the publication-based collaboration and citation networks showing that (some) developing and further developed countries connect primarily with China rather than the US; comparison to randomized networks.

medium positive Polarization and Integration in Global AI Research international research integration of developing and further developed countries...

The calibration mapping suggests Google and OpenAI face conditions most conducive to foreclosure.

Outcomes of the paper's stylized calibration/comparative mapping across four providers (April 2026 data); authors' interpretation.

medium positive The Inference Bottleneck: A Formal Model of Vertical Foreclo... conduciveness to foreclosure

The proposed approach aligns machine learning with actuarial portfolio optimization by explicitly integrating profit-driven objectives and operational constraints, offering two practical and scalable solutions for risk-based decision-making in real-world insurance settings.

Conceptual claim supported by the combination of methodological design and empirical results presented in the paper (method descriptions + experimental validation).

medium positive Advanced Insurance Risk Modeling for Pseudo-New Customers Us... risk-based_decision-making_effectiveness

The balanced ensemble provides the most favourable trade-off between predictive performance, robustness, interpretability, and computational efficiency, making it suitable for deployment in regulated insurance environments.

Authors' synthesis of experimental results (performance, robustness tests, interpretability considerations, and computational efficiency measurements) and discussion regarding regulatory deployment suitability.

medium positive Advanced Insurance Risk Modeling for Pseudo-New Customers Us... suitability_for_deployment / trade-off_between_metrics

Spatial heterogeneity: Eastern regions are driven by knowledge recombination opportunities.

Reported spatial heterogeneity findings indicating Eastern China’s diffusion is driven more by recombination/opportunity measures than by reliance on core hubs.

medium positive Mapping China’s digital transformation: a multilayer network... drivers of diffusion in Eastern regions (knowledge recombination)

Spatial heterogeneity: Western regions rely heavily on core technological hubs.

Spatial analysis / heterogeneity results reported by region indicating Western China depends on core technological hubs as diffusion sources or anchors.

medium positive Mapping China’s digital transformation: a multilayer network... regional dependence on core technological hubs (Western regions)

Heterogeneity analysis: market-driven enterprises heavily rely on high-value core technologies.

Reported heterogeneity results indicating enterprises (market-driven actors) concentrate on and depend upon core, high-value technologies within identified diffusion paths.

medium positive Mapping China’s digital transformation: a multilayer network... enterprises' reliance on core high-value technologies

Heterogeneity analysis: universities bridge distant domains through knowledge diversity.

Stratified/heterogeneity analysis reported in the paper showing that university actors are associated with cross-domain bridging and higher measured knowledge diversity in the diffusion paths.

medium positive Mapping China’s digital transformation: a multilayer network... universities' role in bridging domains via knowledge diversity

We demonstrate that, by modifying the agent's tools (FreeCAD and the assembly solver), we are able to create a strong verification signal which enables our system to build 3D assemblies with movable parts.

Claim of experimental demonstration: authors state they modified tools (FreeCAD and assembly solver) to create a verification signal enabling building of movable 3D assemblies. Implied evidence is demonstrations/experiments in the paper (details, sample sizes, benchmarks not included in excerpt).

medium positive Agent-Aided Design for Dynamic CAD Models ability to build 3D assemblies with movable parts (enabled by enhanced verificat...

This design decision allows AADvark to reason directly about assemblies with moving parts and can thereby achieve cross-cutting goals, including but not limited to mechanical movements.

Claim about functional consequence of the design choice (ability to reason about moving assemblies and achieve related goals); evidence implied to be from system behavior/demonstrations in the paper but not provided in the excerpt.

medium positive Agent-Aided Design for Dynamic CAD Models reasoning about assemblies with moving parts / achieving mechanical movement goa...

Data elements provide a unique mechanism that enables late‑entrant firms to catch up technologically.

Interpretation drawn from the observed stronger positive association between data factor utilization and AI patent output among low‑TFP (late‑entrant) firms in the panel analyses.

medium positive The level of data element utilization in the integration of ... technological catch‑up (proxied by AI patent output increases among late entrant...

Exploitative innovation is associated with performance through incremental efficiency mechanisms.

Authors' interpretation of model results from the survey (104 managers) suggesting exploitative innovation improves performance via incremental efficiency, though specific mechanisms were not separately measured.

medium positive Generative AI Adoption in B2B Firms: Ethical Governance, Inn... long-term competitive performance

These findings establish a framework for evidence-based policy interventions to align the NIH AI portfolio with health equity goals and strategic research priorities.

Interpretive/concluding statement proposing that the reported portfolio analysis can inform policy interventions; framed as implication rather than an empirical result.

medium positive An Analysis of Artificial Intelligence Adoption in NIH-Funde... use of analysis to support policy interventions

The positive effect of AIRC on productivity is mediated through improvements in reproducibility.

Structural equation modeling (SEM) reports mediation through reproducibility metrics in the OECD panel analysis.

medium positive AI-Augmented Peer Review and Scientific Productivity: A Cros... research reproducibility (as mediator)

The positive effect of AIRC on productivity is mediated through improvements in review efficiency.

Structural equation modeling (SEM) indicates mediation paths from AIRC to productivity via measures of review efficiency in the panel data.

medium positive AI-Augmented Peer Review and Scientific Productivity: A Cros... review efficiency (as mediator)

Coordinated digital green development strategies are important to promote a more balanced and inclusive transition toward China’s dual-carbon goals.

Policy implication drawn from the study's empirical findings (AI reduces inequality while green innovation has not diffused), recommending coordinated digital and green development to achieve balanced outcomes.

medium positive Artificial intelligence, green innovation, and regional carb... balanced and inclusive transition to carbon peak and neutrality goals

Because other AI systems exhibit similar scaling-law economics, the mechanisms identified extend beyond computer vision, reinforcing that partial automation is often the economically rational long-run outcome, not merely a transitional phase.

Theoretical argument generalized from scaling-law evidence in the paper; no additional cross-domain empirical evidence reported in the summary.

medium positive Economics of Human and AI Collaboration: When is Partial Aut... prevalence of partial automation across AI application domains

We further provide initial evidence that this AI-for-AI paradigm can transfer beyond the AI stack through experiments in mathematics and biomedicine.

Reported preliminary experiments in mathematics and biomedicine intended to test transfer beyond the AI development stack.

medium positive ASI-Evolve: AI Accelerates AI transferability of AI-for-AI paradigm to domains outside core AI (mathematics an...

To our knowledge, ASI-Evolve is the first unified framework to demonstrate AI-driven discovery across three central components of AI development: data, architectures, and learning algorithms.

Authors' claim of primacy based on reported experiments demonstrating AI-driven discovery in pretraining data curation, neural architecture design, and reinforcement learning algorithm design.

medium positive ASI-Evolve: AI Accelerates AI breadth of AI-driven discovery across data, architectures, and learning algorith...

Intelligent manufacturing policies can generate economically meaningful benefits by improving firms’ sustainability performance and the credibility of ESG information, which are central to capital allocation and the effectiveness of green governance.

Synthesis/implication drawn from the empirical findings reported in the paper (positive effects on ESG ratings, reduced greenwashing, and lower ESG uncertainty).

medium positive Intelligent Manufacturing Demonstration Projects Driving Cor... sustainability performance and credibility of ESG information

The growth of digital platforms contributes to the decentralization of job creation.

Paper cites contemporary data on the growth of digital platforms as part of its analysis (no specific platform-level datasets or sample sizes cited in the abstract).

medium positive AI Civilization and the Transformation of Work role of digital platforms in job creation / decentralization

The paper's predictions are consistent with practitioner reports.

Authors claim qualitative consistency with practitioner reports (no systematic survey/sample size provided in the provided text).

medium positive The Novelty Bottleneck: A Framework for Understanding Human ... qualitative alignment with practitioner experiences

« Prev 1 2 3 … 51 52 53 … 69 70 Next »