Evidence (5267 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	378	106	59	455	1007
Governance & Regulation	379	176	116	58	739
Research Productivity	240	96	34	294	668
Organizational Efficiency	370	82	63	35	553
Technology Adoption Rate	296	118	66	29	513
Firm Productivity	277	34	68	10	394
AI Safety & Ethics	117	177	44	24	364
Output Quality	244	61	23	26	354
Market Structure	107	123	85	14	334
Decision Quality	168	74	37	19	301
Fiscal & Macroeconomic	75	52	32	21	187
Employment Level	70	32	74	8	186
Skill Acquisition	89	32	39	9	169
Firm Revenue	96	34	22	—	152
Innovation Output	106	12	21	11	151
Consumer Welfare	70	30	37	7	144
Regulatory Compliance	52	61	13	3	129
Inequality Measures	24	68	31	4	127
Task Allocation	75	11	29	6	121
Training Effectiveness	55	12	12	16	96
Error Rate	42	48	6	—	96
Worker Satisfaction	45	32	11	6	94
Task Completion Time	78	5	4	2	89
Wages & Compensation	46	13	19	5	83
Team Performance	44	9	15	7	76
Hiring & Recruitment	39	4	6	3	52
Automation Exposure	18	17	9	5	50
Job Displacement	5	31	12	—	48
Social Protection	21	10	6	2	39
Developer Productivity	29	3	3	1	36
Worker Turnover	10	12	—	3	25
Skill Obsolescence	3	19	2	—	24
Creative Output	15	5	3	1	24
Labor Share of Income	10	4	9	—	23

Adoption Remove filter

A weak-model compensation pattern was observed: the lowest-baseline model (Gemini) shows a much larger D-A gain (+1.006) than the strongest model (Claude, +0.217).

Model-level comparison of D-A gain (difference between structured and unstructured conditions) across three models (Claude, GPT-4o, Gemini) on the evaluated outputs; reported gains for Gemini and Claude.

high positive Structured Intent as a Protocol-Like Communication Layer: Cr... D-A gain (improvement in goal-alignment score from structured prompting)

The strongest structured conditions reduce cross-language sigma from 0.470 to about 0.020.

Reported numeric comparison of sigma (variance) between unstructured baseline and strongest structured prompting conditions across evaluated outputs.

high positive Structured Intent as a Protocol-Like Communication Layer: Cr... cross-language sigma (standard deviation of scores across languages)

Structured prompting substantially reduces cross-language score variance relative to unstructured baselines.

Empirical comparison across 3,240 outputs evaluated by DeepSeek-V3, comparing structured vs. unstructured prompting across three languages.

high positive Structured Intent as a Protocol-Like Communication Layer: Cr... cross-language score variance (sigma)

Prior work showed that PPS (Prompt Protocol Specification), a 5W3H-based structured intent framework, improves goal alignment in Chinese and generalizes to English and Japanese.

Statement referring to prior work (not new experiments in this paper); no sample size or methods provided in this text excerpt.

high positive Structured Intent as a Protocol-Like Communication Layer: Cr... goal alignment (language generalization)

Large language model (LLM) use can improve observable output and short-term task performance.

Paper synthesizes empirical findings from human–AI interaction studies, learning-research experiments, and model-evaluation work indicating improved produced outputs and short-term task performance when humans use LLMs; no single pooled sample size or unified effect estimate is reported in the paper.

high positive Beyond the Steeper Curve: AI-Mediated Metacognitive Decoupli... observable output quality and short-term task performance

Proposition 2: An increase in the pace of technology creation (m(b) rising from m to m') generates a transitory increase in the skill premium (even if the increase is permanent, because new technologies eventually age).

Analytical result (proposition) proved in the paper's model appendix; intuition and special-case (γ=σ) illustrated in text.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE transitional behavior of skill premium following a change in m(b)

The college premium rose first among young workers and later among older workers; a model extension that assumes younger workers have a comparative advantage in new technologies generates age-specific increases that account for half of the observed age gaps.

Extension of the model with worker demographics; calibration using CPS data on computer use by worker age (showing young workers used computers more intensively initially) and simulation comparing model to observed age-specific wage premium changes.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE college premium by worker age (timing and magnitude of increase)

Slow diffusion, combined with the rapid pace of technology creation, accounts for 6.2 of the 8.7 log-point differential increase in the skill premium between high- and low-density regions over 1980–2005.

Model calibrated with estimated diffusion rates across regions from the text-based dataset; quantitative decomposition attributing portions of the regional differential to the mechanism.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE regional differential increase in skill premium (log points) over 1980–2005

The mechanism explains why the college premium is higher in dense cities and why its increase was mainly urban.

Model extension incorporating regional diffusion of technologies combined with estimates of diffusion rates across locations (using the Kalyani et al. dataset); comparison of model predictions to documented urban–rural wage premium patterns.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE college premium by city density

Total demand for college-educated workers increased by 100 log points since 1980; changes in the pace of technology creation account for one-third of that increase, with the remainder attributed to residual structural changes in production.

Model-based decomposition calibrated to data (demand and supply of college-educated workers since 1980); quantitative accounting exercise reported in the paper.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE demand for college-educated workers (log points since 1980)

When calibrated to the observed pace of technology creation, the model generates a 28 log-point (32 percent) increase in the college premium between 1980 and 2010, which then flattens and begins to revert.

Quantitative calibration of the model to novel text-based technology data (arrival and diffusion) and wage series (CPS); simulation results.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE college premium over 1980–2010

The data show a temporary increase in the pace of new technology creation beginning in the 1970s, accelerating in the 1980s, and tapering off in the 2000s.

Time series of identified new technologies from text-based measures (patent text/job posting linkage) covering 1976–2007 (as in Kalyani et al., 2025) used to measure arrival rates by cohort.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE rate of arrival of new technologies (pace of technology creation)

The pace of technology creation is a key driver of the skill premium: a rapid pace of technology creation leads to a sustained increase in the skill premium (because skilled workers learn to use new technologies faster).

Theoretical model developed in the paper in which new technologies arrive exogenously and skilled workers have a comparative advantage in learning new technologies; supported by calibration using novel text-based data (patent text and job postings) and CPS wage data.

high positive THE SKILL PREMIUM IN TIMES OF RAPID TECHNOLOGICAL CHANGE skill premium (college wage premium)

These household-level non-market productivity gains (ChatGPT making productive online tasks more efficient and freeing time for leisure) are economically large and likely constitute a substantial share of the overall economic impact of generative AI.

Combination of empirical IV estimates showing leisure increases and productivity-unchanged productive time, plus model-implied efficiency gains; authors' interpretation and welfare discussion in paper.

high positive https://arxiv.org/pdf/2603.03144 household non-market productivity and welfare (implied aggregate economic impact...

Mapping the empirical time-reallocation into a quantitative household time-allocation model implies generative AI approximately doubles the efficiency of productive online tasks for adopters; preferred calibration implies efficiency gains of 76%–176%.

Quantitative time-allocation model adapted from Aguiar et al. (2021); model uses empirical IV estimates for time reallocation and Engel curve elasticities estimated via IV (local precipitation shocks). Authors report implied efficiency gains of 76%–176% and state 'approximately doubles' efficiency.

high positive https://arxiv.org/pdf/2603.03144 efficiency (productivity) of productive digital tasks

Households predominantly utilize ChatGPT in the context of productive online activities (education, job search, informational research) rather than during leisure browsing, as inferred from the browsing context around ChatGPT use.

High-frequency analysis comparing 30-minute browsing intervals around ChatGPT visits to intervals of demographically similar non-users; LLM-based inference of website purpose; observed co-occurrence with productive-site categories.

high positive https://arxiv.org/pdf/2603.03144 context/purpose of ChatGPT use (productive vs leisure)

ChatGPT adoption increases the leisure share of browsing duration by about 30 percentage points.

IV long-difference estimates from Comscore browsing data with LLM-based site classification; authors report a ~30 percentage point increase in leisure share after adoption.

high positive https://arxiv.org/pdf/2603.03144 leisure share of total browsing duration

In long-difference IV estimates, ChatGPT adoption raises total leisure browsing time by roughly 150 log points.

IV long-difference estimates using pre-ChatGPT exposure as instrument; reported effect described as 'roughly 150 log points' increase in total leisure browsing time.

high positive https://arxiv.org/pdf/2603.03144 total leisure browsing time (log change)

A household's pre-ChatGPT ex-ante exposure (based on 2021 browsing composition) strongly predicts subsequent ChatGPT adoption: a 1 SD higher exposure predicts a 2.5 percentage point higher rate of having used ChatGPT by December 2024.

Constructed 'exposure' measure by aggregating site-level overlap with chatbot capabilities over household 2021 browsing; predictive regression (household-level) linking 1 SD change in exposure to 2.5pp higher adoption by Dec 2024 (statistic reported in paper).

high positive https://arxiv.org/pdf/2603.03144 probability / rate of ChatGPT adoption by Dec 2024

ChatGPT adoption among private households has been rapid following release, but adoption is far from uniform.

Descriptive adoption patterns measured from Comscore browsing data over time (pre- and post-Nov 30, 2022) on the household panel (2021–2024); time-series of observed ChatGPT site visits and adoption rates.

high positive https://arxiv.org/pdf/2603.03144 ChatGPT adoption rate over time

The experience-centered learning mechanism proactively recalls rewarded trajectories at inference time.

Specific technical/design claim about Synergy's learning mechanism; asserted in paper as a mechanism feature rather than demonstrated with quantified results in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... agent learning behavior (recall of rewarded trajectories during inference)

Synergy grounds collaboration in session-native orchestration, repository-backed workspaces, and social communication; identity in typed memory, notes, agenda, skills, and persistent social relationships; and evolution in an experience-centered learning mechanism that proactively recalls rewarded trajectories at inference time.

Detailed design claims describing Synergy's mechanisms and intended grounding for collaboration, identity, and evolution; presented as architectural description, no experimental evaluation provided in the excerpt.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... architectural features supporting collaboration, identity, and learning (session...

We present Synergy, a general-purpose agent architecture and runtime harness for persistent, collaborative, and evolving agents on Open Agentic Web.

Paper's contribution statement indicating the authors propose an architecture named Synergy; this is a systems/design claim rather than an empirical result in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... existence of an architecture (Synergy) designed for persistent, collaborative, e...

The next generation of agents must become Agentic Citizens, defined by three requirements: Agentic-Web-Native Collaboration, participation in open collaboration networks rather than only closed internal orchestration; Agent Identity and Personhood, continuity as a social entity rather than a resettable function call; and Lifelong Evolution, improvement across task performance, communication, and collaboration over time.

Normative/design prescription from the authors; conceptual argument for three requirements rather than empirical validation.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... agent design properties (collaboration, identity/personhood, lifelong evolution)

As the internet prepares to host billions of such entities, it is shifting toward what we call Open Agentic Web, a decentralized digital ecosystem in which agents from different users, organizations, and runtimes can discover one another, negotiate task boundaries, and delegate work across open technical and social surfaces at scale.

Conceptual claim / framing by the authors describing a projected/ongoing shift; no empirical measurement of 'billions' or of ecosystem properties provided in the excerpt.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... emergence of an Open Agentic Web (decentralized agent ecosystem, discovery, nego...

Embodied agents are spreading across smartphones, vehicles, and robots.

Author observation/claim in the paper's opening; no empirical study, metrics, or examples quantified in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... deployment/penetration of embodied agents across device categories

Open-source frameworks such as OpenClaw are putting personal agents in the hands of millions.

Author assertion naming OpenClaw and a numeric adoption claim; no supporting empirical data or citation contained in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... adoption of personal agents (number of users)

AI agents are rapidly expanding in both capability and population: they now write code, operate computers across platforms, manage cloud infrastructure, and make purchasing decisions.

Author assertion in paper's introduction / high-level observation; no empirical study, dataset, or experiment reported in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... agents' capabilities and population growth (write code, operate computers, manag...

IMDPs lower ESG rating uncertainty.

The paper constructs measures of ESG rating uncertainty and finds IMDP participation reduces rating uncertainty.

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... ESG rating uncertainty

IMDPs reduce greenwashing.

The paper constructs measures of greenwashing and reports that IMDP participation lowers those greenwashing measures.

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... greenwashing

The positive effect of IMDP participation on ESG performance is stronger in capital-scarce industries.

Heterogeneity analysis by industry capital-scarcity reported in the paper indicating larger IMDP effects in capital-scarce industries.

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... ESG ratings

The positive effect of IMDP participation on ESG performance is stronger for firms at the growth stage.

Heterogeneity analysis by firm life-cycle stage reported in the paper showing larger effects for growth-stage firms.

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... ESG ratings

The positive effect of IMDP participation on ESG performance is stronger for firms under intense competitive pressure.

Heterogeneity analysis reported in the paper that splits the sample by measures of competitive pressure and finds larger effects for firms facing more intense competition.

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... ESG ratings

The effect of IMDP participation on ESG performance operates through improved cost management, consistent with capability upgrading and resource reallocation toward sustainability-related activities.

Mechanism analyses reported in the paper linking IMDP participation to measures of cost management and interpreting this as capability upgrading/resource reallocation.

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... cost management

The effect of IMDP participation on ESG performance operates through higher innovation efficiency.

Mechanism analyses reported in the paper (mediation/decomposition analyses linking IMDP participation to measures of innovation efficiency).

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... innovation efficiency

IMDP participation increases ESG ratings by approximately 0.14 rating levels relative to comparable non-participating firms.

Quasi-natural experiment exploiting staggered rollout of IMDPs; propensity score matching combined with a multi-period difference-in-differences design using panel data on Chinese listed manufacturing firms from 2009 to 2022 (as reported in the paper).

high positive Intelligent Manufacturing Demonstration Projects Driving Cor... ESG ratings

Education and workforce development should shift focus from rote knowledge accumulation to cultivating skills in human-AI collaboration, creative problem-solving, and the design of novel economic domains.

Normative policy recommendation derived from the paper's framework and analysis of anticipated labor market changes (no empirical evaluation or trial data reported in the abstract).

high positive AI Civilization and the Transformation of Work educational focus / skill composition

Human-AI co-evolution will significantly increase individual productivity and open new frontiers of economic activity.

Projected outcome based on combined analysis of AI capabilities, historical patterns, and platform growth; the abstract does not report empirical measurement or sample sizes for this projection.

high positive AI Civilization and the Transformation of Work individual productivity and emergence of new economic activities

AI-driven productivity augmentation dramatically lowers the barriers to creating economic value, enabling the decentralized generation of employment.

Argument supported by paper's analysis of contemporary labor market dynamics and the growth of digital platforms; no quantified empirical estimates or sample sizes provided in the abstract.

high positive AI Civilization and the Transformation of Work barriers to entry for value creation / individual productivity

The transition to an AI-civilization will fundamentally restructure the mechanisms of employment creation from a centralized model (few organizations creating jobs for the many) to a decentralized ecosystem where individuals are empowered to generate their own employment opportunities.

Central thesis of the paper, motivated by theoretical argumentation and synthesis of contemporary data on labor markets and digital platforms (no empirical test or sample sizes specified in the abstract).

high positive AI Civilization and the Transformation of Work structure/mechanism of employment creation (centralized vs decentralized)

Historical precedents from past technological revolutions suggest that innovation tends to expand, rather than shrink, the scope of economic activity and employment in the long run.

Paper draws on analysis of economic history (qualitative historical analysis implied; no specific historical datasets or sample sizes provided in the abstract).

high positive AI Civilization and the Transformation of Work scope of economic activity and long-run employment levels

By formalizing the end-to-end transaction model together with its asset and incentive layers, EpochX reframes agentic AI as an organizational design problem focused on infrastructures where verifiable work leaves persistent, reusable artifacts and value flows support durable human-agent collaboration.

Theoretical framing and normative claim in the paper; no empirical evaluation demonstrating that this reframing yields measurable benefits.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... organizational framing and potential for durable human-agent collaboration

Credits lock task bounties, allow budget delegation, settle rewards upon acceptance, and compensate creators when verified assets are reused.

Functional description of the credit mechanics and settlement rules within the proposed EpochX marketplace; presented as part of system design without empirical settlement or user-behavior data.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... incentive flows, reward settlement, and compensation for asset reuse

EpochX introduces a native credit mechanism to make participation economically viable under real compute costs.

Proposed economic/incentive mechanism described in the paper; no empirical cost analysis, pricing model validation, or participant economic outcomes reported.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... economic viability of participation under compute costs

These assets are stored with explicit dependency structure, enabling retrieval, composition, and cumulative improvement over time.

Design-level assertion about data model/asset graph in the EpochX proposal; no empirical results demonstrating retrieval/composition or measured cumulative improvement.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... asset retrieval/composition and cumulative improvement

Each completed transaction can produce reusable ecosystem assets, including skills, workflows, execution traces, and distilled experience.

Architectural claim about artifacts produced per transaction in EpochX; described as a design goal rather than backed by empirical evidence or deployment data.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... creation of reusable assets (skills, workflows, traces, distilled experience)

Claimed tasks can be decomposed into subtasks and executed through an explicit delivery workflow with verification and acceptance.

Design description of the workflow and verification/acceptance mechanisms in the proposed EpochX architecture; no empirical testing or metrics reported.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... task execution workflow, verification and acceptance outcomes

EpochX treats humans and agents as peer participants who can post tasks or claim them.

Architectural/design specification in the paper describing participant roles and interactions; no empirical validation provided.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... task posting and claiming behavior / task allocation model

We introduce EpochX, a credits-native marketplace infrastructure for human-agent production networks.

System/design description in the paper (architectural proposal); no deployment, user study, or evaluation results reported.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... marketplace infrastructure availability / adoption potential

AI methods improve sustainability disclosure (disclosure to sustainability).

Stated in the review as an outcome of employing AI for ESG analytics and sustainability reporting; specific supporting studies or sizes are not provided in the excerpt.

high positive Artificial intelligence in sustainable finance and Environme... sustainability disclosure quality/transparency

« Prev 1 2 3 … 25 26 27 … 105 106 Next »