Evidence (6491 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	758	199	100	900	2007
Governance & Regulation	826	400	191	122	1563
Organizational Efficiency	777	193	124	84	1189
Technology Adoption Rate	635	233	124	97	1098
Research Productivity	422	128	57	336	954
Output Quality	476	179	59	47	761
Decision Quality	328	177	81	47	640
Firm Productivity	435	57	88	20	606
AI Safety & Ethics	218	277	65	33	599
Market Structure	180	170	123	24	502
Task Allocation	213	64	72	33	387
Skill Acquisition	170	61	61	17	309
Innovation Output	203	27	43	18	292
Employment Level	105	54	107	13	281
Fiscal & Macroeconomic	131	69	43	26	276
Consumer Welfare	117	63	42	11	233
Firm Revenue	153	48	26	3	230
Task Completion Time	173	31	8	12	225
Inequality Measures	44	122	49	6	221
Worker Satisfaction	89	65	22	12	188
Error Rate	69	92	10	2	173
Regulatory Compliance	77	69	14	5	165
Automation Exposure	56	56	26	13	154
Training Effectiveness	94	21	13	19	149
Wages & Compensation	77	36	25	6	144
Team Performance	86	17	27	10	141
Developer Productivity	95	17	14	6	133
Job Displacement	12	80	20	1	113
Hiring & Recruitment	52	7	8	3	70
Creative Output	31	18	8	3	61
Skill Obsolescence	5	46	6	1	58
Social Protection	27	16	8	2	53
Labor Share of Income	17	19	17	—	53
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Human Ai Collab Remove filter

The planner can raise social welfare by focusing technological progress on making goods cheaper that are disproportionately consumed by relatively poorer agents, thereby raising their real income.

Extension of the baseline model to multiple goods showing distributional gains via composition of price changes (real income channel).

high positive NBER WORKING PAPER SERIES real income of poorer agents / social welfare

When capital and labor are gross complements, a planner concerned with workers' welfare would favor capital-augmenting innovations to raise wages.

Analytical result from the model analyzing factor-augmenting technological progress and complementarity between capital and labor.

high positive NBER WORKING PAPER SERIES wages

A planner with sufficient welfare weight on workers will impose positive robot taxes, with the tax rate increasing in the planner's concern for workers' welfare.

Application of the baseline model to robot taxation; analytical derivation of optimal robot tax under planner preferences.

high positive NBER WORKING PAPER SERIES optimal robot tax rate

As labor's economic value diminishes, steering progress focuses increasingly on enhancing human well-being (non-monetary aspects) rather than labor productivity.

Theoretical discussion and model results in the paper showing planner's shifting objective when labor is devalued.

high positive NBER WORKING PAPER SERIES focus of technological steering (monetary productivity vs non-monetary well-bein...

The welfare benefits of steering technology are greater the less efficient social safety nets are.

Analytical result from the paper's theoretical model comparing a planner who can/cannot perform transfers and evaluating steering as second-best when redistribution is costly.

high positive NBER WORKING PAPER SERIES welfare benefits of steering technological progress

These household-level non-market productivity gains (ChatGPT making productive online tasks more efficient and freeing time for leisure) are economically large and likely constitute a substantial share of the overall economic impact of generative AI.

Combination of empirical IV estimates showing leisure increases and productivity-unchanged productive time, plus model-implied efficiency gains; authors' interpretation and welfare discussion in paper.

high positive https://arxiv.org/pdf/2603.03144 household non-market productivity and welfare (implied aggregate economic impact...

Mapping the empirical time-reallocation into a quantitative household time-allocation model implies generative AI approximately doubles the efficiency of productive online tasks for adopters; preferred calibration implies efficiency gains of 76%–176%.

Quantitative time-allocation model adapted from Aguiar et al. (2021); model uses empirical IV estimates for time reallocation and Engel curve elasticities estimated via IV (local precipitation shocks). Authors report implied efficiency gains of 76%–176% and state 'approximately doubles' efficiency.

high positive https://arxiv.org/pdf/2603.03144 efficiency (productivity) of productive digital tasks

Households predominantly utilize ChatGPT in the context of productive online activities (education, job search, informational research) rather than during leisure browsing, as inferred from the browsing context around ChatGPT use.

High-frequency analysis comparing 30-minute browsing intervals around ChatGPT visits to intervals of demographically similar non-users; LLM-based inference of website purpose; observed co-occurrence with productive-site categories.

high positive https://arxiv.org/pdf/2603.03144 context/purpose of ChatGPT use (productive vs leisure)

ChatGPT adoption increases the leisure share of browsing duration by about 30 percentage points.

IV long-difference estimates from Comscore browsing data with LLM-based site classification; authors report a ~30 percentage point increase in leisure share after adoption.

high positive https://arxiv.org/pdf/2603.03144 leisure share of total browsing duration

In long-difference IV estimates, ChatGPT adoption raises total leisure browsing time by roughly 150 log points.

IV long-difference estimates using pre-ChatGPT exposure as instrument; reported effect described as 'roughly 150 log points' increase in total leisure browsing time.

high positive https://arxiv.org/pdf/2603.03144 total leisure browsing time (log change)

A household's pre-ChatGPT ex-ante exposure (based on 2021 browsing composition) strongly predicts subsequent ChatGPT adoption: a 1 SD higher exposure predicts a 2.5 percentage point higher rate of having used ChatGPT by December 2024.

Constructed 'exposure' measure by aggregating site-level overlap with chatbot capabilities over household 2021 browsing; predictive regression (household-level) linking 1 SD change in exposure to 2.5pp higher adoption by Dec 2024 (statistic reported in paper).

high positive https://arxiv.org/pdf/2603.03144 probability / rate of ChatGPT adoption by Dec 2024

ChatGPT adoption among private households has been rapid following release, but adoption is far from uniform.

Descriptive adoption patterns measured from Comscore browsing data over time (pre- and post-Nov 30, 2022) on the household panel (2021–2024); time-series of observed ChatGPT site visits and adoption rates.

high positive https://arxiv.org/pdf/2603.03144 ChatGPT adoption rate over time

Task-level analyses show that activities expanded in AI-enabled projects—particularly ideation and experimentation—are increasingly compatible with large language model capabilities, suggesting potential for future productivity gains as these technologies mature.

Task-level classification mapping tasks described in proposals to LLM-relevant capabilities using LLM-based classification; finding that tasks expanded in AI-enabled projects cluster on ideation and experimentation, which align with current LLM strengths.

high positive Artificial Intelligence in Science: Returns, Reallocation, a... frequency/expansion of specific task categories (ideation, experimentation) and ...

AI-enabled projects undertake a broader set of tasks.

Task-level analysis of proposal descriptions (task inventories) classifying tasks via keyword extraction and LLMs, showing AI-enabled proposals list a wider variety of activities than non-AI proposals.

high positive Artificial Intelligence in Science: Returns, Reallocation, a... breadth/variety of tasks undertaken in projects

AI-enabled projects involve larger teams.

Comparison of team structure in proposals (team size) between AI-enabled and non-AI projects using the same comprehensive proposal dataset and LLM-based classification of AI presence.

high positive Artificial Intelligence in Science: Returns, Reallocation, a... team size / team structure

AI-enabled projects reallocate resources toward human capital (i.e., shift budget allocations toward labor / human capital).

Analysis of detailed budget allocations in the proposal dataset, comparing projects identified as AI-enabled versus non-AI projects using keyword extraction and LLM classification to identify AI presence and role.

high positive Artificial Intelligence in Science: Returns, Reallocation, a... budget allocation share toward human capital (labor share)

In the short run, AI adoption is associated with modest improvements in scientific outcomes concentrated in the upper tail.

Observational analysis linking identified AI presence in a comprehensive dataset of research proposals (funded and unfunded) to subsequent publication outcomes; AI presence identified via keyword extraction combined with large language model (LLM) classification; publication outcomes measured after proposal submission.

high positive Artificial Intelligence in Science: Returns, Reallocation, a... subsequent publication outcomes (scientific outcomes)

The experience-centered learning mechanism proactively recalls rewarded trajectories at inference time.

Specific technical/design claim about Synergy's learning mechanism; asserted in paper as a mechanism feature rather than demonstrated with quantified results in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... agent learning behavior (recall of rewarded trajectories during inference)

Synergy grounds collaboration in session-native orchestration, repository-backed workspaces, and social communication; identity in typed memory, notes, agenda, skills, and persistent social relationships; and evolution in an experience-centered learning mechanism that proactively recalls rewarded trajectories at inference time.

Detailed design claims describing Synergy's mechanisms and intended grounding for collaboration, identity, and evolution; presented as architectural description, no experimental evaluation provided in the excerpt.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... architectural features supporting collaboration, identity, and learning (session...

We present Synergy, a general-purpose agent architecture and runtime harness for persistent, collaborative, and evolving agents on Open Agentic Web.

Paper's contribution statement indicating the authors propose an architecture named Synergy; this is a systems/design claim rather than an empirical result in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... existence of an architecture (Synergy) designed for persistent, collaborative, e...

The next generation of agents must become Agentic Citizens, defined by three requirements: Agentic-Web-Native Collaboration, participation in open collaboration networks rather than only closed internal orchestration; Agent Identity and Personhood, continuity as a social entity rather than a resettable function call; and Lifelong Evolution, improvement across task performance, communication, and collaboration over time.

Normative/design prescription from the authors; conceptual argument for three requirements rather than empirical validation.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... agent design properties (collaboration, identity/personhood, lifelong evolution)

As the internet prepares to host billions of such entities, it is shifting toward what we call Open Agentic Web, a decentralized digital ecosystem in which agents from different users, organizations, and runtimes can discover one another, negotiate task boundaries, and delegate work across open technical and social surfaces at scale.

Conceptual claim / framing by the authors describing a projected/ongoing shift; no empirical measurement of 'billions' or of ecosystem properties provided in the excerpt.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... emergence of an Open Agentic Web (decentralized agent ecosystem, discovery, nego...

Embodied agents are spreading across smartphones, vehicles, and robots.

Author observation/claim in the paper's opening; no empirical study, metrics, or examples quantified in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... deployment/penetration of embodied agents across device categories

Open-source frameworks such as OpenClaw are putting personal agents in the hands of millions.

Author assertion naming OpenClaw and a numeric adoption claim; no supporting empirical data or citation contained in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... adoption of personal agents (number of users)

AI agents are rapidly expanding in both capability and population: they now write code, operate computers across platforms, manage cloud infrastructure, and make purchasing decisions.

Author assertion in paper's introduction / high-level observation; no empirical study, dataset, or experiment reported in the provided text.

high positive Synergy: A Next-Generation General-Purpose Agent for Open Ag... agents' capabilities and population growth (write code, operate computers, manag...

Education and workforce development should shift focus from rote knowledge accumulation to cultivating skills in human-AI collaboration, creative problem-solving, and the design of novel economic domains.

Normative policy recommendation derived from the paper's framework and analysis of anticipated labor market changes (no empirical evaluation or trial data reported in the abstract).

high positive AI Civilization and the Transformation of Work educational focus / skill composition

Human-AI co-evolution will significantly increase individual productivity and open new frontiers of economic activity.

Projected outcome based on combined analysis of AI capabilities, historical patterns, and platform growth; the abstract does not report empirical measurement or sample sizes for this projection.

high positive AI Civilization and the Transformation of Work individual productivity and emergence of new economic activities

AI-driven productivity augmentation dramatically lowers the barriers to creating economic value, enabling the decentralized generation of employment.

Argument supported by paper's analysis of contemporary labor market dynamics and the growth of digital platforms; no quantified empirical estimates or sample sizes provided in the abstract.

high positive AI Civilization and the Transformation of Work barriers to entry for value creation / individual productivity

The transition to an AI-civilization will fundamentally restructure the mechanisms of employment creation from a centralized model (few organizations creating jobs for the many) to a decentralized ecosystem where individuals are empowered to generate their own employment opportunities.

Central thesis of the paper, motivated by theoretical argumentation and synthesis of contemporary data on labor markets and digital platforms (no empirical test or sample sizes specified in the abstract).

high positive AI Civilization and the Transformation of Work structure/mechanism of employment creation (centralized vs decentralized)

Historical precedents from past technological revolutions suggest that innovation tends to expand, rather than shrink, the scope of economic activity and employment in the long run.

Paper draws on analysis of economic history (qualitative historical analysis implied; no specific historical datasets or sample sizes provided in the abstract).

high positive AI Civilization and the Transformation of Work scope of economic activity and long-run employment levels

The paper studies principal-agent alignment using revealed preference techniques.

Stated methodological approach in the abstract; implies analytical use of revealed-preference methods for identification.

high positive A Revealed Preference Framework for AI Alignment methodological approach (use of revealed preference techniques to study alignmen...

The AI's alignment (similarity of human and AI preferences) can be generically identified in the field setting, where only AI choices are observed.

Analytical/theoretical identification result presented in the paper using revealed preference techniques (as stated in abstract); no empirical sample reported in the abstract.

high positive A Revealed Preference Framework for AI Alignment identifiability of AI alignment parameter from observed AI-only choices (field s...

The AI's alignment (similarity of human and AI preferences) can be generically identified in the laboratory setting, where both human and AI choices are observed.

Analytical/theoretical identification result presented in the paper using revealed preference techniques (as stated in abstract); no empirical sample reported in the abstract.

high positive A Revealed Preference Framework for AI Alignment identifiability of AI alignment parameter from observed human and AI choices (la...

The paper introduces the Luce Alignment Model, where the AI's choices are a mixture of two Luce rules, one reflecting the human's preferences and the other the AI's.

Paper proposes and defines a new theoretical model (model specification described in abstract).

high positive A Revealed Preference Framework for AI Alignment model specification of AI choice behavior (mixture of Luce rules)

Human decision makers increasingly delegate choices to AI agents.

Stated as motivation in the abstract; no empirical data or sample described in the provided text.

high positive A Revealed Preference Framework for AI Alignment frequency of delegation of choices to AI agents

By formalizing the end-to-end transaction model together with its asset and incentive layers, EpochX reframes agentic AI as an organizational design problem focused on infrastructures where verifiable work leaves persistent, reusable artifacts and value flows support durable human-agent collaboration.

Theoretical framing and normative claim in the paper; no empirical evaluation demonstrating that this reframing yields measurable benefits.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... organizational framing and potential for durable human-agent collaboration

Credits lock task bounties, allow budget delegation, settle rewards upon acceptance, and compensate creators when verified assets are reused.

Functional description of the credit mechanics and settlement rules within the proposed EpochX marketplace; presented as part of system design without empirical settlement or user-behavior data.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... incentive flows, reward settlement, and compensation for asset reuse

EpochX introduces a native credit mechanism to make participation economically viable under real compute costs.

Proposed economic/incentive mechanism described in the paper; no empirical cost analysis, pricing model validation, or participant economic outcomes reported.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... economic viability of participation under compute costs

These assets are stored with explicit dependency structure, enabling retrieval, composition, and cumulative improvement over time.

Design-level assertion about data model/asset graph in the EpochX proposal; no empirical results demonstrating retrieval/composition or measured cumulative improvement.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... asset retrieval/composition and cumulative improvement

Each completed transaction can produce reusable ecosystem assets, including skills, workflows, execution traces, and distilled experience.

Architectural claim about artifacts produced per transaction in EpochX; described as a design goal rather than backed by empirical evidence or deployment data.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... creation of reusable assets (skills, workflows, traces, distilled experience)

Claimed tasks can be decomposed into subtasks and executed through an explicit delivery workflow with verification and acceptance.

Design description of the workflow and verification/acceptance mechanisms in the proposed EpochX architecture; no empirical testing or metrics reported.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... task execution workflow, verification and acceptance outcomes

EpochX treats humans and agents as peer participants who can post tasks or claim them.

Architectural/design specification in the paper describing participant roles and interactions; no empirical validation provided.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... task posting and claiming behavior / task allocation model

We introduce EpochX, a credits-native marketplace infrastructure for human-agent production networks.

System/design description in the paper (architectural proposal); no deployment, user study, or evaluation results reported.

high positive EpochX: Building the Infrastructure for an Emergent Agent Ci... marketplace infrastructure availability / adoption potential

Google has been pioneering machine learning usage across dozens of products.

Contextual statement in the abstract about the organization's activity; asserted without empirical detail in abstract.

high positive A Multi-agent AI System for Deep Learning Model Migration fr... extent of ML usage across Google products

The techniques and approaches described can be generalized for other framework migrations and general code transformation tasks.

Authors' stated expectation/generalization claim in the abstract; no empirical evidence or cross-framework experiments reported in the abstract.

high positive A Multi-agent AI System for Deep Learning Model Migration fr... generalizability to other framework migrations / code transformation tasks

The system creates a virtuous circle where effectively AI supports its own development workflow.

Conceptual claim supported by the system's design and reported improvements that enable iterative AI-assisted development; described qualitatively in the paper.

high positive A Multi-agent AI System for Deep Learning Model Migration fr... self-supporting/iterative improvement of AI-assisted development workflow

Our approach dramatically reduces the time (6.4x-8x speedup) for deep learning model migrations.

Quantitative speedup figure reported in the paper's abstract (6.4x-8x); likely based on measured migration times on demonstrated cases, though the abstract does not state sample size or exact experimental setup.

high positive A Multi-agent AI System for Deep Learning Model Migration fr... time required to perform deep learning model migrations

The system accelerates code migrations in a large hyperscaler environment on commercial real-world use-cases.

Reported demonstration and evaluation in a hyperscaler (commercial) environment using real-world cases as described in the paper; no detailed sample size given in abstract.

high positive A Multi-agent AI System for Deep Learning Model Migration fr... speed of code migrations in commercial/hyperscaler environment

We define quality metrics and AI-based judges that accelerate development when the code to evaluate has no tests and has to adhere to strict style and dependency requirements.

Design and implementation of quality metrics and AI-based judges described in the paper; claimed acceleration of development workflow (no numeric quantification in abstract).

high positive A Multi-agent AI System for Deep Learning Model Migration fr... development speed / time to develop when evaluating untested code under strict s...

We built an AI-based multi-agent system to support automatic migration of TensorFlow-based deep learning models into JAX-based ones.

System implementation and description in the paper; demonstration on real-world code migration tasks in a hyperscaler environment (qualitative description in abstract).

high positive A Multi-agent AI System for Deep Learning Model Migration fr... existence and functioning of an AI-based migration system

« Prev 1 2 3 … 80 81 82 … 129 130 Next »