Evidence (13827 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	749	195	97	889	1979
Governance & Regulation	815	391	188	121	1539
Organizational Efficiency	771	189	124	83	1177
Technology Adoption Rate	624	233	123	96	1084
Research Productivity	410	121	56	331	929
Output Quality	466	177	59	47	749
Decision Quality	320	174	75	42	618
Firm Productivity	435	55	88	20	604
AI Safety & Ethics	214	276	65	33	593
Market Structure	178	166	122	24	495
Task Allocation	206	64	70	31	376
Skill Acquisition	165	57	60	17	299
Innovation Output	201	27	41	18	288
Employment Level	105	51	107	13	278
Fiscal & Macroeconomic	131	69	43	26	276
Consumer Welfare	116	63	42	11	232
Firm Revenue	149	46	26	3	224
Inequality Measures	44	122	49	6	221
Task Completion Time	169	29	8	12	219
Worker Satisfaction	89	61	20	12	182
Error Rate	69	91	10	2	172
Regulatory Compliance	76	68	14	5	163
Training Effectiveness	92	19	13	19	145
Wages & Compensation	77	36	25	6	144
Automation Exposure	51	54	22	12	142
Team Performance	86	17	27	9	140
Developer Productivity	94	17	14	6	132
Job Displacement	12	80	20	1	113
Hiring & Recruitment	51	7	8	3	69
Skill Obsolescence	5	45	6	1	57
Creative Output	31	16	7	2	57
Social Protection	27	16	8	2	53
Labor Share of Income	17	17	17	—	51
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1
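
As a reading aid, the counts above can be converted into direction shares per outcome. The sketch below is a minimal illustration that copies three rows from the table; the function name `direction_shares` is an assumption introduced here and is not part of the source. In some rows the four direction columns sum to less than the Total column (for example, Firm Productivity: 598 versus 604), so shares computed against Total need not sum to 1.

```python
# Rows copied from the table above: (positive, negative, mixed, null, total).
rows = {
    "Firm Productivity": (435, 55, 88, 20, 604),
    "Job Displacement": (12, 80, 20, 1, 113),
    "Skill Obsolescence": (5, 45, 6, 1, 57),
}


def direction_shares(pos, neg, mixed, null, total):
    """Return each direction's share of the row's Total column (hypothetical helper)."""
    return {
        "positive": pos / total,
        "negative": neg / total,
        "mixed": mixed / total,
        "null": null / total,
    }


shares = {name: direction_shares(*counts) for name, counts in rows.items()}

for name, s in shares.items():
    print(f"{name}: positive {s['positive']:.1%}, negative {s['negative']:.1%}")
```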

AI use is concentrated in large firms and knowledge-intensive sectors, reaching 50%–60% (60%–70%, employment-weighted) among very large firms in Information, Professional Services, and Finance.

Stratified descriptive statistics by firm size and industry from the BTOS AI supplement (Nov 2025–Jan 2026); employment-weighted estimates reported; exact sample sizes by stratum not provided in excerpt.

high positive The Microstructure of AI Diffusion: Evidence from Firms, Bus... firm-level AI adoption by firm size and sector

Adoption is expected to reach 22% of firms within six months.

Survey question asking firms about expected near-term adoption (BTOS AI supplement, Nov 2025–Jan 2026), producing a stated expected adoption rate; sample size not given.

high positive The Microstructure of AI Diffusion: Evidence from Firms, Bus... expected firm-level AI adoption within six months

Employment-weighted adoption rate was 32% (i.e., 32% of employment is in firms using AI in at least one function).

Employment-weighted descriptive statistic from the BTOS AI supplement covering Nov 2025–Jan 2026 (survey-based weighting by employment; sample size not stated).

high positive The Microstructure of AI Diffusion: Evidence from Firms, Bus... employment-weighted firm AI adoption
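
The gap between a firm-weighted and an employment-weighted adoption rate can be illustrated with a small sketch. The firm records below are invented toy data, not BTOS figures, and `adoption_rates` is a hypothetical helper name; the survey's actual weighting procedure is not described in the excerpt.

```python
def adoption_rates(firms):
    """Compute (firm share, employment-weighted share) of AI adoption.

    firms: list of (employment, uses_ai) tuples. Hypothetical input format.
    """
    n_firms = len(firms)
    total_employment = sum(emp for emp, _ in firms)
    # Share of firms that use AI: every firm counts once.
    firm_share = sum(1 for _, uses_ai in firms if uses_ai) / n_firms
    # Share of employment located in AI-using firms: firms count by headcount.
    employment_share = sum(emp for emp, uses_ai in firms if uses_ai) / total_employment
    return firm_share, employment_share


# Toy data (assumption): one large adopter, one small adopter, three small non-adopters.
toy_firms = [(1000, True), (50, False), (20, False), (10, False), (20, True)]
firm_share, employment_share = adoption_rates(toy_firms)
print(f"firm share: {firm_share:.0%}, employment-weighted: {employment_share:.0%}")
```

When adoption is concentrated in large firms, the employment-weighted rate exceeds the firm-level rate, which matches the direction of the gap reported in these claims.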

During Nov 2025–Jan 2026, 18% of firms used AI in at least one function.

Descriptive statistics from the 2026 AI supplement to the U.S. Census Bureau’s Business Trends and Outlook Survey (BTOS), fielded Nov 2025–Jan 2026; nationally representative firm survey (sample size not stated in excerpt).

high positive The Microstructure of AI Diffusion: Evidence from Firms, Bus... firm-level AI adoption (use in at least one function)

Industrial robots influence global value chain length primarily through technological innovation.

Mechanism analysis in the paper linking robot adoption to technological innovation measures and then to GVC length, based on the IFR and 14-subsector panel data; exact innovation indicators and estimation details not provided in the abstract.

high positive Research on the impact of industrial robot application on th... global value chain length (mediated by technological innovation)

Industrial robots influence global value chain length primarily through human capital upgrading.

Mechanism analysis reported in the paper linking robot adoption to changes in human capital (upgrading) and then to changes in GVC length using the same IFR and panel data; specific tests/mediation approaches not detailed in the abstract.

high positive Research on the impact of industrial robot application on th... global value chain length (mediated by human capital upgrading)

Industrial robots promote participation in global production networks within capital-intensive industries (i.e., they increase global value chain length for capital-intensive sectors).

Subsample or heterogeneous-effects analysis across capital-intensive vs. labor-intensive sub-sectors using the panel of 14 Chinese manufacturing sub-sectors; results reported for capital-intensive industries as positive effect on GVC participation/length.

high positive Research on the impact of industrial robot application on th... participation in global production networks / global value chain length (capital...

The application of industrial robots significantly extends the length of global value chains in manufacturing.

Empirical analysis using IFR robot data and panel data on 14 manufacturing sub-sectors; significance reported in paper (panel regression results). Exact model specifications and significance levels not provided in the abstract.

high positive Research on the impact of industrial robot application on th... global value chain length

The results generalize existing optimality theorems for fairness-constrained classification and extend them to generalized fairness metrics and partial fairness regimes.

Mathematical generalization and extension of prior theorems to a broader class of fairness metrics and to settings with partial (not full) fairness constraints; proofs provided in the paper.

high positive Fairness vs Performance: Characterizing the Pareto Frontier ... generality/extent of optimality theorems (coverage of fairness metrics and parti...

This result complements existing optimality theorems from the literature which, for specific fairness constraints, posit lower-bound threshold rules only.

Comparative theoretical discussion and extension of prior optimality results (literature comparison plus proofs showing how their characterization extends prior lower-bound-only threshold results).

high positive Fairness vs Performance: Characterizing the Pareto Frontier ... relation to and extension of existing fairness-constrained optimality theorems

The Pareto frontier consists of deterministic, group-specific threshold rules applied to individuals' success probability.

Theoretical analysis framing decision making as a multi-objective optimization problem (decision-maker utility vs. group fairness) and deriving the set of Pareto-optimal decision rules for arbitrary utility functions, arbitrary population distributions, and a wide range of group fairness metrics (mathematical proofs/derivations).

high positive Fairness vs Performance: Characterizing the Pareto Frontier ... form of Pareto-optimal decision rules (deterministic, group-specific thresholds ...
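
A deterministic, group-specific threshold rule can be sketched as follows. This is a minimal illustration showing only a lower-bound form; the threshold values and group labels are hypothetical, and the paper's characterization may admit threshold forms beyond the one shown here.

```python
def threshold_rule(success_prob, group, thresholds):
    """Accept an individual iff their success probability meets their group's threshold.

    thresholds: dict mapping group label -> cutoff in [0, 1] (hypothetical values).
    """
    return success_prob >= thresholds[group]


# Hypothetical cutoffs: the rule is deterministic but differs by group.
thresholds = {"A": 0.6, "B": 0.5}
applicants = [(0.55, "A"), (0.55, "B"), (0.70, "A")]
decisions = [threshold_rule(p, g, thresholds) for p, g in applicants]
print(decisions)
```

The same success probability (0.55) leads to different decisions in groups A and B, which is what "group-specific" means in this context.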

Regulatory modernisation, secure national data infrastructure and targeted digital training are essential to enable sustainable innovation in valuation practice.

Policy and practitioner recommendations derived from interview data and thematic analysis; synthesis into prescriptive recommendations.

high positive Exploring barriers to valuation technology adoption in prope... enablers of sustainable VTech innovation

A majority seems optimistic about [AI's] overall impact.

Paper reports a majority-level positive attitude in surveys about AI's overall impact (no survey details or sample sizes provided in the excerpt).

high positive AI’s Economy and Its Political and Institutional Consequence... overall public optimism about AI

The framework and results are developed/applied to two instances: AI agent oversight (motivating setting) and marketplace operation (a parallel mechanism-design domain).

Paper includes two instantiated examples/applications illustrating the formal framework: one in AI agent oversight and one in marketplace operation (illustrative case studies within the theoretical paper).

high positive The Endogeneity of Miscalibration: Impossibility and Escape ... applicability of theoretical results to AI oversight and marketplace operation d...

A constructive escape exists: a step-function approval threshold achieves first-best screening for every strictly proper scoring rule, because the agent's binary inflate-or-not choice creates a type-space threshold regardless of the generator's curvature.

Constructive existence proof in the paper showing a step-function approval rule that attains first-best screening; analytical argument based on agent's binary inflate/not strategy.

high positive The Endogeneity of Miscalibration: Impossibility and Escape ... achievement of first-best screening / principal welfare under step-function appr...

The principal's optimal oversight necessarily uses a non-affine approval function to screen types.

Analytical result derived from the paper's formal principal-agent model and optimization of the principal's objective (theoretical proof).

high positive The Endogeneity of Miscalibration: Impossibility and Escape ... shape of the approval function used in optimal oversight (affine vs. non-affine)

The framework is illustrated with applications in income-based social protection programs and humanitarian demining in Colombia, where the tension between screening costs and allocation efficiency is operationally consequential.

Applied examples / case studies presented in the paper (applications to social protection and humanitarian demining contexts).

high positive The Limits of AI-Driven Allocation: Optimal Screening under ... operational consequences of screening cost vs allocation efficiency trade-off

Efficiency gains from screening grow as the aleatoric uncertainty in the population increases.

Empirical characterization and/or model-based analysis presented in the paper (claims based on theoretical comparative statics and illustrative empirical examples).

high positive The Limits of AI-Driven Allocation: Optimal Screening under ... efficiency gains from screening (improvement in allocation performance)

In a two-stage allocation framework where a screening stage observes true outcomes for a subset of units before a final allocation under a fixed coverage budget, the optimal strategy screens units at the margin of algorithmic allocation while directly targeting the highest-risk units.

Analytical result derived from the paper's two-stage allocation model (theoretical/mathematical analysis of optimal screening and allocation policy).

high positive The Limits of AI-Driven Allocation: Optimal Screening under ... allocation efficiency / optimality of screening and targeting strategy
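
The screen-at-the-margin idea can be sketched with a simple heuristic. This is an illustration under stated assumptions, not the paper's optimal policy: the window placement around the budget cutoff, the function name `two_stage_allocate`, and the toy data are all hypothetical.

```python
def two_stage_allocate(scores, true_outcome, budget, n_screen):
    """Heuristic two-stage allocation (illustrative only).

    scores: predicted risk per unit.
    true_outcome: callable idx -> bool, consulted only for screened units.
    budget: number of units that can receive the allocation.
    n_screen: number of units whose true outcome can be observed.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    # Place the screening window around the rank where the budget cutoff falls.
    lo = max(0, budget - n_screen // 2)
    hi = min(len(order), lo + n_screen)
    direct = order[:lo]       # highest-risk units: targeted without screening
    screened = order[lo:hi]   # marginal units: true outcome is observed
    confirmed = [i for i in screened if true_outcome(i)]
    chosen = direct + confirmed[: budget - len(direct)]
    # Any leftover slots go to the best-scored units below the screening window.
    chosen += order[hi : hi + (budget - len(chosen))]
    return set(chosen)


# Toy data (assumption): unit 2 is a false positive near the cutoff.
scores = [0.9, 0.8, 0.6, 0.55, 0.5, 0.2]
truth = [True, True, False, True, True, False]
allocated = two_stage_allocate(scores, lambda i: truth[i], budget=3, n_screen=2)
print(sorted(allocated))
```

In this toy case, ranking by score alone would select units 0, 1, and 2; screening the two units at the margin replaces the false positive (unit 2) with unit 3.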

Algorithmic targeting is typically cheaper and faster than traditional screening procedures that directly observe the latent vulnerability status through physical verification.

Comparative claim stated in paper introduction; presented as typical advantage of algorithmic targeting (background rationale).

high positive The Limits of AI-Driven Allocation: Optimal Screening under ... cost and speed of targeting procedures

The rise of machine learning has shifted targeted resource allocation in policy and humanitarian settings toward algorithmic targeting based on predicted risk scores.

Descriptive statement in paper introduction; references to the adoption of algorithmic targeting in policy/humanitarian contexts (motivation/background rather than new empirical data).

high positive The Limits of AI-Driven Allocation: Optimal Screening under ... use of algorithmic targeting (shift in allocation method)

The contribution of the paper is conceptual and programmatic, presenting a vocabulary, set of candidate primitives, and a research agenda for an agent-aware model of web analytics.

Statement of the paper's scope and contributions (position paper content description; no empirical sample).

high positive The Vanishing User: Web Analytics in an Agent-Dominated Inte... existence of a conceptual/programmatic research agenda for agent-aware web analy...

The paper proposes five candidate measurement primitives — task chain, actor class, interaction provenance, objective alignment, and signal authenticity — with explicit operational definitions.

Prescriptive contribution of the position paper (proposal based on conceptual synthesis; no empirical validation sample reported).

high positive The Vanishing User: Web Analytics in an Agent-Dominated Inte... proposed measurement primitives for agent-aware web analytics

The paper develops a working taxonomy of crawlers, traditional bots, AI agents, LLM-powered agents, and autonomous agents.

Descriptive contribution of the position paper (conceptual taxonomy developed from literature synthesis; no empirical validation sample reported).

high positive The Vanishing User: Web Analytics in an Agent-Dominated Inte... existence and structure of a proposed taxonomy distinguishing types of automated...

Autonomous AI agents are emerging as a further class of actors layered on top of automated traffic.

Qualitative synthesis/observation of recent developments in AI agents and system design; forward-looking claim in a conceptual paper (no empirical sample presented).

high positive The Vanishing User: Web Analytics in an Agent-Dominated Inte... emergence and presence of autonomous AI agents in web traffic

Crawlers and traditional bots already account for a substantial fraction of online interactions.

Assertion grounded in synthesis of existing bot-detection and web-traffic measurement literature cited in the paper (no new empirical sample reported in this position paper excerpt).

high positive The Vanishing User: Web Analytics in an Agent-Dominated Inte... share of online interactions generated by crawlers and traditional bots

Deep learning models (particularly LSTM and Transformer) exhibit stronger tail-risk control than traditional benchmark models.

Empirical risk analysis reported in the paper (tail-risk metrics/comparisons) indicating better tail-risk outcomes for LSTM and Transformer relative to linear and tree-based benchmarks.

high positive Optimizing stock market prediction and stock trading strateg... tail-risk control (tail-risk metrics)

Deep learning models (especially LSTM and Transformer) produce more stable WEI scores than traditional benchmarks.

Empirical comparison of WEI (the paper's proposed weighted evaluation index) across model types showing LSTM and Transformer with more stable (less variable/improved) WEI over the evaluation period.

high positive Optimizing stock market prediction and stock trading strateg... WEI stability / WEI scores

Deep learning models, particularly LSTM and Transformer, deliver superior prediction accuracy compared to traditional benchmarks (linear and tree-based models).

Empirical model comparison using rolling-window forecasts on A-share data (2013–2024) across the listed factors; accuracy metrics reported in the paper (e.g., RMSE or similar) show better performance for deep learning models, with LSTM and Transformer highlighted.

high positive Optimizing stock market prediction and stock trading strateg... prediction accuracy
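
Rolling-window evaluation of the kind described can be sketched generically. The excerpt does not state the exact metric or window length, so RMSE, the window size, the toy series, and the two naive forecasters below are assumptions chosen purely for illustration; none of them are the paper's models.

```python
import math


def rolling_window_rmse(series, window, fit_predict):
    """Score one-step-ahead forecasts made from a sliding training window.

    fit_predict: callable taking the training window and returning a forecast
    for the next observation (hypothetical interface).
    """
    squared_errors = []
    for t in range(window, len(series)):
        train = series[t - window : t]  # only past data is visible at time t
        forecast = fit_predict(train)
        squared_errors.append((forecast - series[t]) ** 2)
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def last_value(train):
    """Naive forecaster: repeat the most recent observation."""
    return train[-1]


def window_mean(train):
    """Naive forecaster: average of the training window."""
    return sum(train) / len(train)


toy_series = [1, 2, 3, 4, 5, 6]  # toy data, not A-share returns
rmse_last = rolling_window_rmse(toy_series, 3, last_value)
rmse_mean = rolling_window_rmse(toy_series, 3, window_mean)
print(rmse_last, rmse_mean)
```

Any model can be compared on equal footing by passing a different `fit_predict` callable, as long as it uses only the training window.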

The paper articulates a research agenda for how MASS should be modeled, evaluated and governed.

Stated in the abstract (position paper concludes with an articulated research agenda); evidence is the discussion and proposed agenda sections in the paper.

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... proposed research directions for modeling, evaluation and governance of MASS

The importance of each structural prior is demonstrated through formal propositions.

Methodological claim in the abstract that the paper provides formal propositions demonstrating the role/importance of the four priors; evidence contained in proofs/propositions within the paper.

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... formal/theoretical demonstration of the role of each structural prior

MASS is represented as a class of dynamical systems of information generation, local influence and interaction structure, formulated by four structural priors anchored in social theory: strategic heterogeneity, networked-constrained dependence, co-evolution and distributional instability.

Descriptive claim from the abstract about the formal structure of MASS; supported by the framework and definitions presented in the paper (formal/modeling content).

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... formal representation of multi-agent dynamics via four structural priors

The paper formalizes a Multi-Agent Social Systems (MASS) framework for how agents interact and influence to generate system-level outcomes.

Direct methodological claim in abstract indicating the authors present a formal framework (MASS) in the paper; evidence consists of the formalization provided in the paper (propositions, definitions).

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... formal modeling of agent interactions and system-level outcomes

Agentic AI systems must be modeled with social theory as a structural prior.

Normative / prescriptive claim from the paper's abstract (position paper arguing for this modeling choice; supported by the authors' theoretical arguments and formal framework in the paper).

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... modeling approach for agentic AI systems (use of social-theory structural priors...

The emergent dynamics of individuals in social groups have long been studied by social scientists in human contexts.

Historical/contextual claim in the abstract; supported by reference to social-science literature (no sample size; general scholarly consensus).

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... existence of a body of social-science research on emergent group dynamics

In multi-agent social settings, system behavior emerges not from individual agents alone, but from the multi-agent interactions over time.

Conceptual claim in the paper's abstract, supported by the paper's argumentation and references to social-science literature on emergent dynamics (formal development likely in main text).

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... emergent system-level behavior resulting from agent interactions

Agentic AI systems are increasingly deployed not in isolation, but inside social environments populated by other agents and humans (e.g., social media platforms, multi-agent LLM pipelines, autonomous robotics fleets).

Statement from the paper's abstract and motivating examples; implied supporting citation/literature review in the paper (no empirical sample size reported in abstract).

high positive Social Theory Should Be a Structural Prior for Agentic AI: A... deployment prevalence of agentic AI inside social environments (multi-agent sett...

Evaluation indicates improved architectural consistency and deployability compared to general-purpose AI code generation workflows, suggesting that constraint-aware retrieval is essential for aligning AI-assisted service development with production software engineering practices.

Paper reports an evaluation comparing the proposed retrieval-augmented scaffolding approach to general-purpose AI code generation workflows and concludes improvements in architectural consistency and deployability; the excerpt does not provide evaluation design details, metrics, or sample size.

high positive Architectural Constraints Alignment in AI-assisted, Platform... architectural consistency and deployability

By combining template retrieval with structured interaction, the method embeds production-relevant considerations during service scaffolding.

Paper's description of the mechanism by which the proposed approach operates (template retrieval + structured interaction) to incorporate production concerns; presented as a design claim without detailed empirical quantification in the excerpt.

high positive Architectural Constraints Alignment in AI-assisted, Platform... embedding of production-relevant considerations in scaffolding

We propose a retrieval-augmented scaffolding approach that combines platform-based code generation with agentic clarification loops to expose and resolve architectural constraint ambiguities.

Methodological contribution described in the paper: a retrieval-augmented scaffolding method combining template retrieval and agentic clarification loops; this is a proposed approach rather than reported empirical proof in the provided text.

high positive Architectural Constraints Alignment in AI-assisted, Platform... exposure and resolution of architectural constraint ambiguities

AI-assisted development tools enable rapid prototyping of services.

Stated assertion in paper's introduction/abstract that AI-assisted tools speed up prototyping; no quantitative evaluation or sample size given in the provided text.

high positive Architectural Constraints Alignment in AI-assisted, Platform... rapid prototyping (development speed/productivity)

The C³ Framework provides implementable design patterns and testable propositions intended to help accounting leaders capture productivity gains from human + AI work while preserving accountability, consistency, and alignment with governance expectations in high-stakes reporting contexts.

Conclusions section stating intended practical utility; presented as intended outcomes of applying the proposed framework, not as empirically demonstrated results in this paper.

high positive Collaborative Intelligence in Accounting: A Human + AI Compl... Organizational Efficiency

The paper proposes a role taxonomy that clarifies review responsibility, escalation thresholds, and evidence retention for human–AI collaboration in accounting.

Results section proposing a role taxonomy as part of the C³ Framework; presented as a design artifact derived from synthesis of research and guidance.

high positive Collaborative Intelligence in Accounting: A Human + AI Compl... Task Allocation

The framework specifies five mandatory control points for high-judgment use cases: source grounding and traceability, independent verification and tie-out, contradiction testing, escalation and approval, and audit-trail logging.

Results section listing five control points as mandatory design elements for high-judgment accounting use cases; conceptual recommendation from synthesis.

high positive Collaborative Intelligence in Accounting: A Human + AI Compl... Governance & Regulation

The paper develops the C³ Framework—Complementarity, Controls, and Competencies—which maps accounting tasks by task structure and judgment/materiality to recommend collaboration modes.

Results section: conceptual framework developed by the authors based on synthesized literature and guidance; no reported empirical validation in the abstract.

high positive Collaborative Intelligence in Accounting: A Human + AI Compl... Task Allocation

AI accelerates drafting, summarization, and pattern detection in accounting while professionals remain accountable for judgment, materiality, and defensibility in financial reporting and analysis.

Statement in paper summarizing literature and practitioner guidance (2023–2025); conceptual synthesis rather than new empirical data.

high positive Collaborative Intelligence in Accounting: A Human + AI Compl... Task Completion Time

AI tools can serve as valuable aids in task splitting, provided there is human oversight to filter out irrelevant tasks.

Paper's conclusion synthesizing experimental results and participant feedback, recommending human-in-the-loop oversight when using AI for task-splitting.

high positive Splitting User Stories Into Tasks with AI -- A Foe or an All... effectiveness of AI-assisted task-splitting under human oversight

Participants favored a hybrid approach, combining AI tools with conventional methods to maintain high accuracy in planning.

Participant preferences and qualitative feedback reported from the controlled experiment indicating preference for combining AI assistance with human methods; sample size not provided.

high positive Splitting User Stories Into Tasks with AI -- A Foe or an All... participant preference for planning approach / planning accuracy

AI-assisted approaches can help ensure no important tasks are overlooked during task-splitting.

Reported finding from the experiment indicating AI assistance reduced omissions in task lists (paper statement based on experiment and participant observations); sample size not stated.

high positive Splitting User Stories Into Tasks with AI -- A Foe or an All... task omission rate / completeness of task lists

AI-assisted approaches can generate more granular task lists than traditional methods.

Experimental comparison reported in the paper showing AI-generated task lists were more granular (based on task lists produced during the controlled experiment); sample size not provided in summary.

high positive Splitting User Stories Into Tasks with AI -- A Foe or an All... task list granularity
