Evidence (13661 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	740	192	95	871	1945
Governance & Regulation	796	388	185	119	1512
Organizational Efficiency	765	186	123	82	1166
Technology Adoption Rate	610	227	121	95	1061
Research Productivity	409	121	56	331	928
Output Quality	464	174	58	47	743
Decision Quality	318	173	75	42	615
Firm Productivity	432	55	88	20	601
AI Safety & Ethics	214	273	65	33	589
Market Structure	175	165	120	24	489
Task Allocation	206	64	70	31	376
Skill Acquisition	161	57	57	16	291
Innovation Output	201	27	41	18	288
Fiscal & Macroeconomic	130	69	43	26	275
Employment Level	104	50	105	13	274
Consumer Welfare	116	62	42	11	231
Firm Revenue	149	45	26	3	223
Inequality Measures	43	120	49	6	218
Task Completion Time	164	29	8	12	214
Worker Satisfaction	89	60	20	12	181
Error Rate	69	89	9	2	169
Regulatory Compliance	74	67	14	4	159
Training Effectiveness	91	19	13	19	144
Wages & Compensation	77	33	25	6	141
Team Performance	86	17	27	9	140
Automation Exposure	49	50	22	12	136
Developer Productivity	91	17	14	5	128
Job Displacement	12	80	19	1	112
Hiring & Recruitment	51	7	8	3	69
Creative Output	31	16	7	2	57
Skill Obsolescence	5	43	6	1	55
Social Protection	27	16	8	2	53
Labor Share of Income	17	17	17	—	51
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

We propose a per-task leverage ratio for human-agent collaboration: human work displaced by an agent, divided by the human time required to specify the task, resolve mid-run interrupts, and review the result.

Theoretical/conceptual proposal and formal definition provided in the paper; no empirical sample or experimental data reported.

high positive Leverage Laws: A Per-Task Framework for Human-Agent Collabor... human work displaced per unit human time (per-task leverage)

Grounding recommendations in validated research offers leaders a framework for navigating AI's labor implications responsibly.

Paper asserts that its synthesis and recommendations provide a practical framework for leaders; no empirical validation of the framework is reported in the abstract.

high positive AI Displacement Risk in the Labor Market: Evidence, Exposure... ability of leaders to navigate AI labor implications and mitigate harm

Evidence-based organizational responses (transparent workforce planning, skills investment, redesigned roles, adaptive governance, and long-term capability-building) can mitigate harm and prepare organizations for workplace transformation.

Paper proposes these organizational responses grounded in the synthesized empirical literature; this is a recommendation rather than an empirically tested intervention in the paper abstract.

high positive AI Displacement Risk in the Labor Market: Evidence, Exposure... organizational readiness and mitigation of AI-related harms

There is an absence of a comprehensive national strategy in Israel for AI in employment, and the paper calls for the development of a forward-looking regulatory framework that balances innovation with protection of fundamental rights (dignity, equality, privacy), transparency, human oversight, and fairness.

Normative policy recommendation based on the paper's regulatory analysis; not an empirical finding and no policy-design experiments are reported in the excerpt.

high positive Artificial Intelligence in Israel, Trends, Developments, and... existence of a comprehensive national AI-employment strategy and recommended pol...

The AI-driven transformation is accompanied by an increasing emphasis on reskilling and continuous learning, reflecting a shift from workforce replacement to reconfiguration of modes of employment.

Reported observation in the paper about workforce development trends; no quantitative measures of reskilling uptake or program counts are provided in the excerpt.

high positive Artificial Intelligence in Israel, Trends, Developments, and... emphasis and activity in reskilling and continuous learning related to AI adopti...

Israeli legal scholarship reflects broad interdisciplinary engagement with AI across labor law, intellectual property, privacy, constitutional law, and additional fields; the study advances theoretical models, including reconceptualizations of accountability, creativity, and the role of AI as a legal actor.

Literature review/academic survey and theoretical contributions reported in the paper; specific counts of publications or analytical methods not provided in the excerpt.

high positive Artificial Intelligence in Israel, Trends, Developments, and... scope and interdisciplinarity of Israeli legal scholarship on AI and the paper's...

Israel is a leading “AI Nation,” characterized by exceptionally high levels of technological integration across both the private and public sectors.

Statement in paper based on the author's characterisation of national-level technological integration; specific empirical measures or sample size not provided in the excerpt.

high positive Artificial Intelligence in Israel, Trends, Developments, and... level of technological integration of AI across private and public sectors

Our evolved prefetcher achieves a 1.76x geomean IPC speedup over no prefetching, 17% over its VA/AMPM Lite seed (1.59x) and 21% over SMS (1.55x).

Reported experimental result comparing geomean IPC across benchmark set; comparisons made to no prefetching and two specific prefetcher baselines. Benchmark details not included in abstract.

high positive Agentic Architect: An Agentic AI Framework for Architecture ... task_completion_time

Our evolved branch predictor achieves a 1.100x geomean IPC speedup over Bimodal, 1.5% over its Hashed Perceptron seed (1.085x).

Reported experimental result comparing geomean IPC across benchmark set; compared to Bimodal and Hashed Perceptron seed as baselines. Benchmark details not given in abstract.

high positive Agentic Architect: An Agentic AI Framework for Architecture ... task_completion_time

Our best evolved cache replacement design achieves a 1.062x geomean IPC speedup over LRU, 0.6% over Mockingjay (1.056x).

Reported experimental result comparing geomean IPC across benchmark set; exact benchmark count/split not provided in abstract. Comparison reported against LRU and Mockingjay baselines.

high positive Agentic Architect: An Agentic AI Framework for Architecture ... task_completion_time

Across cache replacement, data prefetching, and branch prediction, Agentic Architect matches or exceeds state-of-the-art designs.

Experimental evaluation across three microarchitectural component domains (cache replacement, prefetching, branch prediction) reported in the paper with comparative performance results versus baselines.

high positive Agentic Architect: An Agentic AI Framework for Architecture ... task_completion_time

We introduce Agentic Architect, an agentic AI framework for computer architecture design exploration and optimization that combines LLM-driven code evolution with cycle-accurate simulation.

Authors' description of the system and methodology in the paper (introduction and methods). No numeric sample size reported in the abstract; evidence is the implemented framework and accompanying descriptions; authors state it is open-source.

high positive Agentic Architect: An Agentic AI Framework for Architecture ... innovation_output

PD--RSAC maintained zero feeder-limit violations in the experiments.

Empirical reporting in the paper's experimental results on the simulator that PD--RSAC had zero feeder-limit violations while operating under the formulated constraints.

high positive Semi-Markov Reinforcement Learning for City-Scale EV Ride-Ha... number of feeder-limit violations

Experiments on a large-scale EV fleet simulator built from NYC taxi data show that PD--RSAC achieves the highest net profit, reaching $1.22M.

Empirical results reported from experiments run on a large-scale EV fleet simulator constructed from NYC taxi data; PD--RSAC reported net profit of $1.22M in these experiments (paper's experimental section).

high positive Semi-Markov Reinforcement Learning for City-Scale EV Ride-Ha... net profit

Through targeted prompting inspired by these findings, we modify agents' negotiation behavior and improve win rates from 22.2% to 32.7%.

Intervention experiment reported in the paper where prompts were changed and resulting agent win rates were measured.

high positive Cooperate to Compete: Strategic Coordination in Multi-Agent ... agent win rate

The gender gap in autonomy narrows as robot exposure increases.

EWCTS 2021 merged with IFR robot exposure at the country–industry level; weighted logit regressions with controls, country and industry fixed effects, and gender × robot-exposure interaction terms showing reduced gender differences in autonomy with higher robot exposure.

high positive Gendered Effects of Robotisation on Job Quality autonomy (job-quality dimension)

Robotisation is associated with lower physical risks for both genders.

EWCTS 2021 individual data combined with IFR-based country–industry robot exposure; estimated via weighted logit models with controls and country and industry fixed effects, including gender interaction terms to test heterogeneity.

high positive Gendered Effects of Robotisation on Job Quality physical risks (job-quality dimension)

For software engineers, GAI's (GHC's) productivity impacts and creation of new tasks appear to outweigh potential displacement effects from automation of some SWE tasks.

Interpretation based on observed associations: higher hiring probability (especially entry-level), increased non-programming skills in new hires, and no decline in coding skills in the LinkedIn/GitHub observational data.

high positive Firms' GitHub Copilot adoption and labor market outcomes for... net labor effects for software engineers (balance of productivity/task-creation ...

New hires at GHC-adopting firms exhibit around 5% more non-programming skills.

Analysis of LinkedIn skill listings for new hires linked to GitHub/GHC adoption status, comparing the prevalence/count of non-programming skills among new hires at adopters versus non-adopters.

high positive Firms' GitHub Copilot adoption and labor market outcomes for... quantity/prevalence of non-programming skills among new hires

The increase in hiring probability is driven by entry-level hires.

Subgroup/heterogeneity analysis within the LinkedIn/GitHub observational data showing the hiring increase concentrated among entry-level SWE hires.

high positive Firms' GitHub Copilot adoption and labor market outcomes for... hiring of entry-level software engineers

GHC adoption is associated with around a 3%–5% higher monthly probability of hiring SWEs.

Observational analysis using LinkedIn and GitHub data comparing firms that adopted GitHub Copilot (GHC) to firms that did not; association measured as change in firms' monthly probability of hiring software engineers.

high positive Firms' GitHub Copilot adoption and labor market outcomes for... monthly probability of hiring software engineers (SWEs)

In clinical utility evaluation across three abstraction tasks, semantic search reduced time-to-completion by 24 to 89% compared to clinician-performed chart review.

Clinical utility assessment compared chart abstraction efficiency across three tasks and reported percentage reductions in time-to-completion ranging from 24% to 89%.

high positive Health System Scale Semantic Search Across Unstructured Clin... time-to-completion

Qwen3 embeddings with 300-token chunk size achieved 94.6% accuracy on a clinical question-answering benchmark.

Optimization experiment on a physician-authored clinical question-answering benchmark; best-performing configuration reported as qwen3 embeddings with 300-token chunks and 94.6% accuracy.

high positive Health System Scale Semantic Search Across Unstructured Clin... accuracy_on_clinical_question_answering_benchmark

The system delivers sub-second query latency: median 237 ms single-user, 451 ms at 20-user concurrency.

Full-scale performance characterization reported exact median latencies for single-user and 20-user concurrency.

high positive Health System Scale Semantic Search Across Unstructured Clin... query_latency

Governance maturity is therefore not merely a constraint on AI adoption; it is a condition that shapes whether capability improvements translate into productive deployment.

Synthesis/conclusion drawn from the analytical model showing governance affects the mapping from capability to productive deployment.

high positive The Security Cost of Intelligence: AI Capability, Cyber Risk... translation of AI capability improvements into productive deployment

Governance investment that reduces breach-loss magnitude shrinks the paradox region itself.

Analytical model result showing how changes in governance (modeled as reductions in breach-loss magnitude) affect the parameter region where the deployment paradox occurs.

high positive The Security Cost of Intelligence: AI Capability, Cyber Risk... size of the 'paradox region' (parameter range where better AI reduces deployment...

Technological advancement alone is insufficient—maximizing AI's economic potential requires strategic investments in workforce capability development (e.g., widespread AI fluency programs and targeted cultivation of higher-order judgment skills).

Policy recommendation based on the article's synthesis of task-based models and empirical literature; the excerpt does not report specific interventions, trials, or sample sizes.

high positive AI as Augmentation: How Human Capital Shapes Technology's Im... effectiveness of workforce capability investments for realizing AI-driven produc...

The supply of AI-literate workers amplifies productivity gains.

Stated as a mechanism in the task-based model synthesis; described qualitatively in the article without specific empirical method or sample sizes in the excerpt.

high positive AI as Augmentation: How Human Capital Shapes Technology's Im... productivity gains from AI adoption

Aggregate productivity improvements from AI advancement depend critically on two forms of human capital: specialized AI expertise and complementary non-AI skills.

Claim is presented as a theoretical result drawn from 'task-based economic models' in the article; empirical corroboration is referenced generally but no specific datasets or sample sizes are reported in the excerpt.

high positive AI as Augmentation: How Human Capital Shapes Technology's Im... aggregate productivity improvements

Mounting empirical evidence indicates AI primarily functions as augmentation technology—amplifying human capabilities rather than replacing workers.

Article states it draws on 'mounting empirical evidence' and synthesizes recent theoretical and empirical findings; no specific studies, methods, or sample sizes are cited in the excerpt.

high positive AI as Augmentation: How Human Capital Shapes Technology's Im... degree of workforce displacement versus augmentation (replacement vs. amplificat...

Multi-agent reinforcement learning has emerged as a promising approach for the combined scheduling of production and transportation tasks in decentralized factories.

Literature-context claim in the paper's introduction summarizing prior research trends and motivations for applying multi-agent RL to integrated production-transport scheduling.

high positive An Analysis of the Coordination Gap between Joint and Modula... potential improvement in scheduling/operational efficiency

Modular training represents a viable alternative in environments where a single scheduling task dominates.

Empirical findings from the paper's experiments/sensitivity analysis indicating modularly trained agents perform comparably to joint training when one scheduling task (either production or transportation) is temporally dominant.

high positive An Analysis of the Coordination Gap between Joint and Modula... relative scheduling performance (modular vs joint training)

Joint training can produce superior performance compared to the best-performing combinations of dispatching rules and modular training.

Empirical evaluation reported in the study comparing joint-training multi-agent RL against modular training and dispatching-rule baselines across simulated job-shop scheduling environments with transportation resources (sensitivity analysis over resource scarcity and temporal dominance). Specific sample size / number of scenarios not stated in the abstract.

high positive An Analysis of the Coordination Gap between Joint and Modula... scheduling performance (e.g., makespan / throughput / overall schedule quality)

AI agents are now running real transactions, workflows, and sub-agent chains across organizational boundaries without continuous human supervision.

Statement in abstract describing observed industry trend; paper reports a structured survey of industry trends, emerging standards, and technical literature as its method for situating this observation.

high positive AI Identity: Standards, Gaps, and Research Directions for AI... deployment of autonomous agents to execute transactions/workflows across organiz...

Addressing AI in evaluative systems requires treating monitoring (AI detection) and loosened selectivity as complementary design instruments.

Policy implication derived from model results and constrained optimization of editorial policy in the post-transition regime; argued in the paper's conclusion.

high positive Buying the Right to Monitor:Editorial Design in AI-Assisted ... effectiveness of combined interventions (monitoring + loosened selectivity) on e...

The proposed approach reframes AI control from optimizing decisions to governing their admissibility, introducing a protocol-level abstraction that operates independently of model architecture or training methodology.

Conceptual argument and proposal in the paper asserting architecture-agnostic protocol abstraction. No empirical tests across architectures or training methods reported.

high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... shift in control paradigm (from decision optimization to admissibility governanc...

Through a scenario-based case study, we demonstrate how identical AI outputs can lead to divergent outcomes when evaluated under a Right-to-Act protocol, preserving reversibility and preventing premature or irreversible actions.

Scenario-based case study (illustrative demonstration). The paper reports example scenarios rather than empirical experiments; no sample size or quantitative evaluation reported.

high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... divergent outcomes from identical AI outputs under the protocol; preservation of...

Unlike compensatory systems, where high-confidence signals can override failed conditions, the proposed framework enforces strict structural constraints: if any required condition is unmet, execution is halted or deferred.

Conceptual distinction and protocol rule specification in the paper (formal description of non-compensatory enforcement). No empirical testing reported.

high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... whether execution proceeds when required conditions are unmet (halt/defer behavi...

We introduce the Right-to-Act protocol, a deterministic, non-compensatory pre-execution decision layer that evaluates whether an AI-generated decision is permitted to be realized at all.

Proposed method / conceptual contribution and formal definition provided in the paper (formalization and protocol specification). No empirical validation or sample size reported.

high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... eligibility/admissibility of AI-generated decisions prior to execution

Taken together, these insights provide theoretical clarity and practical guidance for responsible GenAI integration into creative work.

Authors' stated contribution and practical recommendations derived from the conceptual framework; no empirical evaluation of guidance effectiveness provided.

high positive Beyond the Creativity Paradox: A Theory-informed Framework f... theoretical clarity and practical guidance for responsible GenAI integration

The study reinterprets process-oriented creativity theories through structural parallels with GenAI.

Conceptual reanalysis and theoretical reinterpretation based on literature synthesis (paper's theoretical contribution).

high positive Beyond the Creativity Paradox: A Theory-informed Framework f... process-oriented creativity theory reinterpretation

The authors propose a role-based integration model that aligns GenAI capabilities with key creative functions: idea generation, synthesis, strategic framing, and facilitation.

Presentation of a novel conceptual model / framework in the paper (theoretical design); no empirical validation or measured outcomes reported.

high positive Beyond the Creativity Paradox: A Theory-informed Framework f... alignment of GenAI capabilities with creative functions (idea generation, synthe...

The paper repositions GenAI as a cognitive collaborator rather than merely a productivity tool.

Argumentative / conceptual claim supported by the proposed theoretical reframing and role-based model in the paper; no empirical testing reported.

high positive Beyond the Creativity Paradox: A Theory-informed Framework f... role of GenAI in organizational workflows (cognitive collaborator vs productivit...

There are structural parallels between GenAI architectures and human cognition—such as heuristic search, divergent thinking, and iterative refinement.

Conceptual mapping and theoretical comparison between GenAI architecture characteristics and cognitive/creativity constructs presented in the paper (literature synthesis / theoretical argument).

high positive Beyond the Creativity Paradox: A Theory-informed Framework f... structural parallels between GenAI architectures and human cognition (heuristic ...

The study revisits foundational creativity theories to develop a framework for integrating GenAI into creative workflows.

Paper describes a conceptual review and theoretical synthesis of foundational creativity theories leading to a proposed integration framework; methodological (theoretical / conceptual) contribution rather than empirical validation.

high positive Beyond the Creativity Paradox: A Theory-informed Framework f... framework for integrating GenAI into creative workflows

Generative Artificial Intelligence (GenAI) is reshaping organisational creativity by emulating cognitive processes traditionally associated with human innovation.

Paper's theoretical argument and literature-grounded conceptual claims (conceptual analysis / literature review); no empirical sample or quantitative data reported.

high positive Beyond the Creativity Paradox: A Theory-informed Framework f... organisational creativity

That compliance layer can improve oversight by making departures from law easier to detect.

Claim supported by the paper's analytical argumentation (no empirical evidence reported).

high positive AI Governance under Political Turnover: The Alignment Surfac... detectability of departures from law (oversight effectiveness)

For probabilistic AI to be incorporated into public administration it must be embedded in a compliance layer that makes decisions reviewable, repeatable, and legally defensible.

Stated as a normative/architectural claim in the paper; supported by conceptual argument rather than empirical testing.

high positive AI Governance under Political Turnover: The Alignment Surfac... requirements for legal/administrative incorporation of probabilistic AI

Governments are increasingly interested in using AI to make administrative decisions cheaper, more scalable, and more consistent.

Stated as background motivation in the paper (no empirical data or sample size reported).

high positive AI Governance under Political Turnover: The Alignment Surfac... government interest in AI adoption for administrative decisions (cost, scale, co...

There is an open opportunity to support collaborative construction where users and AI jointly develop an evolving knowledge representation.

Paper's stated research opportunity and motivation based on gaps identified in prior tools and systems (conceptual argument).

high positive MindTrellis: Co-Creating Knowledge Structures with AI throug... potential benefits of joint user-AI collaborative knowledge representation (prop...

« Prev 1 2 3 … 128 129 130 … 273 274 Next »