The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲

Evidence (13661 claims)

Adoption
8339 claims
Productivity
7479 claims
Governance
6715 claims
Human-AI Collaboration
6267 claims
Org Design
4098 claims
Innovation
3987 claims
Labor Markets
3488 claims
Skills & Training
2888 claims
Inequality
2016 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 740 192 95 871 1945
Governance & Regulation 796 388 185 119 1512
Organizational Efficiency 765 186 123 82 1166
Technology Adoption Rate 610 227 121 95 1061
Research Productivity 409 121 56 331 928
Output Quality 464 174 58 47 743
Decision Quality 318 173 75 42 615
Firm Productivity 432 55 88 20 601
AI Safety & Ethics 214 273 65 33 589
Market Structure 175 165 120 24 489
Task Allocation 206 64 70 31 376
Skill Acquisition 161 57 57 16 291
Innovation Output 201 27 41 18 288
Fiscal & Macroeconomic 130 69 43 26 275
Employment Level 104 50 105 13 274
Consumer Welfare 116 62 42 11 231
Firm Revenue 149 45 26 3 223
Inequality Measures 43 120 49 6 218
Task Completion Time 164 29 8 12 214
Worker Satisfaction 89 60 20 12 181
Error Rate 69 89 9 2 169
Regulatory Compliance 74 67 14 4 159
Training Effectiveness 91 19 13 19 144
Wages & Compensation 77 33 25 6 141
Team Performance 86 17 27 9 140
Automation Exposure 49 50 22 12 136
Developer Productivity 91 17 14 5 128
Job Displacement 12 80 19 1 112
Hiring & Recruitment 51 7 8 3 69
Creative Output 31 16 7 2 57
Skill Obsolescence 5 43 6 1 55
Social Protection 27 16 8 2 53
Labor Share of Income 17 17 17 51
Worker Turnover 11 12 3 26
Industry 1 1
We propose a per-task leverage ratio for human-agent collaboration: human work displaced by an agent, divided by the human time required to specify the task, resolve mid-run interrupts, and review the result.
Theoretical/conceptual proposal and formal definition provided in the paper; no empirical sample or experimental data reported.
high positive Leverage Laws: A Per-Task Framework for Human-Agent Collabor... human work displaced per unit human time (per-task leverage)
Grounding recommendations in validated research offers leaders a framework for navigating AI's labor implications responsibly.
Paper asserts that its synthesis and recommendations provide a practical framework for leaders; no empirical validation of the framework is reported in the abstract.
high positive AI Displacement Risk in the Labor Market: Evidence, Exposure... ability of leaders to navigate AI labor implications and mitigate harm
Evidence-based organizational responses (transparent workforce planning, skills investment, redesigned roles, adaptive governance, and long-term capability-building) can mitigate harm and prepare organizations for workplace transformation.
Paper proposes these organizational responses grounded in the synthesized empirical literature; this is a recommendation rather than an empirically tested intervention in the paper abstract.
high positive AI Displacement Risk in the Labor Market: Evidence, Exposure... organizational readiness and mitigation of AI-related harms
There is an absence of a comprehensive national strategy in Israel for AI in employment, and the paper calls for the development of a forward-looking regulatory framework that balances innovation with protection of fundamental rights (dignity, equality, privacy), transparency, human oversight, and fairness.
Normative policy recommendation based on the paper's regulatory analysis; not an empirical finding and no policy-design experiments are reported in the excerpt.
high positive Artificial Intelligence in Israel, Trends, Developments, and... existence of a comprehensive national AI-employment strategy and recommended pol...
The AI-driven transformation is accompanied by an increasing emphasis on reskilling and continuous learning, reflecting a shift from workforce replacement to reconfiguration of modes of employment.
Reported observation in the paper about workforce development trends; no quantitative measures of reskilling uptake or program counts are provided in the excerpt.
high positive Artificial Intelligence in Israel, Trends, Developments, and... emphasis and activity in reskilling and continuous learning related to AI adopti...
Israeli legal scholarship reflects broad interdisciplinary engagement with AI across labor law, intellectual property, privacy, constitutional law, and additional fields; the study advances theoretical models, including reconceptualizations of accountability, creativity, and the role of AI as a legal actor.
Literature review/academic survey and theoretical contributions reported in the paper; specific counts of publications or analytical methods not provided in the excerpt.
high positive Artificial Intelligence in Israel, Trends, Developments, and... scope and interdisciplinarity of Israeli legal scholarship on AI and the paper's...
Israel is a leading “AI Nation,” characterized by exceptionally high levels of technological integration across both the private and public sectors.
Statement in paper based on the author's characterisation of national-level technological integration; specific empirical measures or sample size not provided in the excerpt.
high positive Artificial Intelligence in Israel, Trends, Developments, and... level of technological integration of AI across private and public sectors
Our evolved prefetcher achieves a 1.76x geomean IPC speedup over no prefetching, 17% over its VA/AMPM Lite seed (1.59x) and 21% over SMS (1.55x).
Reported experimental result comparing geomean IPC across benchmark set; comparisons made to no prefetching and two specific prefetcher baselines. Benchmark details not included in abstract.
Our evolved branch predictor achieves a 1.100x geomean IPC speedup over Bimodal, 1.5% over its Hashed Perceptron seed (1.085x).
Reported experimental result comparing geomean IPC across benchmark set; compared to Bimodal and Hashed Perceptron seed as baselines. Benchmark details not given in abstract.
Our best evolved cache replacement design achieves a 1.062x geomean IPC speedup over LRU, 0.6% over Mockingjay (1.056x).
Reported experimental result comparing geomean IPC across benchmark set; exact benchmark count/split not provided in abstract. Comparison reported against LRU and Mockingjay baselines.
Across cache replacement, data prefetching, and branch prediction, Agentic Architect matches or exceeds state-of-the-art designs.
Experimental evaluation across three microarchitectural component domains (cache replacement, prefetching, branch prediction) reported in the paper with comparative performance results versus baselines.
We introduce Agentic Architect, an agentic AI framework for computer architecture design exploration and optimization that combines LLM-driven code evolution with cycle-accurate simulation.
Authors' description of the system and methodology in the paper (introduction and methods). No numeric sample size reported in the abstract; evidence is the implemented framework and accompanying descriptions; authors state it is open-source.
PD--RSAC maintained zero feeder-limit violations in the experiments.
Empirical reporting in the paper's experimental results on the simulator that PD--RSAC had zero feeder-limit violations while operating under the formulated constraints.
high positive Semi-Markov Reinforcement Learning for City-Scale EV Ride-Ha... number of feeder-limit violations
Experiments on a large-scale EV fleet simulator built from NYC taxi data show that PD--RSAC achieves the highest net profit, reaching $1.22M.
Empirical results reported from experiments run on a large-scale EV fleet simulator constructed from NYC taxi data; PD--RSAC reported net profit of $1.22M in these experiments (paper's experimental section).
Through targeted prompting inspired by these findings, we modify agents' negotiation behavior and improve win rates from 22.2% to 32.7%.
Intervention experiment reported in the paper where prompts were changed and resulting agent win rates were measured.
The gender gap in autonomy narrows as robot exposure increases.
EWCTS 2021 merged with IFR robot exposure at the country–industry level; weighted logit regressions with controls, country and industry fixed effects, and gender × robot-exposure interaction terms showing reduced gender differences in autonomy with higher robot exposure.
high positive Gendered Effects of Robotisation on Job Quality autonomy (job-quality dimension)
Robotisation is associated with lower physical risks for both genders.
EWCTS 2021 individual data combined with IFR-based country–industry robot exposure; estimated via weighted logit models with controls and country and industry fixed effects, including gender interaction terms to test heterogeneity.
high positive Gendered Effects of Robotisation on Job Quality physical risks (job-quality dimension)
For software engineers, GAI's (GHC's) productivity impacts and creation of new tasks appear to outweigh potential displacement effects from automation of some SWE tasks.
Interpretation based on observed associations: higher hiring probability (especially entry-level), increased non-programming skills in new hires, and no decline in coding skills in the LinkedIn/GitHub observational data.
high positive Firms' GitHub Copilot adoption and labor market outcomes for... net labor effects for software engineers (balance of productivity/task-creation ...
New hires at GHC-adopting firms exhibit around 5% more non-programming skills.
Analysis of LinkedIn skill listings for new hires linked to GitHub/GHC adoption status, comparing the prevalence/count of non-programming skills among new hires at adopters versus non-adopters.
high positive Firms' GitHub Copilot adoption and labor market outcomes for... quantity/prevalence of non-programming skills among new hires
The increase in hiring probability is driven by entry-level hires.
Subgroup/heterogeneity analysis within the LinkedIn/GitHub observational data showing the hiring increase concentrated among entry-level SWE hires.
high positive Firms' GitHub Copilot adoption and labor market outcomes for... hiring of entry-level software engineers
GHC adoption is associated with around a 3%–5% higher monthly probability of hiring SWEs.
Observational analysis using LinkedIn and GitHub data comparing firms that adopted GitHub Copilot (GHC) to firms that did not; association measured as change in firms' monthly probability of hiring software engineers.
high positive Firms' GitHub Copilot adoption and labor market outcomes for... monthly probability of hiring software engineers (SWEs)
In clinical utility evaluation across three abstraction tasks, semantic search reduced time-to-completion by 24 to 89% compared to clinician-performed chart review.
Clinical utility assessment compared chart abstraction efficiency across three tasks and reported percentage reductions in time-to-completion ranging from 24% to 89%.
Qwen3 embeddings with 300-token chunk size achieved 94.6% accuracy on a clinical question-answering benchmark.
Optimization experiment on a physician-authored clinical question-answering benchmark; best-performing configuration reported as qwen3 embeddings with 300-token chunks and 94.6% accuracy.
high positive Health System Scale Semantic Search Across Unstructured Clin... accuracy_on_clinical_question_answering_benchmark
The system delivers sub-second query latency: median 237 ms single-user, 451 ms at 20-user concurrency.
Full-scale performance characterization reported exact median latencies for single-user and 20-user concurrency.
Governance maturity is therefore not merely a constraint on AI adoption; it is a condition that shapes whether capability improvements translate into productive deployment.
Synthesis/conclusion drawn from the analytical model showing governance affects the mapping from capability to productive deployment.
high positive The Security Cost of Intelligence: AI Capability, Cyber Risk... translation of AI capability improvements into productive deployment
Governance investment that reduces breach-loss magnitude shrinks the paradox region itself.
Analytical model result showing how changes in governance (modeled as reductions in breach-loss magnitude) affect the parameter region where the deployment paradox occurs.
high positive The Security Cost of Intelligence: AI Capability, Cyber Risk... size of the 'paradox region' (parameter range where better AI reduces deployment...
Technological advancement alone is insufficient—maximizing AI's economic potential requires strategic investments in workforce capability development (e.g., widespread AI fluency programs and targeted cultivation of higher-order judgment skills).
Policy recommendation based on the article's synthesis of task-based models and empirical literature; the excerpt does not report specific interventions, trials, or sample sizes.
high positive AI as Augmentation: How Human Capital Shapes Technology's Im... effectiveness of workforce capability investments for realizing AI-driven produc...
The supply of AI-literate workers amplifies productivity gains.
Stated as a mechanism in the task-based model synthesis; described qualitatively in the article without specific empirical method or sample sizes in the excerpt.
high positive AI as Augmentation: How Human Capital Shapes Technology's Im... productivity gains from AI adoption
Aggregate productivity improvements from AI advancement depend critically on two forms of human capital: specialized AI expertise and complementary non-AI skills.
Claim is presented as a theoretical result drawn from 'task-based economic models' in the article; empirical corroboration is referenced generally but no specific datasets or sample sizes are reported in the excerpt.
high positive AI as Augmentation: How Human Capital Shapes Technology's Im... aggregate productivity improvements
Mounting empirical evidence indicates AI primarily functions as augmentation technology—amplifying human capabilities rather than replacing workers.
Article states it draws on 'mounting empirical evidence' and synthesizes recent theoretical and empirical findings; no specific studies, methods, or sample sizes are cited in the excerpt.
high positive AI as Augmentation: How Human Capital Shapes Technology's Im... degree of workforce displacement versus augmentation (replacement vs. amplificat...
Multi-agent reinforcement learning has emerged as a promising approach for the combined scheduling of production and transportation tasks in decentralized factories.
Literature-context claim in the paper's introduction summarizing prior research trends and motivations for applying multi-agent RL to integrated production-transport scheduling.
high positive An Analysis of the Coordination Gap between Joint and Modula... potential improvement in scheduling/operational efficiency
Modular training represents a viable alternative in environments where a single scheduling task dominates.
Empirical findings from the paper's experiments/sensitivity analysis indicating modularly trained agents perform comparably to joint training when one scheduling task (either production or transportation) is temporally dominant.
high positive An Analysis of the Coordination Gap between Joint and Modula... relative scheduling performance (modular vs joint training)
Joint training can produce superior performance compared to the best-performing combinations of dispatching rules and modular training.
Empirical evaluation reported in the study comparing joint-training multi-agent RL against modular training and dispatching-rule baselines across simulated job-shop scheduling environments with transportation resources (sensitivity analysis over resource scarcity and temporal dominance). Specific sample size / number of scenarios not stated in the abstract.
high positive An Analysis of the Coordination Gap between Joint and Modula... scheduling performance (e.g., makespan / throughput / overall schedule quality)
AI agents are now running real transactions, workflows, and sub-agent chains across organizational boundaries without continuous human supervision.
Statement in abstract describing observed industry trend; paper reports a structured survey of industry trends, emerging standards, and technical literature as its method for situating this observation.
high positive AI Identity: Standards, Gaps, and Research Directions for AI... deployment of autonomous agents to execute transactions/workflows across organiz...
Addressing AI in evaluative systems requires treating monitoring (AI detection) and loosened selectivity as complementary design instruments.
Policy implication derived from model results and constrained optimization of editorial policy in the post-transition regime; argued in the paper's conclusion.
high positive Buying the Right to Monitor:Editorial Design in AI-Assisted ... effectiveness of combined interventions (monitoring + loosened selectivity) on e...
The proposed approach reframes AI control from optimizing decisions to governing their admissibility, introducing a protocol-level abstraction that operates independently of model architecture or training methodology.
Conceptual argument and proposal in the paper asserting architecture-agnostic protocol abstraction. No empirical tests across architectures or training methods reported.
high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... shift in control paradigm (from decision optimization to admissibility governanc...
Through a scenario-based case study, we demonstrate how identical AI outputs can lead to divergent outcomes when evaluated under a Right-to-Act protocol, preserving reversibility and preventing premature or irreversible actions.
Scenario-based case study (illustrative demonstration). The paper reports example scenarios rather than empirical experiments; no sample size or quantitative evaluation reported.
high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... divergent outcomes from identical AI outputs under the protocol; preservation of...
Unlike compensatory systems, where high-confidence signals can override failed conditions, the proposed framework enforces strict structural constraints: if any required condition is unmet, execution is halted or deferred.
Conceptual distinction and protocol rule specification in the paper (formal description of non-compensatory enforcement). No empirical testing reported.
high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... whether execution proceeds when required conditions are unmet (halt/defer behavi...
We introduce the Right-to-Act protocol, a deterministic, non-compensatory pre-execution decision layer that evaluates whether an AI-generated decision is permitted to be realized at all.
Proposed method / conceptual contribution and formal definition provided in the paper (formalization and protocol specification). No empirical validation or sample size reported.
high positive Right-to-Act: A Pre-Execution Non-Compensatory Decision Prot... eligibility/admissibility of AI-generated decisions prior to execution
Taken together, these insights provide theoretical clarity and practical guidance for responsible GenAI integration into creative work.
Authors' stated contribution and practical recommendations derived from the conceptual framework; no empirical evaluation of guidance effectiveness provided.
high positive Beyond the Creativity Paradox: A Theory-informed Framework f... theoretical clarity and practical guidance for responsible GenAI integration
The study reinterprets process-oriented creativity theories through structural parallels with GenAI.
Conceptual reanalysis and theoretical reinterpretation based on literature synthesis (paper's theoretical contribution).
high positive Beyond the Creativity Paradox: A Theory-informed Framework f... process-oriented creativity theory reinterpretation
The authors propose a role-based integration model that aligns GenAI capabilities with key creative functions: idea generation, synthesis, strategic framing, and facilitation.
Presentation of a novel conceptual model / framework in the paper (theoretical design); no empirical validation or measured outcomes reported.
high positive Beyond the Creativity Paradox: A Theory-informed Framework f... alignment of GenAI capabilities with creative functions (idea generation, synthe...
The paper repositions GenAI as a cognitive collaborator rather than merely a productivity tool.
Argumentative / conceptual claim supported by the proposed theoretical reframing and role-based model in the paper; no empirical testing reported.
high positive Beyond the Creativity Paradox: A Theory-informed Framework f... role of GenAI in organizational workflows (cognitive collaborator vs productivit...
There are structural parallels between GenAI architectures and human cognition—such as heuristic search, divergent thinking, and iterative refinement.
Conceptual mapping and theoretical comparison between GenAI architecture characteristics and cognitive/creativity constructs presented in the paper (literature synthesis / theoretical argument).
high positive Beyond the Creativity Paradox: A Theory-informed Framework f... structural parallels between GenAI architectures and human cognition (heuristic ...
The study revisits foundational creativity theories to develop a framework for integrating GenAI into creative workflows.
Paper describes a conceptual review and theoretical synthesis of foundational creativity theories leading to a proposed integration framework; methodological (theoretical / conceptual) contribution rather than empirical validation.
high positive Beyond the Creativity Paradox: A Theory-informed Framework f... framework for integrating GenAI into creative workflows
Generative Artificial Intelligence (GenAI) is reshaping organisational creativity by emulating cognitive processes traditionally associated with human innovation.
Paper's theoretical argument and literature-grounded conceptual claims (conceptual analysis / literature review); no empirical sample or quantitative data reported.
high positive Beyond the Creativity Paradox: A Theory-informed Framework f... organisational creativity
That compliance layer can improve oversight by making departures from law easier to detect.
Claim supported by the paper's analytical argumentation (no empirical evidence reported).
high positive AI Governance under Political Turnover: The Alignment Surfac... detectability of departures from law (oversight effectiveness)
For probabilistic AI to be incorporated into public administration it must be embedded in a compliance layer that makes decisions reviewable, repeatable, and legally defensible.
Stated as a normative/architectural claim in the paper; supported by conceptual argument rather than empirical testing.
high positive AI Governance under Political Turnover: The Alignment Surfac... requirements for legal/administrative incorporation of probabilistic AI
Governments are increasingly interested in using AI to make administrative decisions cheaper, more scalable, and more consistent.
Stated as background motivation in the paper (no empirical data or sample size reported).
high positive AI Governance under Political Turnover: The Alignment Surfac... government interest in AI adoption for administrative decisions (cost, scale, co...
There is an open opportunity to support collaborative construction where users and AI jointly develop an evolving knowledge representation.
Paper's stated research opportunity and motivation based on gaps identified in prior tools and systems (conceptual argument).
high positive MindTrellis: Co-Creating Knowledge Structures with AI throug... potential benefits of joint user-AI collaborative knowledge representation (prop...