The Commonplace
Home Dashboard Papers Evidence Digests 🎲

Evidence (2340 claims)

Adoption
5267 claims
Productivity
4560 claims
Governance
4137 claims
Human-AI Collaboration
3103 claims
Labor Markets
2506 claims
Innovation
2354 claims
Org Design
2340 claims
Skills & Training
1945 claims
Inequality
1322 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 378 106 59 455 1007
Governance & Regulation 379 176 116 58 739
Research Productivity 240 96 34 294 668
Organizational Efficiency 370 82 63 35 553
Technology Adoption Rate 296 118 66 29 513
Firm Productivity 277 34 68 10 394
AI Safety & Ethics 117 177 44 24 364
Output Quality 244 61 23 26 354
Market Structure 107 123 85 14 334
Decision Quality 168 74 37 19 301
Fiscal & Macroeconomic 75 52 32 21 187
Employment Level 70 32 74 8 186
Skill Acquisition 89 32 39 9 169
Firm Revenue 96 34 22 152
Innovation Output 106 12 21 11 151
Consumer Welfare 70 30 37 7 144
Regulatory Compliance 52 61 13 3 129
Inequality Measures 24 68 31 4 127
Task Allocation 75 11 29 6 121
Training Effectiveness 55 12 12 16 96
Error Rate 42 48 6 96
Worker Satisfaction 45 32 11 6 94
Task Completion Time 78 5 4 2 89
Wages & Compensation 46 13 19 5 83
Team Performance 44 9 15 7 76
Hiring & Recruitment 39 4 6 3 52
Automation Exposure 18 17 9 5 50
Job Displacement 5 31 12 48
Social Protection 21 10 6 2 39
Developer Productivity 29 3 3 1 36
Worker Turnover 10 12 3 25
Skill Obsolescence 3 19 2 24
Creative Output 15 5 3 1 24
Labor Share of Income 10 4 9 23
Clear
Org Design Remove filter
Build shared compute infrastructures tailored to medical workloads and validation needs.
Workshop recommendation from infrastructure-themed sessions and consensus outcomes (NSF workshop, Sept 26–27, 2024).
medium positive Report for NSF Workshop on Algorithm-Hardware Co-design for ... existence and utilization of shared compute infrastructure for medical R&D (comp...
Sustain investment in shared, standardized data infrastructures (datasets, ontologies, benchmarks) to support medical algorithm–hardware co-design.
Workshop infrastructure call presented during breakout sessions and final recommendations at the NSF workshop (Sept 26–27, 2024).
medium positive Report for NSF Workshop on Algorithm-Hardware Co-design for ... availability and use of standardized medical datasets/ontologies/benchmarks
Principal recommendation: shift from isolated algorithm or hardware efforts to integrated algorithm–hardware–workflow co-design for medical contexts.
Stated workshop recommendation derived from panels and cross-disciplinary consensus at the NSF workshop (Sept 26–27, 2024).
medium positive Report for NSF Workshop on Algorithm-Hardware Co-design for ... alignment and integration of R&D efforts (degree of co-design adoption in projec...
Sustained public investment and new validation, governance, and translation ecosystems are needed to de-risk commercialization and accelerate safe, accountable clinical adoption.
Workshop principal recommendation based on qualitative synthesis of expert judgment from participants and breakout outcomes (NSF workshop, Sept 26–27, 2024).
medium positive Report for NSF Workshop on Algorithm-Hardware Co-design for ... commercialization risk level and speed/rate of clinical adoption
Enabling next-generation medical technologies requires a fundamental reorientation toward algorithm–hardware co-design that is clinic-aware, validated continuously, and backed by shared data and compute infrastructures.
Consensus recommendation from a two-day NSF workshop (Sept 26–27, 2024) in Pittsburgh convening interdisciplinary participants (academic researchers in algorithms and hardware, clinicians, industry leaders). Methods: expert panels, thematic breakout sessions, cross-disciplinary discussions, consensus-building. Documentation at https://sites.google.com/view/nsfworkshop.
medium positive Report for NSF Workshop on Algorithm-Hardware Co-design for ... successful development and clinical adoption of next-generation medical technolo...
Automation of routine SE tasks suggests measurable productivity gains at team and firm levels, but quantification requires causal, outcome-based studies (e.g., throughput, defect rates, time-to-market).
Interpretation of literature review findings and survey-reported perceived productivity gains; no causal empirical estimates provided in the paper.
medium positive Artificial Intelligence as a Catalyst for Innovation in Soft... potential productivity metrics (throughput, defect rates, time-to-market) — not ...
Empirical survey evidence shows generally positive perceptions of AI tools among software engineering professionals and growing adoption.
Cross-sectional survey of software engineering professionals asking about current tool usage and perceived benefits (productivity, quality, speed); absolute respondent count and sampling frame not provided in the summary.
medium positive Artificial Intelligence as a Catalyst for Innovation in Soft... self-reported perception of AI tools and self-reported adoption rate
ML enables predictive features in software engineering: effort estimation, defect prediction, work prioritization, and risk forecasting that support Agile planning and continuous delivery.
Literature review of ML-for-SE research and practitioner survey reporting use or expectations of predictive features; specific model performance metrics or dataset sizes not reported in the summary.
medium positive Artificial Intelligence as a Catalyst for Innovation in Soft... availability/use of predictive outputs (e.g., estimated effort, defect risk scor...
NLP techniques improve requirements management and team collaboration by extracting intent from natural-language artifacts (tickets, specs, PRs) and reducing miscommunication.
Synthesis of prior studies in the literature review and survey responses indicating perceived improvement in requirements handling and communication; survey sample size not reported.
medium positive Artificial Intelligence as a Catalyst for Innovation in Soft... perceived reduction in miscommunication / improved clarity of requirements
Greater ROI may come from investing in better feedback models (how to use feedback) than solely collecting richer feedback sources.
Empirical finding that feedback model choice often produced larger retrieval-quality improvements than changing the feedback source across the evaluated tasks and methods.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Return on investment (performance improvement per resource invested in model vs....
The study's results clarify which elements of the PRF design space are most important to prioritize in practice (i.e., prioritize feedback-model improvements over source collection in many low-resource settings).
Comparative performance gains observed in controlled experiments showing larger effect sizes from varying feedback model than from varying source, combined with cost analyses.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Relative impact on retrieval performance and cost-effectiveness
Across 13 low-resource BEIR tasks and five LLM PRF methods, the choice of feedback model (how feedback is applied) critically affects retrieval effectiveness.
Empirical results reported over 13 BEIR tasks using five LLM-based PRF methods, with systematic variation of feedback model.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Retrieval effectiveness (standard BEIR metrics)
Purely LLM-generated feedback yields the best cost-effectiveness overall (best performance per unit LLM invocation cost) for low-resource retrieval tasks.
Cost-effectiveness analysis in experiments across 13 BEIR tasks and five PRF methods that accounted for LLM invocation cost versus retrieval gains.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Cost-effectiveness (retrieval gains per LLM invocation cost)
Feedback model choice can have a larger impact on retrieval quality than feedback source.
Controlled experiments comparing five LLM-based PRF methods across 13 low-resource BEIR tasks, measuring retrieval effectiveness with standard BEIR metrics.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Retrieval effectiveness (standard BEIR retrieval metrics)
Demand will grow for hybrid specialists (quantum algorithm engineers, HPC systems integrators, middleware developers) and for domain scientists fluent in hybrid workflows, shifting skill premiums toward interdisciplinary expertise.
Labor-market inference from technology adoption and the skills required by proposed QCSC systems; qualitative only, no labor-market survey data provided.
medium positive Reference Architecture of a Quantum-Centric Supercomputer demand for specific skills, wage premiums for interdisciplinary expertise
Public investment and shared facilities can mitigate entry barriers and diffuse benefits to smaller firms and research groups.
Policy analysis and precedent from shared scientific infrastructure models; no case-study data specific to QCSC presented.
medium positive Reference Architecture of a Quantum-Centric Supercomputer access to QCSC resources by small firms/research groups, reduction in entry barr...
Tightly integrating QPUs, GPUs, and CPUs across hardware, middleware, and application layers (QCSC vision) will enable high-throughput, low-latency hybrid workflows.
Architectural design reasoning and analogies to heterogeneous co-design in classical HPC; no empirical throughput/latency measurements provided.
medium positive Reference Architecture of a Quantum-Centric Supercomputer throughput and end-to-end latency of hybrid quantum-classical workflows
A phased roadmap (offload engines → middleware-coupled heterogeneous systems → fully co-designed heterogeneous systems) and a reference architecture can remove current friction (manual orchestration, scheduling, data transfer) and materially accelerate algorithmic discovery and applied quantum utility.
Roadmap and reference architecture proposed from system decomposition and use-case requirements analysis; argument based on observed friction points from literature and early hybrid deployments; no empirical validation provided.
medium positive Reference Architecture of a Quantum-Centric Supercomputer reduction in manual orchestration, scheduling overhead, data-movement latency; i...
Quantum-Centric Supercomputing (QCSC) — integrated systems co-designing QPUs with classical HPC components and middleware — is necessary to scale hybrid quantum-classical algorithms for chemistry, materials, and other applied research.
Conceptual systems-architecture analysis and synthesis of recent quantum-simulation demonstrations and hybrid algorithms; use-case-driven analysis for chemistry and materials; no new empirical performance benchmarks presented.
medium positive Reference Architecture of a Quantum-Centric Supercomputer scalability and practicability of hybrid quantum-classical algorithm execution (...
DPS compares favorably to standard rollout-based prompt-selection baselines across the reported metrics (rollouts required, training speed, final accuracy).
Empirical comparisons against baseline methods reported in the experiments; specific numeric comparisons and statistical details are not present in the provided summary.
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... relative performance vs baseline on number of rollouts, training speed, and fina...
DPS creates a predictive prior that identifies informative prompts without performing exhaustive rollouts over large candidate batches.
Methodological mechanism plus empirical claim that selection operates via predictive prior and reduces candidate rollouts; supported by experiments vs rollout-filtering baselines.
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... informativeness of selected prompts (as implied by downstream learning gains and...
The DPS inference procedure requires only historical rollout reward signals and therefore adds only a small amount of extra compute compared to the rollouts it avoids.
Practical considerations described in the paper: inference uses past rollout rewards; authors state the extra compute is small relative to avoided rollouts. (No quantified compute-cost ratio in the summary.)
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... additional inference compute relative to avoided rollout compute
DPS improves final reasoning performance (final task accuracy) across evaluated domains: mathematical reasoning, planning, and visual-geometry tasks.
Empirical results reported across those benchmark domains showing improved downstream reasoning accuracy relative to baselines. (Summary does not include exact effect sizes or sample counts.)
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... final reasoning accuracy on benchmarks (mathematics, planning, visual-geometry)
DPS speeds up RL finetuning in terms of required rollout budgets and wall-clock rollout compute.
Reported empirical findings: faster convergence of RL finetuning measured by rollout budgets and wall-clock compute on evaluated tasks. (Exact runtime metrics and sample sizes not provided in the summary.)
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... training speed (rollout budget to convergence; wall-clock rollout compute)
Compared to standard online prompt-selection methods that rely on large candidate-batch rollouts for filtering, DPS substantially reduces the number of redundant (uninformative) rollouts.
Empirical comparisons against rollout-based filtering baselines across benchmark tasks (mathematics, planning, visual-geometry). Specific numeric savings not provided in the summary.
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... number of rollouts (redundant rollouts avoided)
AI-enabled forecasting can raise operational productivity by reducing forecasting error, stockouts, and excess inventory, but realized returns depend on organizational complements (processes, governance).
Authors' synthesis of case evidence where AI forecasting reduced errors and inventory problems, combined with the theoretical claim that organizational complements condition realized gains.
medium positive Optimizing integrated supply planning in logistics: Bridging... forecast error, stockout frequency, inventory levels, operational productivity
Critical enablers for successful ISP adoption include executive sponsorship, cross-functional processes, data quality/governance, shared KPIs, and continuous learning cycles.
Recurring themes identified across the five case studies and synthesized in the authors' cross-case analysis as necessary organizational complements.
medium positive Optimizing integrated supply planning in logistics: Bridging... successful ISP adoption and subsequent performance improvements
AI-enabled forecasting combined with ERP integration leads to better synchronization across procurement, production, inventory, and distribution; improved decision visibility; and reduced forecasting errors where implemented.
Reported outcomes from cases in which firms implemented AI forecasting and ERP integration; interviewees described improved synchronization and lower forecasting errors (qualitative reports rather than quantified effect sizes).
medium positive Optimizing integrated supply planning in logistics: Bridging... forecasting error (e.g., MAPE), synchronization metrics across functions, decisi...
Policy recommendations: economists and policymakers should perform cost–benefit analyses of explainability mandates, incentivize research into human-centered explanation methods, subsidize standards and certification infrastructure, and consider staged regulation balancing innovation with accountability in high-risk domains.
Prescriptive recommendations drawn by the paper's authors from the review of technical, social-science, and policy literatures; based on synthesis rather than empirical testing of policy impacts.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... policy design actions (cost–benefit analysis, incentives, subsidies, staged regu...
Clearer explanations and audit trails make it easier to assign responsibility and price risk (insurance markets, contract terms), potentially reducing uncertainty in public procurement and private contracts.
Economic and legal literature included in the review providing conceptual arguments and illustrative cases; no new empirical risk-pricing estimates provided in the paper.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... ability to assign responsibility; risk pricing and uncertainty in procurement/co...
Better explainability (when usable) raises willingness-to-adopt AI in regulated, risk-averse sectors by reducing information asymmetries and perceived liability—potentially expanding market size for explainable systems.
Economic and conceptual arguments synthesized from the reviewed literature; the review aggregates studies and arguments but does not present new quantitative adoption estimates.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... willingness-to-adopt AI; potential market size for explainable systems
Implementation requires organizational practices—governance, training, monitoring, and incentives—to translate explainability into safer, more legitimate AI use.
Synthesis of organizational, policy, and case-study literature in the review that identifies organizational measures correlated with effective deployment of explainable systems; descriptive evidence rather than causal experiments.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... safety and perceived legitimacy of AI deployment
Regulatory frameworks, auditability, documentation (e.g., model cards, datasheets), and clear lines of responsibility amplify the effectiveness of explainability for accountability and compliance.
Synthesis of policy and governance literature included in the review that discusses how institutional mechanisms interact with technical explainability to produce accountability; descriptive evidence from case studies and governance proposals in the literature.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... organizational accountability and regulatory compliance outcomes
Labor demand will increasingly favor skills that support effective Human–AI teaming (interpretation, interrogation of AI, systems orchestration, shared-model building) rather than routine task execution.
Implication drawn from the framework and literature on complementarity and skill-biased technological change; presented as an expectation rather than quantified by labor market data in the paper.
medium positive Toward a science of human–AI teaming for decision-making: A ... labor demand by skill type (employment shares, wage growth for non-routine teami...
Instituting continuous training, evaluation, and feedback loops is required to adapt Human–AI teams over time and maintain performance.
Prescriptive inference from organizational learning and human factors literature synthesized in the paper; suggested as best practice without empirical evaluation within the paper.
medium positive Toward a science of human–AI teaming for decision-making: A ... performance trajectories over time (learning curves), calibration of trust, adap...
Building knowledge infrastructures that capture, curate, and make provenance accessible is necessary for team knowledge continuity, accountability, and learning.
Conceptual recommendation informed by literature on knowledge management and provenance; no empirical measures or case studies reported to quantify impact.
medium positive Toward a science of human–AI teaming for decision-making: A ... knowledge availability, traceability/provenance metrics, learning/adaptation spe...
Partitioning roles — assigning pattern-detection tasks to AI and normative or contextual judgment to humans — improves task allocation based on comparative strengths.
Design recommendation derived from matching cognitive primitives to task types, supported conceptually by literature; not validated with empirical experiments in this paper.
medium positive Toward a science of human–AI teaming for decision-making: A ... task performance (accuracy, speed, decision quality) under role-partitioned work...
Complementarity requires structuring interactions so humans and AI amplify each other's strengths rather than substitute for one another.
Conceptual argument based on theoretical review of complementarity and collective intelligence; no empirical tests included.
medium positive Toward a science of human–AI teaming for decision-making: A ... degree of complementarity (interaction effects between human skill and AI capabi...
Aligning AI capabilities with human cognitive processes — reasoning, memory, and attention — is foundational to effective Human–AI teaming.
Theoretical grounding and literature synthesis drawing on cognitive science and human factors; proposed as a core lens for the framework rather than validated empirically in the paper.
medium positive Toward a science of human–AI teaming for decision-making: A ... team effectiveness (decision quality, error rate) as mediated by alignment with ...
Human–AI teams can achieve true complementarity such that joint team performance exceeds that of humans or AI alone.
Conceptual claim supported by an integrative, cross-disciplinary framework synthesizing literature from collective intelligence, cognitive science, AI, human factors, organizational behavior, and ethics. No primary empirical dataset or controlled experiments reported in the paper.
medium positive Toward a science of human–AI teaming for decision-making: A ... joint team performance (overall accuracy/quality of decisions compared to indivi...
Operationalizing explainability alongside monitoring (data-drift detection, retraining schedules) and usage rules stabilizes managerial outcomes and raises adoption/trust.
Argument supported by the pilot illustration and the paper's operational design; evidence primarily from single-case pilot and conceptual reasoning rather than multi-site causal testing.
medium positive ALGORITHM FOR IMPLEMENTING AI IN THE MANAGEMENT LOOP OF SMES... stability of managerial outcomes (e.g., consistent decision impact) and adoption...
Explainability (XAI) tools were integrated with the model and, together with operational quality controls (data-drift monitoring, retraining routines, and usage regulations), increased user trust and improved reproducibility of managerial impact in the pilot.
Pilot case study reporting integration of XAI and operational controls and reporting increases in user trust and reproducibility of managerial outcomes (single SME pilot; qualitative and quantitative details referenced but not listed in the summary).
medium positive ALGORITHM FOR IMPLEMENTING AI IN THE MANAGEMENT LOOP OF SMES... user trust (reported increase) and reproducibility of managerial impact (stabili...
A pilot implementation in an SME for inventory-demand forecasting used a gradient-boosting model which outperformed a business-as-usual baseline on forecasting accuracy metrics.
Single pilot case study reported in the paper: inventory-demand forecasting pilot comparing a gradient-boosting model to a baseline forecasting approach (sample: one SME pilot; specific implementation details and exact metrics not provided in the summary).
medium positive ALGORITHM FOR IMPLEMENTING AI IN THE MANAGEMENT LOOP OF SMES... forecasting accuracy (forecast error / accuracy metrics) of gradient-boosting mo...
Firms and governments should invest in continuous training, certification for AI‑augmented skills, and transition assistance to mitigate frictions.
Policy recommendation grounded in the paper's assessment of transition risks and complementarities; not based on program evaluation data.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... policy uptake and effectiveness (training participation rates, certification pre...
Likely increase in the skill premium for workers who can coordinate with and supervise AI (architecture, ethics, systems thinking), creating upward pressure on wages for those skill sets.
Economic reasoning about complementarity between AI capital and high‑skill labor; no wage‑level empirical analysis presented.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... wage changes by skill type (skill premium increase for AI‑complementary skills)
Short‑ to medium‑term productivity gains in software and digital‑product development are likely, lowering per‑unit development costs and accelerating release cycles.
Scenario reasoning and task automation/complementarity arguments extrapolating from current tools; no firm‑level productivity data analyzed.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... productivity metrics (output per developer, per‑unit development cost, release f...
Personalized, continuous learning through AI tutors and on‑the‑job assistants will lower some training frictions but raise the returns to upskilling.
Conceptual reasoning and examples of tutoring/assistive AI; not supported by empirical evaluation of learning outcomes or labor market returns.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... training frictions (time/cost to skill acquisition) and returns to upskilling (w...
AI will change how teams coordinate (automated status summaries, intelligent task routing, synthesis of asynchronous work), potentially speeding product cycles.
Scenario reasoning based on possible AI features in PM and collaboration tools; no measured changes in product cycle times presented.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... product cycle length / time‑to‑release and team coordination metrics (frequency ...
Demand will grow for skills complementary to AI: prompt‑engineering‑like skills, validation/verification, interpretability, governance, and stakeholder communication.
Qualitative reasoning about complementarities between human skills and AI capabilities and illustrative examples; no labor market data analyzed.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... demand for specific complementary skills (job postings, hiring rates for validat...
Practitioners will shift focus toward problem framing, architecture, system‑level reasoning, domain expertise, human‑centered design, and ethics as AI handles more routine tasks.
Task decomposition analysis identifying which tasks become complementary versus automatable; scenario reasoning about how remaining human tasks change; no empirical occupational data.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... change in time allocation and job task composition for tech practitioners (propo...