The Commonplace
Home Dashboard Papers Evidence Digests 🎲

Evidence (5267 claims)

Adoption
5267 claims
Productivity
4560 claims
Governance
4137 claims
Human-AI Collaboration
3103 claims
Labor Markets
2506 claims
Innovation
2354 claims
Org Design
2340 claims
Skills & Training
1945 claims
Inequality
1322 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 378 106 59 455 1007
Governance & Regulation 379 176 116 58 739
Research Productivity 240 96 34 294 668
Organizational Efficiency 370 82 63 35 553
Technology Adoption Rate 296 118 66 29 513
Firm Productivity 277 34 68 10 394
AI Safety & Ethics 117 177 44 24 364
Output Quality 244 61 23 26 354
Market Structure 107 123 85 14 334
Decision Quality 168 74 37 19 301
Fiscal & Macroeconomic 75 52 32 21 187
Employment Level 70 32 74 8 186
Skill Acquisition 89 32 39 9 169
Firm Revenue 96 34 22 152
Innovation Output 106 12 21 11 151
Consumer Welfare 70 30 37 7 144
Regulatory Compliance 52 61 13 3 129
Inequality Measures 24 68 31 4 127
Task Allocation 75 11 29 6 121
Training Effectiveness 55 12 12 16 96
Error Rate 42 48 6 96
Worker Satisfaction 45 32 11 6 94
Task Completion Time 78 5 4 2 89
Wages & Compensation 46 13 19 5 83
Team Performance 44 9 15 7 76
Hiring & Recruitment 39 4 6 3 52
Automation Exposure 18 17 9 5 50
Job Displacement 5 31 12 48
Social Protection 21 10 6 2 39
Developer Productivity 29 3 3 1 36
Worker Turnover 10 12 3 25
Skill Obsolescence 3 19 2 24
Creative Output 15 5 3 1 24
Labor Share of Income 10 4 9 23
Clear
Adoption Remove filter
The study's results clarify which elements of the PRF design space are most important to prioritize in practice (i.e., prioritize feedback-model improvements over source collection in many low-resource settings).
Comparative performance gains observed in controlled experiments showing larger effect sizes from varying feedback model than from varying source, combined with cost analyses.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Relative impact on retrieval performance and cost-effectiveness
Across 13 low-resource BEIR tasks and five LLM PRF methods, the choice of feedback model (how feedback is applied) critically affects retrieval effectiveness.
Empirical results reported over 13 BEIR tasks using five LLM-based PRF methods, with systematic variation of feedback model.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Retrieval effectiveness (standard BEIR metrics)
Purely LLM-generated feedback yields the best cost-effectiveness overall (best performance per unit LLM invocation cost) for low-resource retrieval tasks.
Cost-effectiveness analysis in experiments across 13 BEIR tasks and five PRF methods that accounted for LLM invocation cost versus retrieval gains.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Cost-effectiveness (retrieval gains per LLM invocation cost)
Feedback model choice can have a larger impact on retrieval quality than feedback source.
Controlled experiments comparing five LLM-based PRF methods across 13 low-resource BEIR tasks, measuring retrieval effectiveness with standard BEIR metrics.
medium positive A Systematic Study of Pseudo-Relevance Feedback with LLMs Retrieval effectiveness (standard BEIR retrieval metrics)
Demand will grow for hybrid specialists (quantum algorithm engineers, HPC systems integrators, middleware developers) and for domain scientists fluent in hybrid workflows, shifting skill premiums toward interdisciplinary expertise.
Labor-market inference from technology adoption and the skills required by proposed QCSC systems; qualitative only, no labor-market survey data provided.
medium positive Reference Architecture of a Quantum-Centric Supercomputer demand for specific skills, wage premiums for interdisciplinary expertise
Public investment and shared facilities can mitigate entry barriers and diffuse benefits to smaller firms and research groups.
Policy analysis and precedent from shared scientific infrastructure models; no case-study data specific to QCSC presented.
medium positive Reference Architecture of a Quantum-Centric Supercomputer access to QCSC resources by small firms/research groups, reduction in entry barr...
Tightly integrating QPUs, GPUs, and CPUs across hardware, middleware, and application layers (QCSC vision) will enable high-throughput, low-latency hybrid workflows.
Architectural design reasoning and analogies to heterogeneous co-design in classical HPC; no empirical throughput/latency measurements provided.
medium positive Reference Architecture of a Quantum-Centric Supercomputer throughput and end-to-end latency of hybrid quantum-classical workflows
A phased roadmap (offload engines → middleware-coupled heterogeneous systems → fully co-designed heterogeneous systems) and a reference architecture can remove current friction (manual orchestration, scheduling, data transfer) and materially accelerate algorithmic discovery and applied quantum utility.
Roadmap and reference architecture proposed from system decomposition and use-case requirements analysis; argument based on observed friction points from literature and early hybrid deployments; no empirical validation provided.
medium positive Reference Architecture of a Quantum-Centric Supercomputer reduction in manual orchestration, scheduling overhead, data-movement latency; i...
Quantum-Centric Supercomputing (QCSC) — integrated systems co-designing QPUs with classical HPC components and middleware — is necessary to scale hybrid quantum-classical algorithms for chemistry, materials, and other applied research.
Conceptual systems-architecture analysis and synthesis of recent quantum-simulation demonstrations and hybrid algorithms; use-case-driven analysis for chemistry and materials; no new empirical performance benchmarks presented.
medium positive Reference Architecture of a Quantum-Centric Supercomputer scalability and practicability of hybrid quantum-classical algorithm execution (...
Adoption of GNN-based, FL-coordinated beam management can provide competitive differentiation by offering more reliable NTN services in challenging geometries (e.g., low-elevation, edge coverage).
Synthesized implication from experimental results showing improved GNN performance at low elevation angles and the marketing/economic discussion in the paper; no market adoption or field-deployment evidence provided.
medium positive Federated Learning-driven Beam Management in LEO 6G Non-Terr... service reliability in challenging geometries (e.g., low-elevation coverage) and...
FL via HAPS reduces data-centralization costs (bandwidth and storage) and improves privacy compared to sending raw channel data to a central server.
Implication drawn from the FL design used: federated aggregation reduces need to backhaul raw channel samples; paper lists bandwidth/storage and privacy advantages as economic/operational implications (no quantified cost measurements provided).
medium positive Federated Learning-driven Beam Management in LEO 6G Non-Terr... backhaul bandwidth and storage requirements; privacy exposure (qualitative)
The GNN solution is lightweight enough for practical on-board or edge deployment in NTN contexts.
Paper asserts the GNN is lightweight and suitable for on-board or HAPS/edge deployment; model described as designed to be compact for constrained compute/link budgets (no exact parameter counts provided in summary).
medium positive Federated Learning-driven Beam Management in LEO 6G Non-Terr... compute/footprint suitability for on-board or edge deployment (model lightweight...
Federated learning across LEO orbital planes, coordinated via HAPS, enables efficient distributed beam selection for Non-Terrestrial Networks (NTNs).
Experimental design in the paper: federated learning paradigm with orbital-plane clients and HAPS acting as aggregation/coordination points; evaluated on beam-prediction tasks using realistic channel/beamforming datasets and distributed training (no central pooling of raw samples).
medium positive Federated Learning-driven Beam Management in LEO 6G Non-Terr... beam prediction accuracy and stability in a distributed (federated) training set...
DPS compares favorably to standard rollout-based prompt-selection baselines across the reported metrics (rollouts required, training speed, final accuracy).
Empirical comparisons against baseline methods reported in the experiments; specific numeric comparisons and statistical details are not present in the provided summary.
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... relative performance vs baseline on number of rollouts, training speed, and fina...
DPS creates a predictive prior that identifies informative prompts without performing exhaustive rollouts over large candidate batches.
Methodological mechanism plus empirical claim that selection operates via predictive prior and reduces candidate rollouts; supported by experiments vs rollout-filtering baselines.
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... informativeness of selected prompts (as implied by downstream learning gains and...
The DPS inference procedure requires only historical rollout reward signals and therefore adds only a small amount of extra compute compared to the rollouts it avoids.
Practical considerations described in the paper: inference uses past rollout rewards; authors state the extra compute is small relative to avoided rollouts. (No quantified compute-cost ratio in the summary.)
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... additional inference compute relative to avoided rollout compute
DPS improves final reasoning performance (final task accuracy) across evaluated domains: mathematical reasoning, planning, and visual-geometry tasks.
Empirical results reported across those benchmark domains showing improved downstream reasoning accuracy relative to baselines. (Summary does not include exact effect sizes or sample counts.)
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... final reasoning accuracy on benchmarks (mathematics, planning, visual-geometry)
DPS speeds up RL finetuning in terms of required rollout budgets and wall-clock rollout compute.
Reported empirical findings: faster convergence of RL finetuning measured by rollout budgets and wall-clock compute on evaluated tasks. (Exact runtime metrics and sample sizes not provided in the summary.)
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... training speed (rollout budget to convergence; wall-clock rollout compute)
Compared to standard online prompt-selection methods that rely on large candidate-batch rollouts for filtering, DPS substantially reduces the number of redundant (uninformative) rollouts.
Empirical comparisons against rollout-based filtering baselines across benchmark tasks (mathematics, planning, visual-geometry). Specific numeric savings not provided in the summary.
medium positive Dynamics-Predictive Sampling for Active RL Finetuning of Lar... number of rollouts (redundant rollouts avoided)
Structural fixes — altering environment design or policy class to ensure the induced Markov chain is ergodic (e.g., ensuring mixing/recurrence or preventing absorbing bad states) — can eliminate the ensemble/time-average gap.
Paper discussion and examples suggesting interventions to change chain structure; conceptual/theoretical proposal supported by illustrative examples (no empirical deployment studies).
medium positive Ergodicity in reinforcement learning ergodicity of induced dynamics and resulting alignment of ensemble and time-aver...
Robust/adversarial and model-uncertainty methods can hedge against trajectories that lead to poor long-run behavior and thus mitigate risks from non-ergodic dynamics.
Survey of robust control and adversarial RL approaches in the paper; conceptual argument linking robustness to protection against adverse sample paths; no new empirical tests.
medium positive Ergodicity in reinforcement learning worst-case or adversarial long-run reward under uncertainty
Ergodic control and sample-path optimality formulations recast control objectives in terms of time averages or almost-sure sample-path criteria rather than ensemble expectations and are therefore appropriate for single-trajectory performance targets.
Survey and formal discussion in the paper connecting ergodic control literature to single-trajectory objectives; theoretical references summarized.
medium positive Ergodicity in reinforcement learning time-average/sample-path optimality of control policies
Almost-sure and probabilistic constraint methods (chance constraints, safe RL) can enforce that long-run performance exceeds thresholds with high probability, addressing single-trajectory guarantees.
Surveyed methodologies and references in the paper describing chance-constrained and safe RL formulations; conceptual synthesis rather than empirical demonstration.
medium positive Ergodicity in reinforcement learning probability that long-run/time-average performance exceeds a threshold (chance c...
Distributional reinforcement learning (optimizing the full return distribution) enables optimizing objectives such as median, lower quantiles, or CVaR which better reflect single-run guarantees.
Literature survey in the paper citing distributional RL approaches and linking them conceptually to single-trajectory guarantees; no new experiments provided.
medium positive Ergodicity in reinforcement learning statistics of the return distribution (median, quantiles, CVaR) relevant to sing...
Risk-sensitive and utility-based objectives (e.g., maximize expected utility such as log-utility or minimize downside risk) can produce policies that prefer more reliable time-average outcomes compared to raw expected-reward objectives.
Surveyed literature in the paper summarizing risk-sensitive and utility-based RL approaches; conceptual argument rather than new empirical validation.
medium positive Ergodicity in reinforcement learning time-average reliability or downside risk of realized reward under risk-sensitiv...
Numerical simulations confirm the analytic extreme-value scaling for earliest discoveries and demonstrate that introducing non-reciprocal biases leads to stable monopolies whereas symmetric interactions do not.
Numerical simulations (stochastic realizations) reported in the paper used to validate analytic predictions and illustrate dynamical outcomes; however, the summary does not specify simulation sample sizes, parameter sweeps, or robustness checks.
medium positive Macroscopic Dominance from Microscopic Extremes: Symmetry Br... agreement between analytic scaling and simulation results (first-passage extreme...
AI-enabled forecasting can raise operational productivity by reducing forecasting error, stockouts, and excess inventory, but realized returns depend on organizational complements (processes, governance).
Authors' synthesis of case evidence where AI forecasting reduced errors and inventory problems, combined with the theoretical claim that organizational complements condition realized gains.
medium positive Optimizing integrated supply planning in logistics: Bridging... forecast error, stockout frequency, inventory levels, operational productivity
Critical enablers for successful ISP adoption include executive sponsorship, cross-functional processes, data quality/governance, shared KPIs, and continuous learning cycles.
Recurring themes identified across the five case studies and synthesized in the authors' cross-case analysis as necessary organizational complements.
medium positive Optimizing integrated supply planning in logistics: Bridging... successful ISP adoption and subsequent performance improvements
AI-enabled forecasting combined with ERP integration leads to better synchronization across procurement, production, inventory, and distribution; improved decision visibility; and reduced forecasting errors where implemented.
Reported outcomes from cases in which firms implemented AI forecasting and ERP integration; interviewees described improved synchronization and lower forecasting errors (qualitative reports rather than quantified effect sizes).
medium positive Optimizing integrated supply planning in logistics: Bridging... forecasting error (e.g., MAPE), synchronization metrics across functions, decisi...
Policy recommendations: economists and policymakers should perform cost–benefit analyses of explainability mandates, incentivize research into human-centered explanation methods, subsidize standards and certification infrastructure, and consider staged regulation balancing innovation with accountability in high-risk domains.
Prescriptive recommendations drawn by the paper's authors from the review of technical, social-science, and policy literatures; based on synthesis rather than empirical testing of policy impacts.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... policy design actions (cost–benefit analysis, incentives, subsidies, staged regu...
Clearer explanations and audit trails make it easier to assign responsibility and price risk (insurance markets, contract terms), potentially reducing uncertainty in public procurement and private contracts.
Economic and legal literature included in the review providing conceptual arguments and illustrative cases; no new empirical risk-pricing estimates provided in the paper.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... ability to assign responsibility; risk pricing and uncertainty in procurement/co...
Better explainability (when usable) raises willingness-to-adopt AI in regulated, risk-averse sectors by reducing information asymmetries and perceived liability—potentially expanding market size for explainable systems.
Economic and conceptual arguments synthesized from the reviewed literature; the review aggregates studies and arguments but does not present new quantitative adoption estimates.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... willingness-to-adopt AI; potential market size for explainable systems
Implementation requires organizational practices—governance, training, monitoring, and incentives—to translate explainability into safer, more legitimate AI use.
Synthesis of organizational, policy, and case-study literature in the review that identifies organizational measures correlated with effective deployment of explainable systems; descriptive evidence rather than causal experiments.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... safety and perceived legitimacy of AI deployment
Regulatory frameworks, auditability, documentation (e.g., model cards, datasheets), and clear lines of responsibility amplify the effectiveness of explainability for accountability and compliance.
Synthesis of policy and governance literature included in the review that discusses how institutional mechanisms interact with technical explainability to produce accountability; descriptive evidence from case studies and governance proposals in the literature.
medium positive Explainable AI in High-Stakes Domains: Improving Trust, Tran... organizational accountability and regulatory compliance outcomes
Operationalizing explainability alongside monitoring (data-drift detection, retraining schedules) and usage rules stabilizes managerial outcomes and raises adoption/trust.
Argument supported by the pilot illustration and the paper's operational design; evidence primarily from single-case pilot and conceptual reasoning rather than multi-site causal testing.
medium positive ALGORITHM FOR IMPLEMENTING AI IN THE MANAGEMENT LOOP OF SMES... stability of managerial outcomes (e.g., consistent decision impact) and adoption...
Explainability (XAI) tools were integrated with the model and, together with operational quality controls (data-drift monitoring, retraining routines, and usage regulations), increased user trust and improved reproducibility of managerial impact in the pilot.
Pilot case study reporting integration of XAI and operational controls and reporting increases in user trust and reproducibility of managerial outcomes (single SME pilot; qualitative and quantitative details referenced but not listed in the summary).
medium positive ALGORITHM FOR IMPLEMENTING AI IN THE MANAGEMENT LOOP OF SMES... user trust (reported increase) and reproducibility of managerial impact (stabili...
A pilot implementation in an SME for inventory-demand forecasting used a gradient-boosting model which outperformed a business-as-usual baseline on forecasting accuracy metrics.
Single pilot case study reported in the paper: inventory-demand forecasting pilot comparing a gradient-boosting model to a baseline forecasting approach (sample: one SME pilot; specific implementation details and exact metrics not provided in the summary).
medium positive ALGORITHM FOR IMPLEMENTING AI IN THE MANAGEMENT LOOP OF SMES... forecasting accuracy (forecast error / accuracy metrics) of gradient-boosting mo...
Firms and governments should invest in continuous training, certification for AI‑augmented skills, and transition assistance to mitigate frictions.
Policy recommendation grounded in the paper's assessment of transition risks and complementarities; not based on program evaluation data.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... policy uptake and effectiveness (training participation rates, certification pre...
Likely increase in the skill premium for workers who can coordinate with and supervise AI (architecture, ethics, systems thinking), creating upward pressure on wages for those skill sets.
Economic reasoning about complementarity between AI capital and high‑skill labor; no wage‑level empirical analysis presented.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... wage changes by skill type (skill premium increase for AI‑complementary skills)
Short‑ to medium‑term productivity gains in software and digital‑product development are likely, lowering per‑unit development costs and accelerating release cycles.
Scenario reasoning and task automation/complementarity arguments extrapolating from current tools; no firm‑level productivity data analyzed.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... productivity metrics (output per developer, per‑unit development cost, release f...
Personalized, continuous learning through AI tutors and on‑the‑job assistants will lower some training frictions but raise the returns to upskilling.
Conceptual reasoning and examples of tutoring/assistive AI; not supported by empirical evaluation of learning outcomes or labor market returns.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... training frictions (time/cost to skill acquisition) and returns to upskilling (w...
AI will change how teams coordinate (automated status summaries, intelligent task routing, synthesis of asynchronous work), potentially speeding product cycles.
Scenario reasoning based on possible AI features in PM and collaboration tools; no measured changes in product cycle times presented.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... product cycle length / time‑to‑release and team coordination metrics (frequency ...
Demand will grow for skills complementary to AI: prompt‑engineering‑like skills, validation/verification, interpretability, governance, and stakeholder communication.
Qualitative reasoning about complementarities between human skills and AI capabilities and illustrative examples; no labor market data analyzed.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... demand for specific complementary skills (job postings, hiring rates for validat...
Practitioners will shift focus toward problem framing, architecture, system‑level reasoning, domain expertise, human‑centered design, and ethics as AI handles more routine tasks.
Task decomposition analysis identifying which tasks become complementary versus automatable; scenario reasoning about how remaining human tasks change; no empirical occupational data.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... change in time allocation and job task composition for tech practitioners (propo...
AI will assist with design through adaptive interfaces, automated usability testing, and rapid prototype generation.
Illustrative examples of AI in design tooling and conceptual reasoning about model capabilities; not supported by systematic user studies in the paper.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... extent of AI usage in design tasks (adaptive UI changes, automated usability tes...
Autonomous code generation, refactoring, test creation, and automated security linting will become common capabilities of the AI co‑pilot.
Extrapolation from current large models and developer tool features, plus scenario reasoning; no empirical prevalence rates provided.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... prevalence of autonomous capabilities in developer‑facing AI (code generation, r...
AI‑driven assistants will be embedded in IDEs, design tools, project‑management platforms, and CI/CD pipelines.
Observation of current developer tooling trends and illustrative examples of existing integrations; scenario reasoning in a task‑based decomposition framework; no systematic adoption data.
medium positive How AI Will Transform the Daily Life of a Techie within 5 Ye... presence and extent of AI integrations in developer tooling (IDE, design, PM, CI...
Firms will reallocate investment toward cloud infrastructure, data engineering, model ops, and financial data integration, favoring vendors providing interoperable, audit-friendly solutions.
Predictive claim about investment incentives based on the paper's architectural and governance analysis; no spending data or vendor market-share evidence presented.
medium positive Next-Generation Financial Analytics Frameworks for AI-Enable... IT/technology spend composition (e.g., percent of budget on cloud/data engineeri...
Next-generation financial analytics frameworks embed AI (ML, NLP, anomaly detection) into core financial systems to shift enterprises from retrospective reporting to predictive, prescriptive, and real-time decision-making.
This is the paper's central conceptual claim supported by a descriptive synthesis of AI techniques and system architecture; no empirical sample, controlled experiments, or deployment case data are presented—recommendations are justified by logical argument and examples of techniques.
medium positive Next-Generation Financial Analytics Frameworks for AI-Enable... degree of shift from retrospective reporting to predictive/prescriptive/real-tim...
Documented benefits of structured risk management include improved organizational resilience and stability under uncertainty.
Synthesis of claims in the literature reviewed; secondary cross-sectional evidence from peer-reviewed articles and practitioner sources within the ten-year scope (no primary quantitative validation in this review).
medium positive The Role of Risk Management as an Organizational Management ... organizational resilience; stability under uncertainty