The Commonplace

Evidence (3224 claims)

Adoption (7395 claims)
Productivity (6507 claims)
Governance (5877 claims)
Human-AI Collaboration (5157 claims)
Innovation (3492 claims)
Org Design (3470 claims)
Labor Markets (3224 claims)
Skills & Training (2608 claims)
Inequality (1835 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome                    Positive Negative    Mixed     Null    Total
Other                           609      159       77      736     1615
Governance & Regulation         664      329      160       99     1273
Organizational Efficiency       624      143      105       70      949
Technology Adoption Rate        502      176       98       78      861
Research Productivity           348      109       48      322      836
Output Quality                  391      120       44       40      595
Firm Productivity               385       46       85       17      539
Decision Quality                275      143       62       34      521
AI Safety & Ethics              183      241       59       30      517
Market Structure                152      154      109       20      440
Task Allocation                 158       50       56       26      295
Innovation Output               178       23       38       17      257
Skill Acquisition               137       52       50       13      252
Fiscal & Macroeconomic          120       64       38       23      252
Employment Level                 93       46       96       12      249
Firm Revenue                    130       43       26        3      202
Consumer Welfare                 99       51       40       11      201
Inequality Measures              36      105       40        6      187
Task Completion Time            134       18        6        5      163
Worker Satisfaction              79       54       16       11      160
Error Rate                       64       78        8        1      151
Regulatory Compliance            69       64       14        3      150
Training Effectiveness           81       15       13       18      129
Wages & Compensation             70       25       22        6      123
Team Performance                 74       16       21        9      121
Automation Exposure              41       48       19        9      120
Job Displacement                 11       71       16        1       99
Developer Productivity           71       14        9        3       98
Hiring & Recruitment             49        7        8        3       67
Social Protection                26       14        8        2       50
Creative Output                  26       14        6        2       49
Skill Obsolescence                5       37        5        1       48
Labor Share of Income 12 13 12 37
Worker Turnover 11 12 3 26
Industry 1 1
Labor Markets
Applying causal inference methods (difference‑in‑differences, synthetic controls, instrumental variables, structural counterfactuals) can distinguish automation (task substitution) from augmentation (productivity/role change) and estimate net employment effects.
Methodological recommendation with examples of applicable identification strategies; no specific empirical applications or results reported in the paper.
medium positive Enhancing BLS Methodologies for Projecting AI's Impact on Em... causal estimates separating substitution vs augmentation effects; net employment...
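As a rough illustration of the first identification strategy named in this claim, the sketch below computes a two-group, two-period difference-in-differences estimate from group means. The function name and the employment index values are hypothetical and are not taken from the paper; the estimate is only meaningful under the usual parallel-trends assumption.

```python
from statistics import mean

def did_estimate(treated_pre, treated_post, control_pre, control_post):
    """Two-group, two-period difference-in-differences on group means.

    Returns (change in treated group) minus (change in control group).
    """
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change

# Hypothetical employment indices, invented for illustration only.
treated_pre = [100.0, 102.0, 98.0]    # AI-exposed occupations, before adoption
treated_post = [95.0, 97.0, 93.0]     # AI-exposed occupations, after adoption
control_pre = [100.0, 101.0, 99.0]    # less-exposed occupations, before
control_post = [99.0, 100.0, 98.0]    # less-exposed occupations, after

effect = did_estimate(treated_pre, treated_post, control_pre, control_post)
print(f"Estimated net employment effect: {effect:+.1f} index points")
```

With real data one would typically estimate this in a regression with controls and clustered standard errors; the mean-based version only shows the arithmetic.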
Integrating multiple data streams (CPS, LEHD/LODES, UI wage records, administrative microdata, job ads, occupational manuals, enterprise adoption surveys) yields richer gross‑flows and skills measurement than using single data sources.
Proposed data-integration strategy and references to candidate datasets; no empirical demonstration or quantified improvement in measurement presented.
medium positive Enhancing BLS Methodologies for Projecting AI's Impact on Em... quality of gross‑flows estimates (transition rates, spell durations), comprehens...
A dynamic Occupational AI Exposure Score (OAIES) can quantify exposure at the task level using LLMs, job‑task matrices (e.g., O*NET), and real‑time job ad / workplace data to capture evolving capability of AI systems.
Methodological description of OAIES construction (mapping tasks to occupations, LLM scoring, weighting by time use/criticality); no empirical implementation or validation data presented in the paper.
medium positive Enhancing BLS Methodologies for Projecting AI's Impact on Em... OAIES scores (task- and occupation-level exposure measures) with uncertainty int...
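The listing describes the OAIES as mapping tasks to occupations, scoring tasks, and weighting by time use and criticality, but gives no formula. The sketch below is one possible aggregation under stated assumptions: each task carries an exposure score in [0, 1], and the weight is the product of time share and criticality. The `Task` fields, the product weighting, and all numbers are hypothetical, not the paper's specification.

```python
from dataclasses import dataclass

@dataclass
class Task:
    name: str
    exposure: float     # assumed task-level exposure score in [0, 1]
    time_share: float   # assumed fraction of working time spent on the task
    criticality: float  # assumed importance weight, e.g. in [0, 1]

def occupation_exposure(tasks):
    """Weighted average of task exposure; weight = time_share * criticality."""
    weights = [t.time_share * t.criticality for t in tasks]
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one task needs a positive weight")
    return sum(w * t.exposure for w, t in zip(weights, tasks)) / total

# Hypothetical occupation with two equally weighted tasks.
tasks = [
    Task("draft routine reports", exposure=0.8, time_share=0.5, criticality=1.0),
    Task("meet clients in person", exposure=0.2, time_share=0.5, criticality=1.0),
]
score = occupation_exposure(tasks)
print(f"Occupation-level exposure: {score:.2f}")
```

Re-running the aggregation whenever task scores are refreshed is one simple way to keep such an index dynamic; uncertainty intervals would need repeated or bootstrapped scoring, which this sketch omits.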
Measurement and forecasting should move away from occupation-level forecasts toward task-level, continuously updated indicators linked to real-world adoption measures (firm purchases, API usage, procurement).
Recommendation in the paper motivated by rapid changes in AI capabilities and limitations of static indices; evidence basis is methodological argument and examples of richer adoption measures rather than a quantified evaluation of forecast improvements.
medium positive Recent Methodologies on AI and Labour - a Desk Review forecast accuracy and timeliness of AI exposure indicators
Policy should prioritise flexible reskilling and retraining programs targeted at high-risk tasks and low-skilled workers, informed by task-level exposure maps.
Policy implication recommended by the paper drawing on distributional findings (higher displacement risk for low-skilled tasks) and the availability of task-level exposure indices; evidence basis combines empirical pattern synthesis and normative recommendation rather than an RCT or program evaluation.
medium positive Recent Methodologies on AI and Labour - a Desk Review effectiveness of reskilling/training programs in mitigating displacement and imp...
Think tanks and international organisations are emphasising scenario planning under differing initial adoption conditions to inform reskilling and labour-market policy.
References to policy and scenario work by organisations named in the paper (TBI, IPPR, IMF; cited as TBI 2024, IPPR 2024, Korinek 2023); evidence basis is published scenario reports and policy papers rather than experimental data.
medium positive Recent Methodologies on AI and Labour - a Desk Review policy scenario outputs (projected employment/wage/productivity under alternativ...
Labor complementarities with agentic AI will shift resources toward oversight, interpretation, and coordination roles rather than routine task execution.
Economic and organizational reasoning; literature synthesis on skill complementarities; no empirical labor-market data analyzed in the paper.
medium positive Visioning Human-Agentic AI Teaming: Continuity, Tension, and... allocation of labor hours/roles toward oversight and coordination tasks
Principal–agent contracting frameworks must be extended to account for evolving agent objectives and open-ended action spaces; contracts should be dynamic and include continuous renegotiation and monitoring.
Theoretical extension and recommendations based on economic reasoning; proposed formal models for future work.
medium positive Visioning Human-Agentic AI Teaming: Continuity, Tension, and... adequacy of static contracting frameworks vs. proposed dynamic contracts
Projection congruence — alignment of forecasts/plans across heterogeneous agents — becomes a central metric for assessing alignment in agentic human–AI teams.
Conceptual modeling and proposal in the paper; introduced as a new measurable construct (projection congruence indices) for future empirical work.
medium positive Visioning Human-Agentic AI Teaming: Continuity, Tension, and... degree of congruence in projected trajectories between human and AI teammates
Continuous human-in-the-loop oversight, monitoring, and retraining are required to maintain quality and prevent model drift.
Practitioner reports and conceptual literature synthesized in the review advocating monitoring and retraining; no longitudinal empirical study provided here.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... model performance over time, incidence of drift, quality-control metrics
Transparent disclosure to customers about AI involvement helps preserve trust.
Conceptual analyses and referenced empirical/regulatory discussions in the literature aggregated by the review; this paper presents no new experimental evidence on disclosure effects.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... consumer trust/satisfaction as a function of disclosure of AI use
Hybrid designs that automate low-risk, high-volume tasks while routing complex, judgment-sensitive cases to humans produce the best operational outcomes.
Inferred best-practice from aggregated empirical studies, industry examples, and conceptual reasoning; no controlled comparative trials presented in this review.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... operational outcomes including cost, resolution quality, customer trust, and esc...
Agent augmentation via suggested responses, summarization, and information retrieval improves agent productivity.
Aggregated evidence from prior empirical research and practitioner reports cited in the review; no new measurements or sample sizes presented here.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... agent productivity metrics (e.g., response time, task throughput, resolution rat...
Generative AI enables personalization at scale through automated tailoring of messaging and recommendations.
Qualitative synthesis of empirical studies and industry reports showing automated personalization use-cases; no systematic effect-size estimates or new quantitative data in this review.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... degree of message personalization/recommendation relevance and scale (number of ...
Generative AI provides 24/7 availability and cost-effective scaling of routine interactions.
Industry case examples and prior empirical studies aggregated in the review; no original data or quantified sample sizes provided in this paper.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... availability (hours of operation), cost per interaction, throughput for routine ...
Generative AI can materially transform customer service and strategic communication by enabling continuous automation, scalable hyper-personalization, and effective agent augmentation.
Nano review: qualitative aggregation and synthesis of existing empirical studies, industry case examples, and conceptual analyses. No novel primary data or sample size; conclusion drawn from heterogeneous secondary sources and practitioner reports (not a systematic meta-analysis).
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... degree of automation, personalization scale, and agent productivity in customer ...
There is a need for standards around evaluation, bias mitigation, provenance, and accountability in AI-assisted ideation and design.
Policy recommendation motivated by documented biases, errors, and provenance issues in the reviewed studies; grounded in the synthesis's critique of existing practice.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... existence and adoption of evaluation/mitigation/provenance/accountability standa...
There will likely be complementarity-driven increases in demand for evaluative, integrative, and domain-expert roles (curators, synthesizers, implementation experts).
Inference from task-level studies and economic reasoning about complementarities between AI generative capability and human evaluative skills; empirical labor-market evidence is limited in the reviewed literature.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... employment demand for evaluative/integrative/domain-expert roles
Lower search and idea-generation costs enabled by LLMs may speed early-stage R&D and increase the gross flow of candidate innovations.
Theoretical economic interpretation supported by empirical findings of increased idea volumes in experimental/field studies summarized in the review; no long-run causal firm-level evidence presented.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... volume/rate of candidate ideas generated and pace of early-stage R&D activity
Generative AI accelerates early-stage hypothesis and prototype development by providing scaffolded prompts and procedural suggestions.
Applied case evidence and experimental studies summarized in the review showing reduced time or increased productivity in early-stage experimental/design tasks when using LLM assistance; no pooled effect size presented.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... time-to-hypothesis or prototype, number of prototype iterations in early-stage d...
Empirical studies document that AI-assisted tools can help break cognitive fixation and generate cross-domain analogies.
Cited experimental tasks and lab studies in the literature showing higher incidence of analogical or cross-domain suggestions from LLMs and improvements on fixation-related task metrics; heterogeneity across tasks and measures.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... frequency/quality of cross-domain analogies and fixation-related performance met...
Generative AI provides scaffolded, structured support that aids systematic hypothesis formation, prototyping steps, and decomposition of complex problems.
Review of design/ideation studies and applied case evidence where LLMs produced stepwise plans, decomposition prompts, or hypothesis scaffolds; evidence is drawn from multiple short-term experimental and applied studies, and sample sizes and exact designs vary by study.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... speed and/or quality of early-stage hypothesis generation and prototype developm...
Generative models rapidly produce many candidate ideas, analogies, and associative prompts that help overcome cognitive fixation.
Synthesis of experimental ideation and design studies reporting increases in number of ideas and examples of reduced fixation when participants used LLM outputs; heterogeneous sample sizes across cited studies (not reported in review).
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... idea quantity and measures of fixation (e.g., fixation errors, number of distinc...
Generative AI can raise per-worker productivity for tasks involving brainstorming, drafting, and prototyping, but realized gains depend on downstream filtering and implementation costs.
User studies showing higher output on specific tasks (brainstorming/drafting), combined with qualitative reports of filtering/implementation effort; many studies measure immediate task output but not net realized productivity after implementation.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... task output (ideas/drafts) per worker; downstream filtering effort; implemented ...
Generative AI can increase creative output in both lab and field tasks as judged by external raters.
Controlled experiments and field studies reporting higher judged creativity/novelty scores for AI-assisted outputs versus controls; judged creativity/novelty is typically assessed by human raters using rubric-based scoring.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... rated creativity/novelty scores; externally judged idea quality
AI assistance helps people overcome fixation and produces cross-domain analogies that they might not generate alone.
Experimental studies and qualitative analyses documenting reductions in fixation effects and increases in cross-domain analogical suggestions when participants use generative models.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... measures of fixation (e.g., repetition of prior solutions); count/quality of cro...
Generative AI supports systematic problem breakdown and early-stage prototyping, accelerating hypothesis generation and prototype development.
Field case studies of AI-supported prototyping and lab/user studies reporting reduced time-to-prototype and generated hypotheses; measures include time-to-prototype and user-reported usefulness.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... time-to-prototype; number/quality of generated hypotheses/prototypes; user-perce...
Generative AI boosts ideational fluency—the quantity and diversity of ideas produced in brainstorming tasks.
Controlled experiments and user studies measuring number and diversity of ideas with and without AI assistance; typical study designs compare participant idea counts/uniqueness across conditions (note: many studies use small or convenience samples).
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... number of ideas generated; diversity indices of ideas
When used as a 'cognitive co-pilot' that expands the solution space and challenges assumptions while humans curate and evaluate, generative AI creates economic value.
Inferred from experimental and field findings showing increased idea quantity/diversity and faster prototyping combined with qualitative studies showing human curation is needed; economic interpretation drawn from the review rather than direct macroeconomic measurement.
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... idea space breadth; time-to-prototype; downstream implemented/valued ideas (larg...
Generative AI serves a dual cognitive role: (1) a high-volume catalyst for divergent idea generation and cross-domain analogy-making, and (2) a structured assistant for deconstructing complex problems and scaffolding hypotheses and prototypes.
Synthesis of controlled experiments, lab studies, field case studies, and qualitative analyses summarized in the review; evidence includes measures of idea fluency/diversity, examples of analogy production, and observations of AI-assisted problem decomposition in prototyping tasks. (Note: underlying studies are heterogeneous and often short-term or convenience samples.)
medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... ideational fluency/diversity; incidence of cross-domain analogies; quality/speed...
Agent augmentation (drafting replies, summarizing histories, suggesting actions) raises frontline productivity and can improve response consistency.
Pilot deployments and internal A/B tests cited that measure time saved by agents and improvements in draft quality/consistency; mostly short-run and firm-specific reports.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... agent productivity (time per case saved), consistency of responses
Hyper-personalization at scale can increase relevance of responses and customer engagement when fed high-quality signals.
Case studies and pilot deployments that applied personalization signals (customer history, behavioral data) and reported improved relevance/engagement metrics; evidence conditional on availability and quality of signals and largely non-randomized.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... response relevance; customer engagement (clicks, session length, follow-up conta...
24/7 automation reduces routine handling time and operational costs for simple, repetitive queries.
Operational deployments and pilot studies reporting reduced handling times and cost-per-interaction for routine queries; some vendor-supplied before/after or A/B comparisons, but heterogeneous measurements and limited randomized evidence.
medium positive The Effectiveness of ChatGPT in Customer Service and Communi... routine handling time; operational cost per interaction
With appropriate policies and ecosystem building, AI offers strategic opportunities for 'leapfrogging' in service delivery (for example, healthcare diagnostics and precision agriculture) that can raise productivity and welfare.
Synthesis of case studies and prior empirical work showing promising AI applications; the assertion remains inferential and the paper calls for pilots and empirical validation.
medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... service delivery performance (diagnostic rates, agricultural yields), productivi...
Investing in human capital—technical skills, digital literacy, and institutional capacity—is critical for African actors to capture value from AI and to design culturally aligned systems.
Policy and academic literature synthesis linking human capital investment to technology adoption and innovation; no primary training program evaluation in the paper.
medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... number of trained AI professionals, digital literacy rates, local innovation out...
Context‑sensitive interventions—stronger governance, capacity building, multi‑stakeholder collaboration, and locally tailored strategies—are necessary to steer AI toward inclusive outcomes in Africa.
Policy and literature synthesis recommending interventions; recommendations are normative and inferential without empirical pilots in this paper.
medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... local capacity metrics (skills, institutions), stakeholder participation rates, ...
AI adoption in Africa is already transforming multiple sectors (healthcare, finance, agriculture, education, industry, governance) and has the potential to improve productivity, service delivery, and decision-making.
Desk-based literature synthesis of prior empirical studies, policy reports and case studies; no primary data or field experiments reported in this paper.
medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... sectoral productivity, service delivery quality, decision-making accuracy (e.g.,...
Audit cycles and inter-rater reliability studies should be used to improve assessment validity.
Suggested under Evaluation/Research Designs and Implementation Artifacts: the paper recommends systematic audits and inter-rater reliability studies as validity checks. This is a recommended practice, not an empirically validated result within the paper.
medium positive Curriculum engineering: organisation, orientation, and manag... assessment validity metrics (inter-rater reliability coefficients, audit consist...
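One common inter-rater reliability coefficient is Cohen's kappa for two raters. The paper does not say which coefficient it has in mind, so the choice of kappa, the function name, and the pass/fail ratings below are assumptions made for illustration.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty item list")
    n = len(rater_a)
    # Observed agreement: share of items with identical labels.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1.0 - expected)

# Hypothetical assessment outcomes from two assessors on four scripts.
rater_a = ["pass", "pass", "fail", "fail"]
rater_b = ["pass", "fail", "fail", "fail"]
kappa = cohens_kappa(rater_a, rater_b)
print(f"Cohen's kappa: {kappa:.2f}")
```

Here the raters agree on three of four items (0.75) against a chance agreement of 0.5, giving a kappa of 0.5.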
Better competency mapping and standardized, machine-readable program outputs facilitate automated matching platforms and reduce search/matching costs in AI labour markets.
Stated in Implications for AI Economics: the paper links machine-readable competency outputs to improved labour-market matching. This is a theoretical implication; no empirical matching-cost estimates are presented.
medium positive Curriculum engineering: organisation, orientation, and manag... matching efficiency (time-to-hire, vacancy durations), search costs
The approach increases traceability and compliance readiness, facilitating audits and regulatory verification.
Paper cites audit-ready documentation, systematic audits, and versioned curriculum artifacts as outputs and recommends audit cycles and inter-rater reliability studies. This is an asserted benefit without reported empirical testing.
medium positive Curriculum engineering: organisation, orientation, and manag... compliance scores, audit findings, ability to support third-party verification
IT integration is necessary for documentation, traceability, and continuous monitoring of curriculum artifacts.
Listed among core components and implementation artifacts (version-controlled documentation, traceability logs, IT-backed traceability). Support is prescriptive and conceptual rather than empirical.
medium positive Curriculum engineering: organisation, orientation, and manag... documentation traceability (presence of version control, audit logs), monitoring...
Logical modelling tools (logigrams and algorigrams) support lesson planning and audits by formalising decision rules and automated workflows.
Described as a core component and implementation artifact; paper explains process modelling using logigrams/algorigrams to formalise instructional algorithms and audit workflows. No empirical validation provided.
medium positive Curriculum engineering: organisation, orientation, and manag... degree of formalisation of lesson plans and audit workflows; consistency/repeata...
A curriculum-engineering framework that combines organisational orientation, management-system investigation, audit-ready documentation, and logical modelling (logigrams/algorigrams) can produce traceable, compliance-aligned lesson plans and career-pathway outputs.
Presented as the paper's main finding and framework design: description of core components (organisational orientation, management systems, audit-ready documentation, logigrams/algorigrams) and the claimed outputs. No empirical trial results, sample sizes, or quantitative validation are reported; the support is conceptual and methodological.
medium positive Curriculum engineering: organisation, orientation, and manag... traceability and compliance alignment of lesson plans and career-pathway documen...
Aligning the dynamic equivalency framework with UNESCO and SADC mutual recognition instruments will support cross-border acceptance of equivalency decisions.
Normative/legal recommendation referencing international/regional instruments; no case-study evidence showing increased acceptance after alignment is presented.
medium positive Establishes a technical and academic bridge between the educ... cross-border recognition rate of equivalency decisions, number of mutual recogni...
Operations Research / probabilistic models can estimate the probability of successful professional integration given measurable inputs (e.g., hours, equipment, faculty qualifications, grades).
Proposed analytical approach in the paper describing OR models and predictive variables; no model calibration, holdout validation data, or predictive performance metrics presented.
medium positive Establishes a technical and academic bridge between the educ... predicted probability of professional integration; predictive validity against o...
Statistical sequencing and anomaly detection methods can identify irregular grading patterns across regions and institutions.
Methodological proposal referencing time-series and statistical sequencing techniques for anomaly detection; no applied dataset, detection rates, or validation sample size reported.
medium positive Establishes a technical and academic bridge between the educ... anomaly detection rate, false positive and false negative rates in grade irregul...
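The paper reports no applied method or dataset, so the following is only a minimal sketch of one simple screening approach, a z-score of each institution's mean grade against its peers. It is not the statistical sequencing technique the paper refers to; the institution labels, grade values, and the 1.5 threshold are all hypothetical.

```python
from statistics import mean, pstdev

def flag_grade_anomalies(mean_grades, threshold=1.5):
    """Return institutions whose mean grade lies more than `threshold`
    population standard deviations from the cross-institution mean."""
    values = list(mean_grades.values())
    centre = mean(values)
    spread = pstdev(values)
    if spread == 0:
        return []  # all institutions identical, nothing to flag
    return sorted(
        name for name, grade in mean_grades.items()
        if abs(grade - centre) / spread > threshold
    )

# Hypothetical mean grades (percent) for five institutions.
mean_grades = {"A": 62.0, "B": 64.0, "C": 63.0, "D": 61.0, "E": 90.0}
flagged = flag_grade_anomalies(mean_grades)
print("Flagged for review:", flagged)
```

A flag here is a prompt for audit rather than evidence of irregular grading; estimating false positive and false negative rates would need labelled cases, which the paper does not provide.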
A dual-layer audit — technical audit (verify workshop hours, laboratory equipment, faculty qualifications) plus system audit (validate data-analysis models) — is necessary to make equivalency decisions valid and defensible.
Prescriptive audit design described in the paper, with recommended verification items and model-validation steps; no audit trial or measured effect sizes reported.
medium positive Establishes a technical and academic bridge between the educ... audit pass rates, reduction in fraudulent/invalid equivalency certifications, le...
A centralized MIS enables centralized verification, easier longitudinal tracking, and streamlined credential processing.
Stated operational advantages drawn from systems-design reasoning and described data workflows (student records, transcripts, lab logs); no quantitative performance data or pilot comparisons provided.
medium positive Establishes a technical and academic bridge between the educ... credential processing time, verification accuracy, completeness of longitudinal ...
The framework should combine a centralized Management Information System (MIS), operations-research validation models, and a dual-layer audit (technical + system).
Design prescription in the paper synthesizing technical, statistical, and governance requirements; described methods include MIS data schemas, OR models, and audit protocols; no implemented pilot or evaluation reported.
medium positive Establishes a technical and academic bridge between the educ... robustness and defensibility of equivalency decisions (measured by reproducibili...
A dynamic, data-driven Qualification Framework Equivalency is required to translate DRC technical qualifications (Diplôme d'État, Graduat/Licence) into South Africa’s NQF (levels 1–10).
Argument based on gap analysis of curricula, proposed operations-research validation models, and system design rationale presented in the paper; no empirical trial or sample size reported.
medium positive Establishes a technical and academic bridge between the educ... validity/accuracy of equivalency assignments between DRC technical qualification...