Evidence (4560 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	378	106	59	455	1007
Governance & Regulation	379	176	116	58	739
Research Productivity	240	96	34	294	668
Organizational Efficiency	370	82	63	35	553
Technology Adoption Rate	296	118	66	29	513
Firm Productivity	277	34	68	10	394
AI Safety & Ethics	117	177	44	24	364
Output Quality	244	61	23	26	354
Market Structure	107	123	85	14	334
Decision Quality	168	74	37	19	301
Fiscal & Macroeconomic	75	52	32	21	187
Employment Level	70	32	74	8	186
Skill Acquisition	89	32	39	9	169
Firm Revenue	96	34	22	—	152
Innovation Output	106	12	21	11	151
Consumer Welfare	70	30	37	7	144
Regulatory Compliance	52	61	13	3	129
Inequality Measures	24	68	31	4	127
Task Allocation	75	11	29	6	121
Training Effectiveness	55	12	12	16	96
Error Rate	42	48	6	—	96
Worker Satisfaction	45	32	11	6	94
Task Completion Time	78	5	4	2	89
Wages & Compensation	46	13	19	5	83
Team Performance	44	9	15	7	76
Hiring & Recruitment	39	4	6	3	52
Automation Exposure	18	17	9	5	50
Job Displacement	5	31	12	—	48
Social Protection	21	10	6	2	39
Developer Productivity	29	3	3	1	36
Worker Turnover	10	12	—	3	25
Skill Obsolescence	3	19	2	—	24
Creative Output	15	5	3	1	24
Labor Share of Income	10	4	9	—	23

Filter: Productivity

Continuous online adaptation of models and policies—updating from streaming user interactions—enables per-session and lifetime personalization that improves engagement and conversion outcomes.

Modeling pipeline includes streaming updates and online adaptation; evaluations include online experiments and retention/engagement measurements. (No numerical magnitudes or update frequencies provided.)

medium positive Personalized Content Selection in Marketing Using BERT and G... per-session CTR, engagement metrics, conversion rate, retention

An RL layer that formulates content selection as a contextual bandit / policy optimisation problem improves content selection and delivery using real-time reward signals (CTR, dwell time, conversions).

Paper describes RL-based policy optimisation using reward signals (CTR, session length, conversion events, LTV proxies) and reports online experiments/A/B tests where adaptive policies outperform static rules; exact algorithms and sample sizes not detailed.

medium positive Personalized Content Selection in Marketing Using BERT and G... CTR, session length (dwell time), conversion events, lifetime value proxies
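The contextual-bandit framing in the claim above can be sketched as follows. This is a minimal illustration only: the paper does not detail its algorithms, so the epsilon-greedy rule, the class name `EpsilonGreedyBandit`, the context labels, and the variant names are all assumptions made for this example, with a click (1.0) or no click (0.0) standing in for the reward signal.

```python
import random
from collections import defaultdict


class EpsilonGreedyBandit:
    """Hypothetical per-context epsilon-greedy bandit over content variants.

    Not the paper's method; an illustrative stand-in for "content selection
    as a contextual bandit" driven by real-time reward signals such as CTR.
    """

    def __init__(self, arms, epsilon=0.1, seed=0):
        self.arms = list(arms)
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        # (context, arm) -> [number of pulls, summed reward]
        self.stats = defaultdict(lambda: [0, 0.0])

    def mean_reward(self, context, arm):
        pulls, total = self.stats[(context, arm)]
        return total / pulls if pulls else 0.0

    def best_arm(self, context):
        # Greedy choice: the variant with the highest observed mean reward.
        return max(self.arms, key=lambda arm: self.mean_reward(context, arm))

    def select(self, context):
        # Explore with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.arms)
        return self.best_arm(context)

    def update(self, context, arm, reward):
        # Fold one observed reward (e.g. click = 1.0) into the running stats.
        entry = self.stats[(context, arm)]
        entry[0] += 1
        entry[1] += reward


# Usage: record one impression per variant, then ask for the next choice.
bandit = EpsilonGreedyBandit(["banner_a", "banner_b"], epsilon=0.0)
bandit.update("mobile", "banner_a", 0.0)  # shown, not clicked
bandit.update("mobile", "banner_b", 1.0)  # shown, clicked
next_variant = bandit.select("mobile")
```

A production system would more plausibly use a learned policy over rich context features; the tabular version here only shows the select/observe/update loop that distinguishes adaptive policies from static rules.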

RAG anchors generated content to up-to-date product/catalog/contextual knowledge and reduces hallucinations, increasing factuality of marketing messages.

Architectural description of RAG combining retrieved structured/unstructured knowledge with generative models; factuality/reduction in hallucinations evaluated in offline generation quality assessments using human raters and automatic factuality metrics.

medium positive Personalized Content Selection in Marketing Using BERT and G... factuality scores, rate of hallucinated assertions in generated content

GPT-family decoders generate tailored marketing content (ad copy, email text, chat responses) that matches user context and tone more effectively than template-based generation.

System uses GPT conditioned on user context and product info; generation quality evaluated via human raters and automatic relevance/factuality metrics in offline evaluations. (No quantitative effect sizes reported.)

medium positive Personalized Content Selection in Marketing Using BERT and G... generation relevance, tone match, human-rated content quality, automatic relevan...

An integrated BERT–GPT pipeline augmented with retrieval-augmented generation (RAG) and reinforcement learning (RL) substantially outperforms conventional rule-based or template-driven marketing automation.

Comparative evaluations and case studies reported in the paper, including online A/B or multi-armed tests comparing the full pipeline vs baseline automation and measuring CTR, engagement, conversion rate, retention, and revenue per user. (Sample sizes and statistical details are not specified in the paper.)

medium positive Personalized Content Selection in Marketing Using BERT and G... click-through rate (CTR), engagement metrics, conversion rate, retention, revenu...

Continuous human-in-the-loop oversight, monitoring, and retraining are required to maintain quality and prevent model drift.

Practitioner reports and conceptual literature synthesized in the review advocating monitoring and retraining; no longitudinal empirical study provided here.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... model performance over time, incidence of drift, quality-control metrics

Transparent disclosure to customers about AI involvement helps preserve trust.

Conceptual analyses and referenced empirical/regulatory discussions in the literature aggregated by the review; this paper presents no new experimental evidence on disclosure effects.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... consumer trust/satisfaction as a function of disclosure of AI use

Hybrid designs that automate low-risk, high-volume tasks while routing complex, judgment-sensitive cases to humans produce the best operational outcomes.

Best practice inferred from aggregated empirical studies, industry examples, and conceptual reasoning; no controlled comparative trials are presented in this review.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... operational outcomes including cost, resolution quality, customer trust, and esc...

Agent augmentation via suggested responses, summarization, and information retrieval improves agent productivity.

Aggregated evidence from prior empirical research and practitioner reports cited in the review; no new measurements or sample sizes presented here.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... agent productivity metrics (e.g., response time, task throughput, resolution rat...

Generative AI enables personalization at scale through automated tailoring of messaging and recommendations.

Qualitative synthesis of empirical studies and industry reports showing automated personalization use-cases; no systematic effect-size estimates or new quantitative data in this review.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... degree of message personalization/recommendation relevance and scale (number of ...

Generative AI provides 24/7 availability and cost-effective scaling of routine interactions.

Industry case examples and prior empirical studies aggregated in the review; no original data or quantified sample sizes provided in this paper.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... availability (hours of operation), cost per interaction, throughput for routine ...

Generative AI can materially transform customer service and strategic communication by enabling continuous automation, scalable hyper-personalization, and effective agent augmentation.

Nano review: qualitative aggregation and synthesis of existing empirical studies, industry case examples, and conceptual analyses. No novel primary data or sample size; conclusion drawn from heterogeneous secondary sources and practitioner reports (not a systematic meta-analysis).

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... degree of automation, personalization scale, and agent productivity in customer ...

There is a need for standards around evaluation, bias mitigation, provenance, and accountability in AI-assisted ideation and design.

Policy recommendation motivated by documented biases, errors, and provenance issues in the reviewed studies; grounded in the synthesis's critique of existing practice.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... existence and adoption of evaluation/mitigation/provenance/accountability standa...

There will likely be complementarity-driven increases in demand for evaluative, integrative, and domain-expert roles (curators, synthesizers, implementation experts).

Inference from task-level studies and economic reasoning about complementarities between AI generative capability and human evaluative skills; empirical labor-market evidence is limited in the reviewed literature.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... employment demand for evaluative/integrative/domain-expert roles

Lower search and idea-generation costs enabled by LLMs may speed early-stage R&D and increase the gross flow of candidate innovations.

Theoretical economic interpretation supported by empirical findings of increased idea volumes in experimental/field studies summarized in the review; no long-run causal firm-level evidence presented.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... volume/rate of candidate ideas generated and pace of early-stage R&D activity

Generative AI accelerates early-stage hypothesis and prototype development by providing scaffolded prompts and procedural suggestions.

Applied case evidence and experimental studies summarized in the review showing reduced time or increased productivity in early-stage experimental/design tasks when using LLM assistance; no pooled effect size presented.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... time-to-hypothesis or prototype, number of prototype iterations in early-stage d...

Empirical studies document that AI-assisted tools can help break cognitive fixation and generate cross-domain analogies.

Cited experimental tasks and lab studies in the literature showing higher incidence of analogical or cross-domain suggestions from LLMs and improvements on fixation-related task metrics; heterogeneity across tasks and measures.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... frequency/quality of cross-domain analogies and fixation-related performance met...

Generative AI provides scaffolded, structured support that aids systematic hypothesis formation, prototyping steps, and decomposition of complex problems.

Review of design/ideation studies and applied case evidence where LLMs produced stepwise plans, decomposition prompts, or hypothesis scaffolds; evidence drawn from multiple short-term experimental and applied studies, sample sizes and exact designs vary by study.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... speed and/or quality of early-stage hypothesis generation and prototype developm...

Generative models rapidly produce many candidate ideas, analogies, and associative prompts that help overcome cognitive fixation.

Synthesis of experimental ideation and design studies reporting increases in number of ideas and examples of reduced fixation when participants used LLM outputs; heterogeneous sample sizes across cited studies (not reported in review).

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... idea quantity and measures of fixation (e.g., fixation errors, number of distinc...

Generative AI can raise per-worker productivity for tasks involving brainstorming, drafting, and prototyping, but realized gains depend on downstream filtering and implementation costs.

User studies showing higher output on specific tasks (brainstorming/drafting), combined with qualitative reports of filtering/implementation effort; many studies measure immediate task output but not net realized productivity after implementation.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... task output (ideas/drafts) per worker; downstream filtering effort; implemented ...

Generative AI can increase creative output in both lab and field tasks as judged by external raters.

Controlled experiments and field studies reporting higher judged creativity/novelty scores for AI-assisted outputs versus controls; judged creativity/novelty is typically assessed by human raters using rubric-based scoring.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... rated creativity/novelty scores; externally judged idea quality

AI assistance helps people overcome fixation and produces cross-domain analogies that they might not generate alone.

Experimental studies and qualitative analyses documenting reductions in fixation effects and increases in cross-domain analogical suggestions when participants use generative models.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... measures of fixation (e.g., repetition of prior solutions); count/quality of cro...

Generative AI supports systematic problem breakdown and early-stage prototyping, accelerating hypothesis generation and prototype development.

Field case studies of AI-supported prototyping and lab/user studies reporting reduced time-to-prototype and generated hypotheses; measures include time-to-prototype and user-reported usefulness.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... time-to-prototype; number/quality of generated hypotheses/prototypes; user-perce...

Generative AI boosts ideational fluency—the quantity and diversity of ideas produced in brainstorming tasks.

Controlled experiments and user studies measuring number and diversity of ideas with and without AI assistance; typical study designs compare participant idea counts/uniqueness across conditions (note: many studies use small or convenience samples).

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... number of ideas generated; diversity indices of ideas

When used as a 'cognitive co-pilot' that expands the solution space and challenges assumptions while humans curate and evaluate, generative AI generates economic value.

Inferred from experimental and field findings showing increased idea quantity/diversity and faster prototyping combined with qualitative studies showing human curation is needed; economic interpretation drawn from the review rather than direct macroeconomic measurement.

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... idea space breadth; time-to-prototype; downstream implemented/valued ideas (larg...

Generative AI serves a dual cognitive role: (1) a high-volume catalyst for divergent idea generation and cross-domain analogy-making, and (2) a structured assistant for deconstructing complex problems and scaffolding hypotheses and prototypes.

Synthesis of controlled experiments, lab studies, field case studies, and qualitative analyses summarized in the review; evidence includes measures of idea fluency/diversity, examples of analogy production, and observations of AI-assisted problem decomposition in prototyping tasks. (Note: underlying studies are heterogeneous and often short-term or based on convenience samples.)

medium positive ChatGPT as an Innovative Tool for Idea Generation and Proble... ideational fluency/diversity; incidence of cross-domain analogies; quality/speed...

Agent augmentation (drafting replies, summarizing histories, suggesting actions) raises frontline productivity and can improve response consistency.

Pilot deployments and internal A/B tests cited that measure time saved by agents and improvements in draft quality/consistency; mostly short-run and firm-specific reports.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... agent productivity (time per case saved), consistency of responses

Hyper-personalization at scale can increase relevance of responses and customer engagement when fed high-quality signals.

Case studies and pilot deployments that applied personalization signals (customer history, behavioral data) and reported improved relevance/engagement metrics; evidence conditional on availability and quality of signals and largely non-randomized.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... response relevance; customer engagement (clicks, session length, follow-up conta...

24/7 automation reduces routine handling time and operational costs for simple, repetitive queries.

Operational deployments and pilot studies reporting reduced handling times and cost-per-interaction for routine queries; some vendor-supplied before/after or A/B comparisons, but heterogeneous measurements and limited randomized evidence.

medium positive The Effectiveness of ChatGPT in Customer Service and Communi... routine handling time; operational cost per interaction

Perceptions—specifically trust and perceived accuracy—are central frictions in AI adoption within finance; interventions that raise perceived and demonstrable accuracy (e.g., explainability, transparent validation) will increase uptake and productivity gains.

Study finds correlations between perceptions and adoption/productivity proxies from questionnaire and performance data; authors combine these empirical associations with qualitative insights to recommend explainability/validation as interventions. Evidence is correlational and inferential (causal impact of interventions not estimated in summary).

medium positive Human-AI Synergy in Financial Decision-Making: Exploring Tru... AI uptake/adoption; productivity gains

Higher perceived accuracy of AI outputs is associated with increased perceived utility of AI for forecasting and risk-management tasks.

Survey items measuring perceived accuracy and perceived utility for specific tasks (forecasting, risk management) and quantitative association analysis; supported by interview excerpts illustrating task-specific utility; exact effect sizes and sample counts not provided in summary.

medium positive Human-AI Synergy in Financial Decision-Making: Exploring Tru... perceived utility for forecasting and risk-management tasks

Greater trust in AI correlates with greater willingness to adopt AI tools and to incorporate AI recommendations into decisions.

Correlational findings from structured questionnaires linking measures of trust with adoption intentions and self-reported incorporation of AI recommendations; supported by qualitative interview evidence; sample across multinational financial institutions (size not specified).

medium positive Human-AI Synergy in Financial Decision-Making: Exploring Tru... willingness to adopt AI tools; incorporation of AI recommendations into decision...

When trust and accuracy are high, human–AI collaboration improves organizational agility, enabling faster, data-driven strategic pivots and better risk management.

Quantitative analysis estimating relationships between perceived trust/accuracy and organizational agility indicators (speed of strategic pivots, risk-management metrics) augmented by interview accounts describing faster responses; sample: finance professionals across multinational financial institutions (sample size and exact agility metrics not specified).

medium positive Human-AI Synergy in Financial Decision-Making: Exploring Tru... organizational agility (speed of strategic pivots, risk management performance)

Perceived accuracy of AI-generated insights increases decision confidence and perceived utility for forecasting and risk management.

Quantitative questionnaire measures of perceived accuracy correlated with self-reported decision confidence and perceived utility for forecasting/risk management, with qualitative interviews used to explain mechanisms; sample: finance professionals across multinational financial institutions (sample size not specified).

medium positive Human-AI Synergy in Financial Decision-Making: Exploring Tru... decision confidence; perceived utility for forecasting and risk management

Perceived trust in AI tools is a key driver of finance professionals' willingness to use AI and their confidence in AI-assisted decisions.

Mixed-methods: quantitative analysis of structured questionnaires measuring perceived trust together with measures of willingness to use AI and decision confidence, supplemented by semi-structured interview evidence; sample described as finance professionals across multinational financial institutions (sample size not specified in summary).

medium positive Human-AI Synergy in Financial Decision-Making: Exploring Tru... willingness to use AI tools; confidence in AI-assisted decision-making

The Adaptive Agent Routing and Coordination (AARC) module performs intent recognition with confidence scoring, triggers proactive clarification dialogues on low confidence, and provides a planning feedback loop to refine plans during execution.

System design description: AARC includes intent classifier confidence thresholds, clarification dialogue behavior, and a feedback loop. Its role is supported by routing/coordination performance improvements and ablation experiments, but the summary lacks quantitative measures of clarification frequency or confidence calibration.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Agent Routing Success Rate; frequency and effectiveness of clarifications; plan ...
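The confidence-gated behavior described for AARC can be sketched as below. The summary gives no thresholds or interfaces, so the function `route_intent`, the `RoutingDecision` type, the default threshold of 0.7, and the intent names are hypothetical choices for illustration, not the paper's implementation.

```python
from dataclasses import dataclass


@dataclass
class RoutingDecision:
    action: str        # "route" to an agent, or "clarify" with the user
    intent: str        # highest-scoring intent label
    confidence: float  # classifier confidence for that intent


def route_intent(scores, threshold=0.7):
    """Pick the top intent; ask for clarification when confidence is low.

    `scores` maps intent labels to classifier confidences. The threshold
    value is an assumed placeholder, not a figure from the paper.
    """
    if not scores:
        raise ValueError("scores must contain at least one intent")
    intent, confidence = max(scores.items(), key=lambda kv: kv[1])
    action = "route" if confidence >= threshold else "clarify"
    return RoutingDecision(action, intent, confidence)


# Usage: a confident prediction is routed, an ambiguous one triggers a question.
confident = route_intent({"tidy_kitchen": 0.9, "fetch_item": 0.1})
ambiguous = route_intent({"tidy_kitchen": 0.55, "fetch_item": 0.45})
```

The planning feedback loop mentioned in the claim is not modeled here; the sketch covers only the step where low confidence diverts the request into a clarification dialogue.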

The Multi-Modal Contextual Memory (MMCM) stores multi-modal (visual, linguistic, temporal) contextual memory units in a relational graph and uses an advanced retrieval mechanism with temporal decay weighting to support multi-hop reasoning.

System design and implementation description: MMCM encodes modality, timestamp, and relational links; retrieval uses similarity plus temporal decay. Its effectiveness for multi-hop QA is supported by the reported improvement in Knowledge Base Response Validity and ablation results, though quantitative retrieval performance metrics are not provided in the summary.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Multi-hop question-answering validity (Knowledge Base Response Total Validity); ...
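Retrieval that combines similarity with temporal decay, as described for MMCM, might look like the sketch below. The summary does not say how the two terms are combined, so multiplying cosine similarity by an exponential half-life decay is an assumption, as are the names `MemoryUnit`, `decayed_score`, and `retrieve` and the one-hour default half-life. The relational graph and multi-hop traversal are omitted.

```python
import math
from dataclasses import dataclass


@dataclass
class MemoryUnit:
    text: str
    embedding: list[float]
    timestamp: float  # seconds since an arbitrary epoch


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def decayed_score(unit, query, now, half_life):
    """Similarity weighted by a decay that halves every `half_life` seconds."""
    age = max(0.0, now - unit.timestamp)
    decay = 0.5 ** (age / half_life)
    return cosine(unit.embedding, query) * decay


def retrieve(memory, query, now, half_life=3600.0, k=3):
    """Return the k memory units with the highest decayed score."""
    ranked = sorted(
        memory,
        key=lambda unit: decayed_score(unit, query, now, half_life),
        reverse=True,
    )
    return ranked[:k]


# Usage: two equally similar memories; the more recent one ranks first.
memory = [
    MemoryUnit("keys on shelf", [1.0, 0.0], timestamp=0.0),
    MemoryUnit("keys on table", [1.0, 0.0], timestamp=3600.0),
    MemoryUnit("cup in sink", [0.0, 1.0], timestamp=7200.0),
]
top = retrieve(memory, query=[1.0, 0.0], now=7200.0, k=2)
```

With this weighting, an older observation is not discarded but needs a stronger similarity match to outrank a recent one, which is one plausible reading of "temporal decay weighting".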

The Semantic-Enhanced Task Planning (SETP) module enriches LLM-generated plans with object-relationship graphs, hierarchical task decomposition, and implicit physical/affordance constraints to improve plan plausibility.

System design description: SETP augments LLM plans with semantic object graphs and hierarchy enforcement. Its contribution is supported indirectly by ablation results showing performance drop when SETP is removed; direct quantitative attribution to specific SETP mechanisms not detailed in the summary.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Plan plausibility/validity and Task Planning Accuracy

An ablation study shows that removing any of the three core modules (SETP, MMCM, AARC) degrades CRAEA's performance; each module contributes meaningfully to overall gains.

Ablation experiments reported in the paper where SETP, MMCM, and AARC were each removed in turn and performance degradation was observed across metrics. The summary describes the qualitative outcome but omits numerical ablation results and sample sizes.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Change in performance metrics (Task Planning Accuracy, KB Response Validity, Rou...

Human evaluators rate CRAEA higher on perceived coherence, naturalness, and user satisfaction compared to baselines.

Subjective human evaluation studies reported in the paper—comparative ratings on coherence, naturalness, and satisfaction. The summary does not specify number of human raters, rating scales, or statistical significance.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Human subjective ratings: coherence, naturalness, user satisfaction

CRAEA improves Agent Routing and Coordination success relative to baseline agents.

Objective metric 'Agent Routing Success Rate' measured in simulation; CRAEA is compared to baseline LLM-driven agents (e.g., memoryless or statically routed controllers), with higher routing success reported. Exact task counts and effect sizes not included in the summary.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Agent Routing Success Rate

CRAEA yields higher Knowledge Base Response Total Validity (improved multi-hop question answering from memory) than baselines.

Simulated multi-hop QA evaluations using the system's memory; comparisons to baseline agents show improved 'Knowledge Base Response Total Validity'. Experimental details (number of QA items, statistical tests) not provided in the summary.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Knowledge Base Response Total Validity (multi-hop QA accuracy/validity)

CRAEA outperforms baseline LLM-driven embodied agents on Task Planning Accuracy in simulated household tidying tasks.

Objective metric 'Task Planning Accuracy' measured in simulation and compared against baseline LLM-driven agents lacking one or more CRAEA components. The summary reports consistent improvements but does not provide sample size or effect magnitude.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Task Planning Accuracy

CRAEA substantially improves home-robot performance on long-horizon, high-level natural language instructions by combining semantic task planning, multi-modal contextual memory, and adaptive routing/coordination.

Experimental evaluation in a simulated household tidying environment comparing CRAEA to baseline LLM-driven embodied agents; reported consistent improvements across multiple objective metrics (Task Planning Accuracy, Knowledge Base Response Validity, Agent Routing Success Rate). Specific task counts, effect sizes, and statistical details not provided in the summary.

medium positive Context-Rich Adaptive Embodied Agents: Enhancing LLM-Powered... Overall home-robot performance on long-horizon, high-level NL instructions (aggr...

With appropriate policies and ecosystem building, AI offers strategic opportunities for 'leapfrogging' in service delivery (for example, healthcare diagnostics and precision agriculture) that can raise productivity and welfare.

Synthesis of case studies and prior empirical work showing promising AI applications; the assertion remains inferential and the paper calls for pilots and empirical validation.

medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... service delivery performance (diagnostic rates, agricultural yields), productivi...

Investing in human capital—technical skills, digital literacy, and institutional capacity—is critical for African actors to capture value from AI and to design culturally aligned systems.

Policy and academic literature synthesis linking human capital investment to technology adoption and innovation; no primary training program evaluation in the paper.

medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... number of trained AI professionals, digital literacy rates, local innovation out...

Context‑sensitive interventions—stronger governance, capacity building, multi‑stakeholder collaboration, and locally tailored strategies—are necessary to steer AI toward inclusive outcomes in Africa.

Policy and literature synthesis recommending interventions; recommendations are normative and inferential without empirical pilots in this paper.

medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... local capacity metrics (skills, institutions), stakeholder participation rates, ...

AI adoption in Africa is already transforming multiple sectors (healthcare, finance, agriculture, education, industry, governance) and has the potential to improve productivity, service delivery, and decision-making.

Desk-based literature synthesis of prior empirical studies, policy reports and case studies; no primary data or field experiments reported in this paper.

medium positive Towards Responsible Artificial Intelligence Adoption: Emergi... sectoral productivity, service delivery quality, decision-making accuracy (e.g.,...

Policy measures are needed to support reskilling, algorithmic accountability, data governance standards, and protections against discriminatory automated decisions to ensure equitable benefits from data-driven HRM adoption.

Policy implications section of the review synthesizing concerns and recommendations from the included literature.

medium positive Data-Driven Strategies in Human Resource Management: The Rol... policy interventions (reskilling programs, accountability frameworks), equity of...

Richer firm-level HR data resulting from data-driven HRM enables economists to better identify causal effects of workforce policies and technology adoption.

Methodological implication stated in the review: improved measurement and data availability noted across included studies as aiding empirical identification.

medium positive Data-Driven Strategies in Human Resource Management: The Rol... quality of empirical identification, availability of firm-level HR data
