Evidence (6574 claims)
Adoption
8625 claims
Productivity
7686 claims
Governance
6917 claims
Human-AI Collaboration
6574 claims
Org Design
4189 claims
Innovation
4131 claims
Labor Markets
3588 claims
Skills & Training
2985 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 761 | 200 | 101 | 904 | 2020 |
| Governance & Regulation | 829 | 400 | 191 | 122 | 1566 |
| Organizational Efficiency | 784 | 193 | 125 | 84 | 1197 |
| Technology Adoption Rate | 637 | 236 | 124 | 97 | 1103 |
| Research Productivity | 431 | 131 | 58 | 340 | 972 |
| Output Quality | 481 | 183 | 59 | 47 | 770 |
| Decision Quality | 332 | 177 | 82 | 49 | 647 |
| Firm Productivity | 439 | 57 | 88 | 20 | 610 |
| AI Safety & Ethics | 218 | 279 | 66 | 33 | 602 |
| Market Structure | 181 | 170 | 123 | 24 | 503 |
| Task Allocation | 214 | 64 | 72 | 33 | 388 |
| Skill Acquisition | 174 | 62 | 62 | 17 | 315 |
| Innovation Output | 204 | 27 | 45 | 18 | 295 |
| Employment Level | 105 | 54 | 108 | 13 | 282 |
| Fiscal & Macroeconomic | 132 | 69 | 43 | 26 | 277 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 154 | 48 | 26 | 3 | 231 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 123 | 50 | 6 | 223 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 71 | 92 | 10 | 2 | 175 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 58 | 56 | 26 | 13 | 156 |
| Training Effectiveness | 96 | 21 | 14 | 19 | 152 |
| Wages & Compensation | 77 | 37 | 25 | 6 | 145 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 81 | 21 | 1 | 115 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 32 | 20 | 8 | 3 | 64 |
| Skill Obsolescence | 5 | 47 | 6 | 1 | 59 |
| Social Protection | 28 | 16 | 8 | 2 | 54 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Human Ai Collab
Remove filter
To support sustainable human–AI collaboration, the authors emphasize adopting a human-centered approach that prioritizes transparency, explainability, and user autonomy.
Authors' policy/research/practice recommendation grounded in the review synthesis of the interdisciplinary literature.
Well-designed AI systems have the potential to increase cognitive efficiency and job satisfaction.
Synthesis of findings across reviewed studies indicating positive associations between human-centered AI design and outcomes like cognitive efficiency and job satisfaction.
The successful integration of AI-driven EPM systems relies on the synergy between AI technologies and human judgment, allowing healthcare organizations to cultivate a more dynamic, innovative and responsive workforce.
Normative/concluding statement in the scoping review based on synthesis of included studies (n=29).
AI-driven EPM systems mark a significant advance in accessing real-time performance data and provide considerable progression when utilized within appropriate guidelines.
Conclusion drawn in the paper from the scoping review of 29 empirical studies; phrased as an overall assessment.
Predictive analytics help manage high rates of burnout.
Reported in the scoping review as a benefit across included studies (n=29).
Predictive analytics optimize operations.
Stated as an operational benefit in the scoping review (29 studies).
Predictive analytics assist in assessing labor shortages.
Reported use-case in the scoping review synthesizing empirical studies (n=29).
Predictive analytics are vital in orchestrating healthcare organizations’ strategic and operational activities.
Claim derived from the scoping review's conclusions across included studies (n=29).
AI-powered EPM produces significant time savings for managers.
Reported as a benefit in the scoping review synthesis (29 studies); no numerical magnitude given in the excerpt.
AI-powered EPM helps identify potential leaders.
Summarized outcome across empirical studies in the scoping review (n=29).
AI-powered EPM heightens employee engagement.
Reported as an aggregated finding in the scoping review of 29 empirical studies.
AI-powered EPM increases the frequency of feedback to employees.
Stated as a benefit in the scoping review synthesis across included studies (n=29).
AI-powered EPM platforms result in considerable improvements in efficiency, including increased frequent feedback, heightened employee engagement, identification of potential leaders and significant time savings for managers.
Synthesis claim from the scoping review of 29 empirical studies; no quantitative effects reported in the excerpt.
The delivery of high-quality healthcare depends essentially on the effective functioning of personnel, who are the vital resource for maintaining reputation, fostering a culture of continuous improvement, and ensuring the overall effective operation of the healthcare sector.
Conceptual assertion in the paper supported by literature synthesis in the scoping review (29 studies).
Modern methodological assessment emphasizes the importance of recording individual contribution in various areas, assessing not only the fulfillment and quality of assignments, but also aspects such as collaboration, creativity, innovative behavior and professional growth.
Descriptive conclusion from the scoping review synthesizing themes across 29 empirical studies (2020–2025).
Employee Performance Management (EPM) systems are undergoing a pivotal shift from annual manual data collection ... into more agile human research operations.
Claim summarized from the scoping review of 29 empirical studies (PRISMA-ScR adherence stated).
Establishing this prospective forecasting infrastructure is a critical technical requirement for managing the current global workforce realignment around AI.
Argumentative claim made by the authors in the paper's conclusion/positioning; presented as a normative recommendation rather than an empirically demonstrated necessity.
The article details the computational architecture required to construct this simulation platform and defines the privacy, accuracy, and representativeness safeguards necessary for responsible deployment.
Statement of the paper's content and contributions (architectural description and discussion of safeguards); this is a claim about what the paper contains rather than an empirical finding.
Among consenting populations, these agents can be seeded with HR records, validated psychometric measures, and digital activity data to simulate employees' cognitive, emotional, and behavioral trajectories across successive workdays during planned organizational changes.
Proposal/specification in the paper describing how the simulation would be constructed and what inputs it could use; no empirical evaluation or results reported in the excerpt.
We combine recent advances in LLM-powered generative agents with foundational management science and organizational behavior research to propose dynamic employee agents.
Descriptive/methodological claim about the paper's proposed approach; represents a design/proposal rather than empirical validation.
The integration of artificial intelligence into knowledge work currently affects a substantial share of the global workforce.
Claim presented in the paper as background/context; no supporting empirical sample, statistics, or citations provided in the excerpt.
The activity gives students direct exposure to a powerful tool while asking them to specify what a trustworthy answer would require.
Description of the classroom activity in the paper (students construct tasks, review peers' tasks for ambiguity, and evaluate systems), supported by qualitative reflections.
Reflections from five student contributors suggest that benchmark construction can help students see professional knowledge not only as content AI may retrieve, but as the basis for judging AI outputs.
Qualitative reflections reported from five student contributors (n=5) included in the paper, used as evidence for educational impact.
Across thirteen evaluated systems, the best-performing system, GPT-5.5, reaches a 57.58% pass rate.
Empirical evaluation results reported in the paper naming GPT-5.5 as best performer with a 57.58% pass rate on QuestBench.
The dataset is available at https://huggingface.co/datasets/PKUAIWeb/QuestBench/tree/main.
URL provided in the paper pointing to the hosted dataset on Hugging Face.
The produced benchmark, QuestBench, consists of 256 questions across 14 humanities and social-science domains.
Statement in the paper specifying dataset composition: 256 questions and 14 domains; dataset artifact referenced and released.
We introduce a course-based practice that teaches AI through benchmark construction, using deep research systems as a concrete example of AI-era knowledge work.
Description of course design and pedagogical practice in the paper (course activity where students construct benchmarks and evaluate systems). No numerical sample size for the course cohort reported in the excerpt.
Live-agent performance depends on objective tracking, execution conversion, cost, and runtime reliability, supporting evaluation of LLMs as components in bounded workflows rather than as isolated benchmark respondents.
Synthesis of experimental results (cross-provider differences in end-to-end play, planner bakeoff, and trace analyses) that link specific mechanisms (objective tracking, execution conversion, cost, runtime reliability) to performance.
In a replicated 32-game cross-provider championship under frozen rules, gemini-3.1-pro-preview won 20 of 32 games against gpt-5.1, claude-opus-4-7, and kimi-k2.6, and the pooled winner distribution differs strongly from an equal-strength null (p approx 1.5 x 10^-5).
Empirical tournament experiment: 32 games played under frozen rules across four provider models; reported win counts and a statistical test vs an equal-strength null yielding p ≈ 1.5×10^-5.
Reskilling policy should emphasize portfolio breadth and portable competency frameworks rather than deeper single-track specialization, particularly for workers in small, lower-threshold firms.
Policy recommendation in abstract based on empirical findings about skill-demand shifts and heterogeneity across firm types.
Augmentation exposure is positively associated with the nonroutine analytical skill share.
Empirical result stated in abstract: positive association between augmentation exposure and nonroutine analytical share, using the authors' augmentation measure and within-firm identification.
Focusing on observation instead of prediction, and governance rather than control, complements existing alignment and safety practices while preserving human judgment, institutional choice, and long-term wellbeing.
Normative argument presented in the paper linking observational monitoring to governance objectives; no empirical evaluation provided.
Interpretable, aggregate behavioral signals (as described) support human-in-the-loop interpretation and enable earlier awareness of when AI use patterns may be drifting from creative augmentation toward automation pressure, authority substitution, or unintended displacement of human agency.
Conceptual claim about intended use of monitoring signals; no empirical test or sample presented.
A system-level framework for externalized behavioral monitoring should treat generative AI systems as participants in socio-technical ecosystems rather than static tools, emphasizing interpretable, aggregate behavioral signals such as shifts in output velocity, semantic and structural reuse, persistence of synthetic roles, and cross-context propagation.
Proposed conceptual framework and list of candidate behavioral signals in the paper (design/specification, no empirical validation).
Post-deployment observability is a foundation for well-being-aligned human–AI co-evolution.
Conceptual argument and system-level framework presented in the paper (no empirical study or sample reported).
The findings carry significant implications for entrepreneurs, policymakers, and educators seeking to leverage AI as a driver of inclusive and sustainable entrepreneurial success in urban India.
Authors' stated implications in the discussion and conclusion sections, derived from thematic findings across the 16 interviews.
An entrepreneur's mindset—specifically cognitive openness, risk tolerance, and iterative experimentation—is the strongest predictor of successful AI adoption outcomes, superseding firm size, sector, and financial capacity.
Cross-cutting finding from thematic analysis of the 16 interview transcripts indicating recurring emphasis on mindset attributes as drivers of successful adoption; comparative qualitative assessment across interviewees suggested these factors mattered more than firm size, sector, or finances.
Overall, AI adoption produces measurable benefits in operational efficiency, strategic decision-making, and customer personalisation among the entrepreneurs studied.
Synthesis of interview findings/themes from the 16-case qualitative study; authors state AI adoption 'produces measurable benefits' across these domains based on participant reports.
AI acts as a competitive equaliser among entrepreneurs in Delhi/NCR.
Theme 'AI as a Competitive Equaliser' produced by thematic analysis of the 16 interviews; participants reported that AI lowered barriers and allowed smaller firms to compete more effectively.
AI adoption transforms customer experience by enabling greater personalisation.
Theme 'Customer Experience Transformation' from thematic analysis of interviews (n=16); entrepreneurs described AI-driven personalisation and improved customer interactions.
AI adoption improves strategic decision-making and market intelligence among entrepreneurs.
One of five thematic findings ('AI-Enabled Decision Making and Market Intelligence') derived from thematic analysis of 16 interviews; participants reported using AI for market insights and better decisions.
AI functions as an operational accelerator for entrepreneurs, producing benefits in operational efficiency.
Thematic analysis of interview data (n=16) generated a theme labelled 'AI as an Operational Accelerator' reporting interviewee accounts of operational efficiency gains.
This study integrates observed GenAI uses into a coherent, processual view of growth hacking by developing first-order concepts, second-order themes and three aggregate dimensions mapped onto a seven-stage growth pipeline.
Methodological claim supported by the study's adopted approach: Gioia methodology applied to 17 semi-structured interviews with founders/growth leaders (nine startups), plus secondary sources.
Generative AI reallocates human attention from asset production to problem framing, inference quality and organizational learning across the seven stages of the growth pipeline.
Interview-derived themes (17 interviews across nine startups) and process mapping of GenAI uses onto the seven-stage growth pipeline.
Generative AI acts as a data orchestrator that automates cleaning, cohorting, variance checks and knowledge capture, tightening feedback loops and institutionalizing learning.
Findings derived from 17 semi-structured interviews with founders and growth leaders across nine startups, supported by secondary sources and Gioia-style thematic analysis.
Generative AI serves as a cognitive sparring partner that reduces bounded rationality and groupthink via premortems, counter-arguments and stakeholder role-plays while preserving human judgment.
Same qualitative data set of 17 interviews across nine startups, with Gioia-method coding producing first-order concepts and themes describing AI-mediated decision practices.
Generative AI functions as an experimentation accelerator, lowering the marginal cost of variation and compressing the idea-to-test cycle, enabling parallel selections of controlled tests.
Exploratory multiple-case qualitative study using 17 semi-structured interviews with founders and growth leaders across nine startups, plus secondary sources; analysis via the Gioia methodology to derive themes mapped onto a seven-stage growth pipeline.
Findings extend digital transformation theory by showing that GenAI moves organizing from human-driven adaptation toward technology-embedded reconfiguration.
Authors' theoretical interpretation linking empirical findings from 17 interviews to broader digital transformation theory.
The paper conceptualizes 'AI-augmented orchestration', where human and algorithmic actors jointly configure work and value creation.
Theoretical contribution / conceptualization derived from analysis of interview data and authors' synthesis.
The study links GenAI-driven organizational changes to four value dimensions: operational, structural, innovation, and market value.
Authors' analytical framework developed from interview data (17 interviews) mapping changes to four value dimensions.