The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲

Evidence (13870 claims)

Adoption
8467 claims
Productivity
7558 claims
Governance
6805 claims
Human-AI Collaboration
6363 claims
Org Design
4132 claims
Innovation
4065 claims
Labor Markets
3526 claims
Skills & Training
2945 claims
Inequality
2066 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 749 196 98 892 1984
Governance & Regulation 817 394 188 121 1544
Organizational Efficiency 771 189 124 83 1177
Technology Adoption Rate 627 233 123 96 1088
Research Productivity 411 123 56 332 933
Output Quality 467 178 59 47 751
Decision Quality 320 174 75 42 618
Firm Productivity 435 55 88 20 604
AI Safety & Ethics 214 276 65 33 593
Market Structure 178 167 122 24 496
Task Allocation 207 64 71 32 379
Skill Acquisition 165 59 60 17 301
Innovation Output 203 27 43 18 292
Employment Level 105 52 107 13 279
Fiscal & Macroeconomic 131 69 43 26 276
Consumer Welfare 116 63 42 11 232
Firm Revenue 150 48 26 3 227
Inequality Measures 44 122 49 6 221
Task Completion Time 169 29 8 12 219
Worker Satisfaction 89 63 20 12 184
Error Rate 69 92 10 2 173
Regulatory Compliance 76 68 14 5 163
Training Effectiveness 93 21 13 19 148
Wages & Compensation 77 36 25 6 144
Automation Exposure 51 54 22 12 142
Team Performance 86 17 27 9 140
Developer Productivity 94 17 14 6 132
Job Displacement 12 80 20 1 113
Hiring & Recruitment 51 7 8 3 69
Creative Output 31 17 7 3 59
Skill Obsolescence 5 46 6 1 58
Social Protection 27 16 8 2 53
Labor Share of Income 17 17 17 51
Worker Turnover 11 12 3 26
Industry 1 1
We recast hospital mechanism design as program synthesis for language models: typed, inspectable rule programs are executed and scored by Medi-Sim, a multi-agent simulator with five strategic provider channels (coding, selection, delay, effort, triage).
Implementation and description in the paper: development of Medi-Sim simulator with five provider channels and execution/scoring of typed rule programs.
high positive Healthcare Mechanisms from Policy-as-Code Search under Strat... existence and design of Medi-Sim simulator (methodological/tool contribution)
The article proposes a Strategic Action Framework to support more inclusive and context-responsive AI ecosystems.
Policy recommendation/framework presented by the authors as a conclusion; not empirically evaluated within the study.
high positive Compressed professionalization in informal economies: a soci... Strategic Action Framework (policy intervention)
Empirical observations show that youth mobilize AI tools for translation, content creation, customer engagement, and micro-entrepreneurial activities, enabling partial and situational approximation of selected formal-sector practices.
Qualitative interview data from the 125 semi-structured interviews in three DRC cities, used as illustrative grounding for observed uses of AI by youth.
high positive Compressed professionalization in informal economies: a soci... use of AI for translation, content creation, customer engagement, and micro-entr...
By bridging established knowledge with emerging governance challenges, this study advances a more comprehensive understanding of platform governance and outlines future research avenues related to technological change, dynamic capabilities, and ecosystem perception.
Authors' stated contribution based on their integrative framework and literature synthesis of 644 publications.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... advancement of understanding of platform governance and identification of future...
The paper proposes a research agenda that examines how emerging technologies, including algorithmic governance, generative AI, and agentic systems, are reshaping governance practices.
Paper's concluding/prospective section proposing future research directions; conceptual proposal rather than empirical test.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... proposed future research topics concerning the impact of emerging technologies o...
The identified governance mechanisms foster innovation in platform ecosystems.
Claim based on the paper's integrative synthesis of 644 publications indicating governance's role in fostering innovation.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... innovation outcomes in platform ecosystems associated with governance mechanisms
The identified governance mechanisms ensure quality in platform ecosystems.
Argument and synthesis from the systematic literature review of 644 publications as presented in the paper's framework.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... quality assurance in platform ecosystems via governance mechanisms
The identified governance mechanisms (incentives, control, boundary resources) enable platform owners to coordinate value creation.
Argument based on the integrative framework derived from the systematic literature review (644 publications).
high positive Mission: Orchestration – Governance Mechanisms And Future Re... coordination of value creation by platform owners using governance mechanisms
There are three core types of governance mechanisms that enable platform owners to coordinate value creation, ensure quality, and foster innovation: incentives, control, and boundary resources.
Synthesis and classification resulting from the systematic literature review of 644 publications, producing an integrative framework that identifies the three mechanism types.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... identification of three governance mechanism types (incentives, control, boundar...
This study conducts a systematic literature review of 644 publications to synthesize the governance landscape and develop an integrative framework.
Methodological statement from the paper reporting the authors performed a systematic literature review analyzing 644 publications.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... number of publications reviewed and use of SLR to develop framework
Platform owners orchestrate complementor participation through governance mechanisms.
Synthesis and conceptual argument based on the systematic literature review of 644 publications.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... use of governance mechanisms by platform owners to orchestrate participation
Digital platform ecosystems rely on loosely coupled complementors to jointly create value with platform owners.
Synthesis of prior literature via the paper's systematic literature review (644 publications); conceptual framing in the literature on platform ecosystems.
high positive Mission: Orchestration – Governance Mechanisms And Future Re... reliance of platform ecosystems on loosely coupled complementors for joint value...
Work Flexibility is the strongest predictor of Employee Productivity (β = 0.562, p < 0.001), indicating flexible working conditions play an important role in improving employee performance and work efficiency.
Reported quantitative result from the study using PLS-SEM; β and p-value provided in the paper indicating the largest standardized effect among predictors. Sample size not reported in the excerpt.
Human-Centric AI Adoption has a positive and statistically significant effect on Employee Productivity (β = 0.263, p = 0.028).
Reported quantitative result from the study using Partial Least Squares Structural Equation Modeling (PLS-SEM); β and p-value provided in the paper. Sample size not reported in the excerpt.
The productivity-enhancing effect of fintech is stronger in regions with higher levels of economic development.
Heterogeneity/subsample analysis reported for regional economic development levels using the sample of Chinese A-share listed manufacturing firms (2015–2023); paper states fintech's effect on TFP is more pronounced in more economically developed regions (no subgroup sample sizes or quantitative estimates provided in the excerpt).
high positive Research on the Impact of Financial Technology on the Total ... corporate total factor productivity (heterogeneous effect by regional developmen...
The productivity-enhancing effect of fintech is more pronounced in high-tech industries.
Heterogeneity/subsample analysis in the paper using the sample of Chinese A-share listed manufacturing firms (2015–2023); paper reports stronger fintech–TFP effects in high-tech industry subsample (no subgroup sample sizes or coefficients provided in the excerpt).
high positive Research on the Impact of Financial Technology on the Total ... corporate total factor productivity (heterogeneous effect by industry tech-inten...
The positive effect of fintech on corporate total factor productivity operates primarily through the channels of supply chain finance and innovation effects.
Mediation/ mechanism analysis reported in the study using the same sample of Chinese A-share listed manufacturing firms (2015–2023); paper states supply chain finance and innovation as the primary channels (specific mediation estimates not provided in the excerpt).
high positive Research on the Impact of Financial Technology on the Total ... corporate total factor productivity (through supply chain finance and innovation...
Fintech development can significantly enhance corporate total factor productivity for Chinese A-share listed manufacturing firms.
Empirical analysis on a sample of Chinese A-share listed manufacturing enterprises covering 2015–2023; result described as statistically significant in the paper (specific estimation methods and sample size not provided in the excerpt).
high positive Research on the Impact of Financial Technology on the Total ... corporate total factor productivity
Artificial intelligence (AI) increasingly participates in strategic decision-making, challenging leadership theories that assume human agency at the top of organizations.
Concept-centric literature review integrating management and information systems (IS) research; theoretical synthesis of prior empirical and conceptual studies (no primary empirical sample reported).
high positive Hybrid Upper Echelons: A Theorizing Review On Ai In Executiv... participation of AI in strategic decision-making
Dijital platformlar insan deneyimini veriye dönüştürerek ekonomik değere tahvil eden yeni bir rejim (gözetim kapitalizmi) kurmuştur.
Teorik ve kavramsal analiz; çalışma Zuboff'un gözetim kapitalizmi yaklaşımına atıf yapmaktadır. No empirical sample or quantitative evidence reported.
high positive GÖZETİM KAPİTALİZMİNİN HUKUKSAL TEMELLERİ: FOUCAULTCU BİR AN... dijital platformların insan deneyimini veriye dönüştürme ve bunun ekonomik değer...
The findings suggest that twin-based market research is no longer gated by data design, but by item volume, model selection, and a small set of construction-level decisions.
Interpretive conclusion based on empirical results across the construction-method grid and performance patterns (discussion/implication in paper).
high positive Synthetic Personalities: How Well Can LLMs Mimic Individual ... primary constraints on successful twin-based market research (data design vs. it...
Best-cell Fisher-z rank-order correlation reaches r = 0.590 on the SOEP held-out evaluation set.
Reported best-performing cell Fisher-z (or Fisher-transformed correlation) from held-out evaluation on SOEP.
high positive Synthetic Personalities: How Well Can LLMs Mimic Individual ... rank-order correlation (Fisher-z) of twin responses vs. held-out answers
Best-cell accuracy reaches 78.8% on the SOEP held-out evaluation set.
Reported best-performing cell accuracy from held-out evaluation on SOEP.
high positive Synthetic Personalities: How Well Can LLMs Mimic Individual ... hold-out accuracy (best-performing cell)
Switching the embedding from a narrative persona summary to a raw dialog history of past responses raises hold-out accuracy in every model-by-reasoning cell at the 100 percent depth.
Empirical comparison between two embedding methods at 100% information depth across all model-by-reasoning cells (reported in results).
high positive Synthetic Personalities: How Well Can LLMs Mimic Individual ... hold-out accuracy as a function of embedding method (narrative persona summary v...
Twin quality rises with information depth but with diminishing returns past the 75 percent entropy quartile, which acts as a cost-efficient Pareto point relative to the best-performing 100 percent cells.
Empirical evaluation across information-depth conditions, comparing hold-out performance by normalized Shannon entropy quartiles (reported in results).
high positive Synthetic Personalities: How Well Can LLMs Mimic Individual ... twin quality (hold-out performance / accuracy) as a function of information dept...
The field can be organized around an integrated decision-system framework consisting of five connected constructs—delegation frontier, reliance wedge, decision-useful XAI, meaningful oversight, and reflexive AI loop—to support cumulative research on investment, trading, credit, asset management, risk, compliance, and financial regulation.
Proposal of a conceptual framework grounded in the paper’s integrative literature review (no empirical validation or sample size reported in the abstract).
high positive Human–AI hybrid finance: from AI tools to decision systems utility of the proposed decision-system framework for structuring future researc...
The review integrates evidence on methods, data, scenarios, explainability, trust, governance, financial large language models (FinLLMs), and agentic finance.
Descriptive claim about the scope of this paper’s literature synthesis (the review itself; content-based rather than empirical).
high positive Human–AI hybrid finance: from AI tools to decision systems breadth of topics integrated in the review
The central question is moving from model performance to decision architecture: how authority, oversight, and accountability should be allocated across financial workflows.
Argument based on synthesis of prior literature across relevant fields (conceptual review; no single empirical study or sample size reported).
high positive Human–AI hybrid finance: from AI tools to decision systems allocation of authority, oversight, and accountability in financial decision wor...
AI is moving from a predictive tool to a component of human–AI hybrid financial decision systems.
Integrative conceptual literature review synthesizing work across finance, management, human–computer interaction (HCI), and AI (no primary empirical sample reported).
high positive Human–AI hybrid finance: from AI tools to decision systems role of AI within financial decision workflows (predictive tool vs. integrated d...
The benchmark is publicly available at: https://github.com/ant-research/meta-agent-challenge.
Statement of public release and URL provided in the paper.
high positive The Meta-Agent Challenge: Are Current Agents Capable of Auto... public availability / open-source access
MAC provides a rigorous, open-source benchmark for autonomous AI research and development and offers an empirical proxy for evaluating recursive self-improvement.
Claim about the utility and intended purpose of the released benchmark; supported by the benchmark's design and experiments described in the paper.
high positive The Meta-Agent Challenge: Are Current Agents Capable of Auto... empirical evaluation of recursive self-improvement
The few meta-agents that do match human-engineered baselines are dominated by proprietary frontier models.
Experimental observations reported in the paper indicating that successful meta-agents rely on proprietary frontier models; details (counts, model names) not provided in abstract.
high positive The Meta-Agent Challenge: Are Current Agents Capable of Auto... composition of successful meta-agents (proprietary vs non-proprietary)
To ensure evaluation integrity, the framework is secured by multi-layer defenses against reward hacking.
Methodological claim in paper about security measures implemented in the benchmark.
high positive The Meta-Agent Challenge: Are Current Agents Capable of Auto... resistance to reward hacking / evaluation integrity
In MAC a code agent (the meta-agent) is given a sandboxed environment, an evaluation API, and a time limitation to iteratively program an agent artifact that maximizes performance on a held-out test set across five domains.
Method description of the benchmark setup; specification includes 'held-out test set across five domains'.
high positive The Meta-Agent Challenge: Are Current Agents Capable of Auto... performance on a held-out test set
We introduce the Meta-Agent Challenge (MAC), an evaluation framework designed to test the capacity of frontier models for autonomous agent development.
Paper contribution: description of a new evaluation framework (methodological introduction).
high positive The Meta-Agent Challenge: Are Current Agents Capable of Auto... capacity of models to develop autonomous agents
Policy recommendations: improve digital infrastructure in less-developed areas, support digital upskilling, and strengthen regulations to ensure inclusive and equitable digital development.
Policy conclusions and recommendations derived from the study's empirical results (panel analysis of 31 provinces, 2011–2021) and discussion of distributional implications; these are prescriptive rather than causal findings.
high positive The Impact of the Digital Economy on Income Distribution: Ev... policy actions to promote inclusive digital development
The income-increasing effect of the digital economy operates primarily through wage growth.
Mechanism analysis reported in the paper based on the same two-way fixed effects panel (31 provinces, 2011–2021) that decomposes channels and finds wages as the main mediator.
Digital economic development significantly increases household income in China.
Two-way fixed effects panel regression using provincial-level panel data for 31 Chinese provinces (2011–2021), with robustness checks reported in the paper.
Computable static rules raise signal boundary mass more sharply than ambiguous static rules (0.403 versus 0.281).
Same ABM/RL simulation described in the paper (see run counts and 2,880,000-row firm-period panel referenced in abstract).
high positive When Firms Learn to Game the Rules signal boundary mass (proximity in computable enforcement signal)
Computable static rules raise conduct boundary mass relative to ambiguous static rules (0.411 versus 0.367).
Agent-based reinforcement-learning (ABM/RL) simulation reported in paper; results summarized across runs including a 2,880,000-row firm-period panel and multiple scenario/sweep designs (150 seed-level scenario runs, 378 common-random-number computability-sweep runs, 288 Latin-hypercube global-design runs).
high positive When Firms Learn to Game the Rules conduct boundary mass
AI-flagged complaints are disproportionately associated with first-time filers rather than repeat filers.
Linking complaint AI-flag status to filer metadata indicating prior filing history; reported disproportionate association with first-time filers.
high positive The New Pro Se: Generative AI and the Surge in Federal Civil... association between AI-flagged complaints and filer repeat status (first-time vs...
AI-flagged complaints are more citation-dense.
Comparison of citation counts/density between AI-flagged complaints and other complaints using complaint text metadata.
high positive The New Pro Se: Generative AI and the Surge in Federal Civil... citation density (citations per complaint)
Against a threshold calibrated to the pre-GenAI baseline, the net AI-flagged share is 13.9% of post-GenAI non-form complaints.
Application of the stylometric AI-consistent drafting measure with calibration to pre-GenAI baseline; reported net share for post-GenAI non-form complaints.
high positive The New Pro Se: Generative AI and the Surge in Federal Civil... share of post-GenAI non-form complaints flagged as AI-consistent
The authors develop an interpretable measure of AI-consistent drafting using stylometric AI detection indicators.
Methodological claim: linking case metadata to complaint text and applying stylometric AI-detection indicators to build an interpretable AI-consistent drafting measure.
high positive The New Pro Se: Generative AI and the Surge in Federal Civil... presence/score of AI-consistent drafting in complaint text
The increase in pro se filings is especially pronounced in Civil Rights and Other Statutory cases.
Subgroup analyses by case type within the full civil filing dataset; authors highlight stronger increases in these categories.
high positive The New Pro Se: Generative AI and the Surge in Federal Civil... pro se filing rate by case category (Civil Rights and Other Statutory)
The study dataset comprises roughly 2.8 million civil filings covering FY2008–2025.
Authors state they use civil filing data from FY2008-2025 and reference ~2.8 million filings.
high positive The New Pro Se: Generative AI and the Surge in Federal Civil... sample coverage (number of filings / timeframe)
The federal civil pro se plaintiff rate rose from 11.33% pre-GenAI to 16.94% post-GenAI, a 5.61 percentage-point increase that persists after trend and covariate-adjusted robustness checks.
Analysis of ~2.8 million federal civil filings (FY2008-2025) comparing pre- and post-GenAI periods; authors report trend and covariate-adjusted robustness checks.
Under linear local composition, every protocol tree defines a barycentric coordinate chart on the simplex of leaf weights; Tamari-cover reparameterizations of protocol trees preserve complementarity, and for N = 4 these reparameterizations satisfy the pentagon identity.
Mathematical construction and proofs in the paper linking protocol trees, barycentric coordinates, Tamari lattice reparameterizations, and the pentagon identity (theoretical work; no empirical sample).
high positive Tree-Based Formalization of Multi-Agent Complementarity in H... structural invariances of aggregation protocols (preservation of complementarity...
For N = 2 in regression under squared loss, the optimal linear-pooling weight has a closed form and admits a residual-correction interpretation.
Closed-form derivation and interpretation provided in the paper (mathematical derivation; no empirical sample).
high positive Tree-Based Formalization of Multi-Agent Complementarity in H... optimal pooled prediction weight (performance of 2-agent aggregation)
Across our large-scale empirical analysis, Parthenon substantially improves the performance of state-of-the-art models and harnesses on legal-matter tasks.
Reported evaluations in the paper comparing baseline state-of-the-art models/harnesses to the Parthenon framework across their empirical dataset (Harvey LAB), claiming substantial performance gains.
high positive Parthenon Law: A Self-Evolving Legal-Agent Framework performance on legal-matter tasks (aggregate metric unspecified in abstract)