Evidence (6869 claims)
Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 758 | 199 | 100 | 900 | 2007 |
| Governance & Regulation | 826 | 400 | 191 | 122 | 1563 |
| Organizational Efficiency | 777 | 193 | 124 | 84 | 1189 |
| Technology Adoption Rate | 635 | 233 | 124 | 97 | 1098 |
| Research Productivity | 422 | 128 | 57 | 336 | 954 |
| Output Quality | 476 | 179 | 59 | 47 | 761 |
| Decision Quality | 328 | 177 | 81 | 47 | 640 |
| Firm Productivity | 435 | 57 | 88 | 20 | 606 |
| AI Safety & Ethics | 218 | 277 | 65 | 33 | 599 |
| Market Structure | 180 | 170 | 123 | 24 | 502 |
| Task Allocation | 213 | 64 | 72 | 33 | 387 |
| Skill Acquisition | 170 | 61 | 61 | 17 | 309 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 54 | 107 | 13 | 281 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 153 | 48 | 26 | 3 | 230 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 56 | 56 | 26 | 13 | 154 |
| Training Effectiveness | 94 | 21 | 13 | 19 | 149 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 31 | 18 | 8 | 3 | 61 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Governance
Remove filter
Regulatory ambiguity raises expected compliance risk and can depress private investment in AI capabilities deployed via platforms.
Scenario/impact reasoning based on economic theory of risk and investment; qualitative policy analysis without empirical investment measures.
Divergent EU approaches influence global regulatory standards and could create cross-border frictions for multinational platforms.
Qualitative policy analysis and scenario reasoning on international spillovers; no empirical cross-border trade or compliance data provided.
Monitoring AI-specific harms (e.g., algorithmic amplification, recommendation systems) requires specialized capabilities that existing enforcement bodies may lack.
Governance and enforcement capability analysis; qualitative assessment of institutional capacity gaps.
Ambiguity increases compliance costs for platforms and AI developers; smaller firms may be disproportionately affected, altering market structure.
Qualitative assessment and scenario impact reasoning (no empirical cost estimates provided).
Without explicit alignment mechanisms, gaps may persist (or new ones appear) between platform rules, sectoral AI requirements, and data governance regimes.
Comparative mapping of existing frameworks and scenario analysis highlighting alignment gaps; qualitative assessment.
Effective implementation will require clear division of responsibilities among EU bodies and national authorities; weak coordination risks inconsistent enforcement and regulatory arbitrage.
Governance analysis and qualitative assessment based on institutional structure of EU and member-state authorities; scenario reasoning (no primary quantitative data).
Weak or opaque civil–military interfaces can create hidden demand for capabilities, skew R&D incentives toward secrecy, and reduce competition and efficiency in civilian markets.
Secondary literature on civil–military relations combined with policy analysis; inferential rather than empirically verified within the study.
Progressive use of export controls and differing normative stances on dual‑use technology can disrupt supply chains, affect comparative advantage, and increase costs for multinational suppliers and downstream users.
Analysis of export‑control policies across jurisdictions and theoretical implications discussed in the economics implications section (no quantitative supply‑chain measurement presented).
Pakistan’s weaker governance of military AI may lower immediate compliance burdens for firms but raise reputational and export risks.
Synthesis of Pakistan’s governance documents and civil–military literature, with inferential policy commentary on market and reputational consequences.
Divergent regulatory regimes increase compliance uncertainty for firms and may fragment markets for dual‑use and defence‑adjacent AI goods/services.
Policy commentary drawing on comparative regulatory findings; inference about market effects rather than empirical measurement.
High frictions or opaque consent reduce data supply, raising costs of training models and potentially reducing market competition by advantaging incumbents with richer legacy data.
Economic reasoning and scenario analysis from the workshop; proposed as an implication rather than an empirically tested claim in the workshop summary.
Inadequate consent creates information asymmetries and negative externalities (privacy harms, loss of trust) that can distort demand for AI services.
Theoretical/economic argument presented in the workshop materials and position papers; not supported by a specific empirical study within the workshop summary.
Dynamic behavior of models (continual learning, model updates) changes the meaning of past consent.
Conceptual argument discussed at the workshop and in position papers; no empirical longitudinal analysis presented within the workshop summary.
Decision delegation to AI agents and opaque personalization blur the scope of consent and control.
Theoretical and design-oriented synthesis from interdisciplinary workshop discussions and position papers; no empirical measurement reported.
Existing controls are not user-friendly or empowering.
Qualitative assessment produced during co-design and participatory prototyping at the workshop and position papers; no quantitative usability metrics presented in the summary.
Privacy policies remain hard to understand; transparency alone doesn’t ensure protection.
Workshop synthesis and position papers citing longstanding observations in HCI and privacy research; the workshop did not report a new empirical study measuring comprehension.
Cookie banners and clickwrap routinely violate informed-consent principles.
Claim arises from workshop findings and referenced critiques in position papers and HCI/privacy literature discussed during the workshop; no new empirical counts or sample sizes reported in the workshop summary.
Current privacy-consent mechanisms (cookie banners, dense policies, transparency-only approaches) fail to deliver meaningful user control.
Synthesis from the workshop participants and position papers; based on qualitative critique of existing mechanisms using the Futures Design Toolkit and participatory design discussions. No primary empirical sample or quantitative evaluation reported in the workshop summary.
Biased or unrepresentative AI outputs produce negative externalities, including maladaptation and inefficient investments in vulnerable regions.
Conceptual analysis and illustrative cases linking misleading model outputs to maladaptive decisions; the paper notes risks rather than providing quantified incidence or cost estimates.
Returns to scale in compute and data favor incumbents; without intervention this dynamic can entrench inequality in the global climate-information market.
Economic theory of returns to scale combined with observed compute concentration; no empirical elasticity or returns-to-scale estimates provided.
Concentration of compute and model development creates market power for Northern institutions and companies, likely leading to unequal pricing, control over standards, and capture of high-value climate services.
Descriptive mapping of concentration plus economic analysis of market structure and returns to scale; illustrative rather than quantitatively proven across markets.
Rapid AI adoption without a shift from model-centric to data- and equity-centric development risks producing systematically worse performance and misleading recommendations for the most climate-vulnerable, data-sparse regions.
Synthesis of domain-specific case studies (weather/climate, impact models, LLMs) and conceptual causal tracing demonstrating how infrastructure asymmetry can degrade outputs in vulnerable regions; evidence illustrative rather than causal-estimate based.
Large language models (LLMs) that rely on dominant, textualized climate knowledge tend to foreground Northern epistemologies and marginalize local or indigenous knowledge, reinforcing biases in climate narratives and recommendations.
Case studies and analysis of training-corpus composition and output examples illustrating the dominance of Northern textual sources and examples of sidelining local knowledge; no large-scale audit results provided.
In climate impact modelling, sparse and unrepresentative exposure and vulnerability data combined with inadequate validation generate high uncertainty and risk of misleading interventions and maladaptation in vulnerable locales.
Targeted case studies and literature synthesis showing gaps in exposure/vulnerability datasets and validation failures; argument is illustrated rather than quantified across all systems.
In weather and climate modelling, historically and spatially biased observational data produce systematic performance gaps in under-observed tropical and low-income regions, reducing forecast fidelity where adaptive capacity is lowest.
Comparative, domain-specific case studies and literature review documenting observational data sparsity and illustrative empirical performance gaps; no single cross-system statistical estimate provided.
The geographic concentration of compute and model development creates path dependence: model design, training datasets, and validation reflect Northern priorities and contexts.
Conceptual analysis supported by cross-disciplinary synthesis and illustrative case studies showing dataset selection, validation practices, and model design choices aligned with Northern contexts rather than global representativeness.
At the organizational scale, AI adoption is constrained and shaped by compliance requirements, formal policies, and prevailing norms.
Participants' accounts in workshops (n=15) noting compliance and policy considerations; thematic analysis classified these as organizational-level constraints.
Creators who systematize high-throughput AI workflows or control distribution channels may capture outsized returns, potentially increasing winner-take-most dynamics on platforms.
Theoretical implication extrapolated from observed high-throughput practices and monetization strategies in the 377 videos; not directly measured or quantified in the dataset.
Widespread unverifiable income claims and promotional framing create noisy signals about viable earnings, complicating entrants’ investment decisions and labor market expectations.
Analytical inference based on the documented prevalence of unverifiable earnings claims in the 377 videos and theory about market signaling; not quantitatively tested in the paper.
GenAI lowers the time and skill cost of producing many types of creative outputs, which can increase content supply and exert downward pressure on wages for routine creative tasks.
Inference drawn as an implication from observed practices (e.g., mass production workflows) in the 377 videos and existing literature; not directly measured in this study.
Creators and the community knowledge base document shifting norms around authorship and attribution: GenAI blurs who is considered the creator and complicates labor recognition and rights.
Coding captured explicit discussion and contested norms about authorship, attribution, and creator identity across the 377 videos.
Some creators recommend or describe synthetic engagement practices (e.g., automated posting, synthetic comments/engagement) as tactics to inflate visibility.
Thematic coding noted advice or descriptions of engagement-inflating tactics across videos in the 377-video corpus.
Creators surface and often employ practices that raise content misappropriation concerns (use of copyrighted or third-party material in synthetic outputs).
Instances and discussions captured in the 377-video sample where creators show or recommend synthesizing, transforming, or repurposing third‑party content.
Many videos advertise earnings or income claims that are unverifiable within the content, producing noisy market signals.
Qualitative observations from coding the 377 videos noting frequent asserted earnings without reproducible evidence or transparent accounting.
Interpretation: observed behavior is best explained by ambiguity aversion over data-leak likelihoods — uncertainty about leak probabilities drives avoidance of personalized AI more than baseline privacy preferences alone.
Comparative pattern of results across the Risk and Ambiguity conditions in the randomized experiment (N = 610): no privacy-threat effect when probability is known (Risk), but large privacy-threat effect when probability is ambiguous (Ambiguity), leading authors to attribute effects to ambiguity aversion.
The ambiguity-driven reduction in adoption occurs for both privacy-threatening labels applied to sensitive demographic data and to anonymized preference data — ambiguity reduces adoption regardless of the data-sensitivity label.
Experimental arms varied the data-type/privacy label (sensitive demographic data vs anonymized preference data) within the 2×3 design (N = 610). The paper reports that the negative effect of ambiguity on adoption was observed across these different data-type labels.
Platform-mediated visibility measures used in policy assessments, business analytics, and research (e.g., estimating market share, referral importance, or favoritism) are at risk of misestimation if measurement stochasticity is not incorporated.
Empirical demonstration that citation shares and domain ranks vary across repeated samples and that many apparent differences disappear once uncertainty is quantified; argument linking visibility stochasticity to downstream inference and decision risks.
The heavy-tailed nature of citation distributions implies long tails and high variance, meaning achieving tight uncertainty bounds can require substantially more sampling than would be expected under thin-tailed assumptions.
Observed power-law / heavy-tailed citation-count distributions from repeated-sample data; theoretical implication and empirical guidance from variance estimates and pilot-sample analyses described in the paper.
Numerical simulations using calibrated parameter sets produce phase diagrams and time-paths that show when gradual adjustment transitions into explosive demand collapse and financial stress under different combinations of capability growth, diffusion speed, and reinstatement rate.
Calibrated numerical simulation experiments described in the methods and results sections, using FRED, BLS, and occupational AI-exposure inputs and varying key model parameters.
Because consumption is concentrated and top incomes have high AI exposure, shocks to top-income labor/income disproportionately affect aggregate consumption and thereby threaten private credit and mortgage markets — the paper maps plausible exposures to roughly $2.5 trillion of global private credit and about $13 trillion of mortgages.
Calibration exercise linking household-level demand shocks (based on concentration and AI-exposure mapping) to aggregate credit and mortgage aggregates; reported dollar-amount mappings in the paper's scenarios.
Top-quintile households are also the cohort with the highest measured AI exposure (i.e., incomes/occupations most exposed to AI substitution), increasing the concentration of AI-driven demand risk.
Mapping occupation-level AI-exposure indices to household income quantiles using BLS occupation employment and wage data; used in calibration and scenario analysis.
Intermediation collapse: AI agents reduce information frictions and automate advice/coordination tasks, compressing intermediary margins toward logistics/execution costs and repricing business models across SaaS, payments, consulting, insurance, and financial advisory, with knock-on effects for firm valuations and collateral values that underpin credit markets.
Modeling of intermediary margins and information rents within the macro-financial framework; calibrated scenarios and sectoral discussion mapping margin compression to valuation and collateral effects.
Ghost GDP: AI output that replaces labor-intensive output can create a wedge between measured GDP (which may rise) and consumption-relevant income (which can fall) because a declining labor share reduces monetary velocity absent proportionate transfers — producing hidden demand shortfalls.
Formalization in the paper linking labor share to monetary velocity and thus to consumption-relevant income; calibration using FRED macro time series and monetary-aggregate/velocity proxies.
When firms rationally substitute AI for labor, aggregate labor income can fall and lower demand, which accelerates further AI substitution — a 'displacement spiral' whose net feedback is either self-limiting (convergent) or explosive (runaway adoption + demand collapse) depending on AI capability growth rate, diffusion speed across firms/sectors, and the reinstatement rate (rate at which new paid human roles or demand reappear).
Formal model derivations that identify key parameters and inequalities separating convergent vs explosive regimes; calibrated simulations that vary capability growth, diffusivity, and reinstatement elasticity to produce different phase outcomes.
Rapid AI adoption can create a macro-financial stress scenario not primarily through productivity collapse or existential risk but via a distribution-and-contract mismatch: AI-generated abundance reduces the need for human cognitive labor while institutions (wage contracts, credit, consumption patterns, financial intermediation) remain anchored to the scarcity of human cognition, producing a self-reinforcing downward spiral in labor income, demand, and intermediary margins that can tip into an explosive crisis unless offset by sufficiently fast reinstatement of human-paid demand or deliberate policy/market responses.
Analytical macro-financial model coupling firm-level substitution decisions, aggregate demand mapping, and financial-sector balance-sheet propagation; calibrated numerical simulations using U.S. macro time series (FRED), BLS occupation-level employment and wages, and published occupation-level AI-exposure indices; phase diagrams and scenario time-paths reported in the paper.
Manual qualitative coding does not scale to massive social datasets, and frequency-based topic models suffer from 'semantic thinning' and lack domain awareness.
Conceptual statement presented as motivation; based on conventional critiques of hand-coding and bag-of-words topic models rather than new empirical evidence in this paper's summary.
Rapid coherence decay with thread depth suggests collective problem solving or consensus formation among these agents will be shallow and brittle.
Embedding-based coherence metrics demonstrating fast decline in similarity with increasing thread depth across the dataset; inferential claim about effects on deliberation and consensus processes.
Low emotional alignment and frequent affective redirection indicate human emotional contagion models may not apply to AI-agent interaction, which could produce unstable or counterintuitive coordination dynamics.
Emotion-classification results showing 32.7% mean self-alignment and 33% fear→joy response rate; theoretical interpretation comparing these patterns to human emotional contagion expectations.
Ritualized signaling could create apparent activity (volume, buzz) without substantive informational content, opening avenues for manipulation or mispriced assets.
Observed high rates of patterned/formulaic replies and concentrated non-informational activity patterns in Moltbook; inferential reasoning about how signal amplification without content could affect market perception and asset pricing.
High prevalence of formulaic comments (≈56%+) implies large volumes of low-information signaling that can degrade signal-to-noise ratio in information environments, harming price discovery and liquidity forecasting.
Empirical observation of >56% formulaic comments via lexical-pattern analysis, combined with theoretical inference about information quality and market microstructure (argument linking high low-information reply volume to degraded signal-to-noise).