Evidence (3231 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	609	159	77	738	1617
Governance & Regulation	671	334	160	99	1285
Organizational Efficiency	626	147	105	70	955
Technology Adoption Rate	502	176	98	78	861
Research Productivity	349	109	48	322	838
Output Quality	391	121	45	40	597
Firm Productivity	385	46	85	17	539
Decision Quality	277	145	63	34	526
AI Safety & Ethics	189	244	59	30	526
Market Structure	152	154	109	20	440
Task Allocation	158	50	56	26	295
Innovation Output	178	23	38	17	257
Skill Acquisition	137	52	50	13	252
Fiscal & Macroeconomic	120	64	38	23	252
Employment Level	93	46	96	12	249
Firm Revenue	130	43	26	3	202
Consumer Welfare	99	51	40	11	201
Inequality Measures	36	106	40	6	188
Task Completion Time	134	18	6	5	163
Worker Satisfaction	79	54	16	11	160
Error Rate	64	79	8	1	152
Regulatory Compliance	69	66	14	3	152
Training Effectiveness	82	16	13	18	131
Wages & Compensation	70	25	22	6	123
Team Performance	74	16	21	9	121
Automation Exposure	41	48	19	9	120
Job Displacement	11	71	16	1	99
Developer Productivity	71	14	9	3	98
Hiring & Recruitment	49	7	8	3	67
Social Protection	26	14	8	2	50
Creative Output	26	14	6	2	49
Skill Obsolescence	5	37	5	1	48
Labor Share of Income	12	13	12	—	37
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Labor Markets Remove filter

Breakthroughs in structure prediction arise from end‑to‑end deep models that combine evolutionary information (MSAs, coevolutionary signals), geometric constraints and equivariant architectures, and large‑scale pretraining on sequence databases.

Paper describes methodological components: end‑to‑end architectures using MSAs, SE(3)/E(3)-equivariant layers, transformer‑based pretraining on UniRef/UniProt/metagenomic catalogs; no quantitative ablation studies are provided in the text.

high positive Protein structure prediction powered by artificial intellige... improvement in predictive performance attributable to combined modeling componen...

Canada emphasizes teacher-led assessment, cautious regulation, and a focus on equity and professional development in responding to AI-related assessment issues.

Country case study based on Canadian policy documents and secondary sources highlighting teacher-led approaches and regulatory caution; illustrative description.

high positive The Future of Assessment: Rethinking Evaluation in an AI-Ass... policy emphasis on teacher-led assessment and professional development

Creators explicitly name advertising, direct sales, affiliate marketing, and revenue-sharing models as common monetization channels for GenAI-enabled content.

Explicit references to these monetization channels appeared repeatedly across the 377 videos and were extracted during thematic coding.

high positive Monetizing Generative AI: YouTubers' Collective Knowledge on... types of monetization channels mentioned in videos

Integrating AI (notably ML and NLP) meaningfully automates routine software engineering tasks across requirements management, code generation, testing, and maintenance.

Systematic literature review of prior AI-for-SE work combined with an empirical survey of software engineering professionals reporting usage and examples of tool-supported automation; sample size for the survey not specified in the summary.

high positive Artificial Intelligence as a Catalyst for Innovation in Soft... degree of task automation (e.g., frequency or share of routine tasks automated)

Coordination-Risk Cues—task-conditioned priors on disagreement/tie rates—capture coordination difficulty across tasks.

Method description: disagreement/tie rates computed per cluster from pairwise preference comparisons to form priors indicating coordination risk. Data source: Chatbot Arena pairwise comparisons; tie/disagreement rate computation described but numeric values not provided here.

high positive Task-Aware Delegation Cues for LLM Agents tie/disagreement rate per task cluster (coordination difficulty prior)

Capability Profiles—task-conditioned win-rate maps—can be computed per cluster to summarize agent strengths.

Method description: win-rate maps derived by computing agent win rates conditional on task clusters from the Chatbot Arena pairwise comparisons. Implementation reported in paper; no numeric summary of win-rate differences provided here.

high positive Task-Aware Delegation Cues for LLM Agents agent win-rate per task cluster

Semantic clustering on Chatbot Arena pairwise comparisons induces an interpretable task taxonomy (taxonomy induction).

Methodological claim: authors applied semantic clustering to tasks/queries from Chatbot Arena pairwise preference data to produce clusters described as interpretable. Data source: Chatbot Arena pairwise comparisons; specific clustering algorithm and hyperparameters not specified here.

high positive Task-Aware Delegation Cues for LLM Agents interpretable task clusters (taxonomy)

A speculative WikiRAT instantiation on Wikipedia illustrates RATs' design and potential uses.

The paper presents WikiRAT as a speculative prototype/illustration; no large-scale deployment or user study of WikiRAT is reported.

high positive Chasing RATs: Tracing Reading for and as Creative Activity existence of a prototype illustration (WikiRAT)

RATs record sequences of interaction: traversal (what is read and in what order), association (links and connections the reader forms), and reflection (annotations, notes, time spent), producing inspectable, shareable trajectories.

Design specification within the paper and description of data types RATs would collect (ordered page/navigation logs, hyperlinks followed, time-on-page, annotations, saved excerpts, tags, notes). This is a definitional claim about the proposed system rather than empirical measurement.

high positive Chasing RATs: Tracing Reading for and as Creative Activity captured interaction traces (traversal, association, reflection) as data

Dataset and code (CFD, CFM, CFR) are publicly released.

Repository link provided in the summary (https://github.com/ZhengyaoFang/CFM) and paper states public release of dataset and code.

high positive Too Vivid to Be Real? Benchmarking and Calibrating Generativ... public availability of dataset and code

The Color Fidelity Dataset (CFD) is a large-scale dataset of over 1.3 million images containing both real photographs and synthetic T2I outputs, organized with ordered levels of color realism to support objective evaluation.

Dataset construction described in paper and repository: size stated as >1.3M images; contains a mixture of real photos and synthetic images annotated/organized with ordered realism labels enabling relative judgments of color fidelity.

high positive Too Vivid to Be Real? Benchmarking and Calibrating Generativ... dataset size and composition; presence of ordered color-realism labels enabling ...

Standards and governance frameworks (for model auditability, security, and alignment) will become economic infrastructure influencing adoption costs and market trust.

Conceptual argument linking governance to adoption and trust, drawing on normative risk analysis; no empirical governance impact studies included.

high positive How AI Will Transform the Daily Life of a Techie within 5 Ye... existence and adoption of standards/governance frameworks and their effect on AI...

Increasing AI autonomy magnifies ethical, safety, and value‑alignment concerns; robust human oversight and institutional governance are required.

Normative and risk analysis based on projected increases in system autonomy and illustrative failure modes; no formal safety audits included.

high positive How AI Will Transform the Daily Life of a Techie within 5 Ye... need/extent of human oversight and governance mechanisms (existence and strength...

Models and systems must include robust governance: transparency, explainability, provenance logging, versioning, and compliance checks to maintain trust and satisfy auditors/regulators.

Normative claim supported by recommended governance and evaluation practices described in the paper; no regulatory testing or audit case studies reported.

high positive Next-Generation Financial Analytics Frameworks for AI-Enable... governance/compliance indicators (e.g., presence of explainability reports, audi...

Cloud and distributed compute (data lakes, distributed training, streaming pipelines) provide the scalability needed to handle growing data and model complexity in financial analytics.

Technical claim supported by proposed infrastructure components in the paper; no benchmarking or capacity measurements provided.

high positive Next-Generation Financial Analytics Frameworks for AI-Enable... scalability measures (e.g., throughput, latency under load, time to train models...

Such frameworks—designed to be modular, scalable, and interoperable—enable pluggable AI modules (scenario analysis, cash‑flow forecasting, dynamic pricing) and easier integration with ERP/BI systems.

Architectural claim supported by system design principles listed in the paper (modular model repositories, model-serving layers, feature stores, API integration); presented as design best-practices rather than empirical validation.

high positive Next-Generation Financial Analytics Frameworks for AI-Enable... system integration metrics (e.g., number of pluggable modules, integration time,...

A systematic RM process—risk identification → analysis/assessment → evaluation/response → control implementation → monitoring and reporting—is a core component of effective practice.

Convergence of process descriptions across ISO 31000, COSO ERM, and multiple reviewed publications identified via thematic analysis.

high positive The Role of Risk Management as an Organizational Management ... completeness/consistency of RM processes

Integration of risk management with strategy-setting and operational processes is essential to realize RM benefits.

Thematic findings from the literature review and recommendations in established frameworks (ISO 31000, COSO ERM); synthesized across peer-reviewed and practitioner literature.

high positive The Role of Risk Management as an Organizational Management ... alignment of RM with strategy and operations; realized RM benefits

An embedded risk culture and clear accountability across the organization are necessary enablers for effective risk management.

Repeatedly reported across reviewed literature and standards (e.g., ISO/COSO) in the thematic synthesis; supported by multiple secondary sources in the ten-year scope.

high positive The Role of Risk Management as an Organizational Management ... degree of RM cultural embedding; accountability; RM effectiveness

Leadership and governance commitment (board and senior management buy-in) is a core component required for effective risk management implementation.

Consistent identification of leadership/governance as an enabling factor across multiple peer-reviewed articles, books, and risk frameworks synthesized in the review; thematic analysis of literature over the last ten years.

high positive The Role of Risk Management as an Organizational Management ... effectiveness of risk management implementation / successful RM adoption

The task frontier expands: new tasks become profitable and are created endogenously as coordination costs decline.

Analytical derivation in the model (proposition about task frontier) and simulation exercises that permit endogenous task entry.

high positive AI as Coordination-Compressing Capital: Task Reallocation, O... task frontier (set/number of profitable tasks)

Aggregate output increases when coordination costs fall because reduced frictions and endogenous task creation raise productive capacity.

Analytical result (one of the five propositions) showing comparative statics of output with respect to coordination compression; supported by calibrated numerical simulations.

high positive AI as Coordination-Compressing Capital: Task Reallocation, O... aggregate output (economy-wide production)

Lower coordination costs expand managers’ spans of control (managers can supervise more subordinates).

Analytical comparative statics derived in the model (one of the five propositions) and corroborating numerical simulations with heterogeneous agents.

high positive AI as Coordination-Compressing Capital: Task Reallocation, O... span of control (number of subordinates per manager)

A one standard-deviation increase in AI adoption causally increases employment in occupations requiring complex problem-solving and interpersonal skills by 1.8%.

Same panel (38 OECD countries, 2019–2025) and AI Adoption Index; IV estimation with occupational employment classified by task type (complex problem-solving & interpersonal); fixed effects and robustness checks reported.

high positive Artificial Intelligence and Labor Market Transformation: Emp... Employment in complex problem-solving and interpersonal occupations (percent cha...

Overinvestment increases inequality (greater tail concentration of income).

Model computations showing that exponential returns amplify income at the top; comparative statics indicate inequality measures rise with greater investment/technology under lognormal wage assumption.

high positive Janus-Faced Technological Progress and the Arms Race in the ... income inequality (tail concentration measures/Gini-like outcomes)

Overinvestment increases measured GDP (output).

Comparative statics in the theoretical model linking higher private investment/technology adoption to higher aggregate output; model shows positive effect on measured GDP despite welfare loss possibilities.

high positive Janus-Faced Technological Progress and the Arms Race in the ... aggregate GDP/output

The exponential returns to skill and technology create strong private incentives for agents to escalate skill (education) investment toward the high tail of the distribution (an educational arms race).

Equilibrium analysis and comparative statics in the theoretical model showing that marginal returns to additional investment are increasing toward the distribution tail, producing higher optimal private investment at the top relative to social optimum.

high positive Janus-Faced Technological Progress and the Arms Race in the ... individual education/skill investment level

When wages follow a lognormal distribution, technological progress makes wages increase exponentially in both skill and technology.

Analytical derivation in the paper's economic model that assumes a lognormal wage distribution and specifies wages as an exponential function of skill and a technology parameter; result follows from model algebra (no empirical data).

high positive Janus-Faced Technological Progress and the Arms Race in the ... individual wage level

Research priorities include developing robust measures of AI adoption and using causal methods (difference-in-differences, synthetic controls, RDD, IV) to estimate effects of AI and regulation on productivity, employment, and inequality.

Methodological recommendations in the report based on identified evidence gaps and normative evaluation of empirical priorities.

high positive AI Governance and Data Privacy: Comparative Analysis of U.S.... quality of AI adoption measures and causal estimates for productivity, employmen...

The American Artificial Intelligence Initiative emphasizes R&D and innovation leadership, standards development, workforce readiness, and fostering 'trustworthy AI' (transparency, fairness, accountability).

Primary source policy documents from the U.S. American Artificial Intelligence Initiative reviewed in the report.

high positive AI Governance and Data Privacy: Comparative Analysis of U.S.... policy emphasis areas (R&D investment, standards, workforce readiness, trustwort...

The paper introduces a Predictive Skill Gap Intelligence Hub — an AI-driven platform that combines macro- and micro-level indicators with probabilistic growth models and intelligent skill-synthesis to proactively forecast regional and sectoral labor demand–supply gaps.

Description of system architecture and modeling approach in the paper (methods section). No numerical evaluation metrics or datasets provided for this descriptive claim.

high positive AI-Based Predictive Skill Gap Analysis for Workforce Plannin... ability to forecast regional and sectoral labor demand–supply gaps (descriptive ...

Recommended research priorities for economists include measuring how adoption changes task mixes and wages, quantifying verification/remediation costs, estimating productivity gains net of security/IP costs, and studying market dynamics from centralized model providers.

Author recommendations based on identified gaps in the empirical literature synthesized by the paper.

high positive ChatGPT as a Tool for Programming Assistance and Code Develo... generation of targeted empirical studies addressing task mix, wage impacts, veri...

Recommended policy levers include data-governance rules, provenance and watermarking standards, liability frameworks, copyright clarifications, competition policy, and taxes/subsidies to internalize externalities.

Policy recommendations synthesized from legal, regulatory, and economic literatures within the review; presented as qualitative guidance rather than tested policy interventions.

high positive Ethical and societal challenges to the adoption of generativ... effectiveness of specified regulatory instruments in mitigating harms from gener...

A structured three-stage framework (input/process/output) clarifies where different risks and regulatory rules apply to generative audiovisual systems.

Framework presented in the paper as a conceptual synthesis of reviewed literatures; supported by cross-references to legal, technical, and ethical sources within the review.

high positive Ethical and societal challenges to the adoption of generativ... clarity and mapping of risk types to development/use stages

The paper introduces IJOPM’s Africa Initiative (AfIn) to support Africa-based OSCM research, outlining motivation, objectives, review process, and researcher support mechanisms.

Descriptive account within the paper (administrative/initiative description rather than empirical evidence).

high positive Continental shift: operations and supply chain management re... institutional support mechanisms for Africa-based OSCM research and publication ...

The paper proposes specific metrics and empirical follow-ups (e.g., generation-to-verification throughput ratios, defect accumulation rates, time-to-acceptance for machine-generated artifacts, incident rates attributable to unverified AI outputs) to validate the model.

Explicit recommendations and measurement proposals listed in the paper; no empirical implementation provided.

high positive Overton Framework v1.0: Cognitive Interlocks for Integrity i... proposed measurement constructs (generation:verification ratio, defect accumulat...

Recommended next steps include building and calibrating ABMs with agent heterogeneity, prototyping technical implementations of token verification (proof-of-query receipts, cryptographic attestation), and red-teaming for spoofing/evasion.

Paper's research & policy next-steps and operational recommendations; no implementation results included.

high positive Token Taxes: mitigating AGI's economic risks research progress on ABMs and token verification prototypes

Enhanced gross‑flows estimation using longitudinal microdata can better track transitions (job-to-job, upskilling, unemployment spells) and measure occupational churn and reallocation.

Established econometric practice cited in paper; recommendation to use panel/admin microdata (CPS longitudinal supplements, LEHD/LODES, UI records); no new empirical results but aligns with standard methods.

high positive Enhancing BLS Methodologies for Projecting AI's Impact on Em... transition rates, spell durations, occupation-to-occupation flows, upskilling in...

Team Situation Awareness (shared perception, comprehension, projection) remains a useful analytic anchor for HAT even with agentic AI.

Conceptual analysis mapping Team SA components onto agentic AI interactions; literature review of Team SA utility in HAT contexts.

high positive Visioning Human-Agentic AI Teaming: Continuity, Tension, and... usefulness of Team Situation Awareness as an analytic framework

Automated equivalency systems require algorithmic oversight features (audit trails, human-in-the-loop checks) to maintain trust and labor-market legitimacy.

Governance recommendation following best practices in algorithmic accountability; not supported by empirical testing of oversight mechanisms in this context.

high positive Establishes a technical and academic bridge between the educ... user trust metrics, appeal/review rates, correctness of overturned automated dec...

AI tools (automated document parsing/NLP, translation, equivalency-prediction classifiers, anomaly detection) can scale credential processing and reduce transaction costs and processing time.

Paper cites potential AI capabilities and application areas; the claim is inferential from known AI functionalities, with no implementation benchmark or throughput numbers provided.

high positive Establishes a technical and academic bridge between the educ... processing throughput, average processing time per credential, operational costs

Continuous monitoring and observability for performance, compliance, and drift are essential to maintain operational stability and detect model or process degradation.

Prescriptive claim grounded in engineering practice and comparative analysis of failure modes; supported by illustrative deployments; no quantitative evaluation of monitoring impact reported.

high positive Governed Hyperautomation for CRM and ERP: A Reference Patter... detection rate/time for performance degradation, compliance violations, model dr...

Core governance components should include policy enforcement integrated into development and deployment pipelines, risk controls for data/model behavior/automated actions, explicit human-in-the-loop and human-on-the-loop oversight, continuous monitoring/logging/incident-response, and role-based governance structures linking legal, compliance, IT, and business units.

Prescriptive design based on literature synthesis and practitioner experience; described as core components in the proposed reference pattern (conceptual, case-illustrated).

high positive Governed Hyperautomation for CRM and ERP: A Reference Patter... presence and integration of specified governance controls and organizational rol...

Research needs include empirically measuring prevalence and average loss from prompt fraud incidents, evaluating effectiveness and cost-effectiveness of technical mitigations (watermarking, provenance), and modeling firm-level investment decisions under varying regulatory/insurance regimes.

Authors' recommended agenda for further research based on identified gaps in the paper's qualitative analysis.

high positive Prompt Engineering or Prompt Fraud? Governance Challenges fo... existence and quality of empirical datasets and models addressing prevalence, lo...

All data are openly available at https://www.antscan.info.

Explicit statement of public repository/portal and URL provided in the paper.

high positive High-throughput phenomics of global ant biodiversity data accessibility (public availability and repository URL)

The dataset includes metadata such as taxonomic labels, collection/locality data, and links to genome projects where available.

Paper states dataset contents include whole-body volumes/meshes and associated metadata (taxonomic labels, locality, genome links).

high positive High-throughput phenomics of global ant biodiversity presence and type of metadata fields associated with scans

The scanning pipeline was optimized and standardized to enable digitizing hundreds to thousands of specimens.

Authors describe an optimized, standardized pipeline and cite the achieved output (2,193 scans) as demonstration.

high positive High-throughput phenomics of global ant biodiversity pipeline throughput/scale (hundreds–thousands of specimens)

The project demonstrated a high-throughput application of synchrotron X-ray microtomography for whole-organism digitization at scale.

Combination of method (synchrotron microCT), standardized pipeline, and production of 2,193 scans presented as evidence of high-throughput capability.

high positive High-throughput phenomics of global ant biodiversity throughput of whole-organism digitization (number of scans produced using the pi...

Imaging modality used is synchrotron X-ray microtomography (high-resolution 3D imaging).

Method section details use of synchrotron X-ray microtomography for whole-body imaging.

high positive High-throughput phenomics of global ant biodiversity imaging modality applied

Scans were acquired with standardized parameters to facilitate automated and replicable analysis and benchmarking.

Paper describes a standardized acquisition protocol and pipeline (synchrotron X-ray microtomography) and notes standardized parameters and metadata format.

high positive High-throughput phenomics of global ant biodiversity use of standardized scanning parameters and metadata format

« Prev 1 2 3 … 28 29 30 … 64 65 Next »