The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲
← Papers

AI-written goals score higher on objective quality but demotivate people: participants given LLM-authored goals reported weaker ownership and commitment and were far less likely to act within two weeks, with ownership loss driving the drop—especially among low self-efficacy users.

Optimized but Unowned: How AI-Authored Goals Undermine the Motivation They Are Meant to Drive
Vivienne Bihe Chi, Roman Rietsche, Andreas Göldi, Lyle Ungar, Sharath Chandra Guntuku · May 12, 2026
arxiv rct medium evidence 7/10 relevance Source PDF
In a preregistered RCT (N=470), LLM-generated goals were judged higher on SMART criteria but produced lower psychological ownership, commitment, perceived importance, and substantially less follow-through at two-week follow-up, with reduced ownership mediating the effects.

As AI tools become embedded in productivity and self-improvement contexts, a pressing question emerges: what happens when AI does the goal-setting for us? While large language models can generate goals that are objectively well-formed, the motivational consequences of delegating this cognitively and emotionally significant task remain unknown. In a preregistered experiment (N = 470), we compared self-authored goals against LLM-authored goals derived from a personal reflection. Although LLM-generated goals scored higher on SMART criteria (specificity, measurability, achievability, relevance, and time-boundedness; d = 2.26), participants in the LLM condition reported lower psychological ownership (d = 1.38), commitment (d = 1.19), and perceived importance (d = 1.13). At two-week follow-up, 72.8% of self-authored participants had acted on two or more of their goals, compared to 46.6% in the LLM condition. Mediation analyses identified psychological ownership as the mechanism: it mediated the authorship effect on every downstream motivational and behavioral outcome, while objective goal quality did not. Critically, individuals low in trait self-efficacy, those most likely to seek AI assistance, experienced the steepest ownership erosion. These findings reveal a quality-motivation dissociation in AI-assisted goal-setting and identify authorship preservation as a design priority for AI tools deployed in identity-relevant, behavior-dependent tasks.

Summary

Main Finding

LLM-authored personal goals are objectively higher quality (far better on SMART criteria) but substantially undermine motivation and follow-through versus self-authored goals. In a preregistered experiment (N = 470) LLM-generated goals scored higher on SMART criteria (d = 2.26) while producing large declines in psychological ownership (d = 1.38), commitment (d = 1.19), and perceived importance (d = 1.13). At two-week follow-up, 72.8% of self-authored participants had acted on two or more goals versus 46.6% in the LLM condition. Mediation analyses identify reduced psychological ownership — not objective goal quality — as the mechanism driving the motivational and behavioral losses. People low in trait self-efficacy experienced the steepest ownership erosion.

Key Points

  • Experimental manipulation: participants provided the same personal reflection; final goal text was either self-authored or generated by an LLM from that reflection, isolating authorship while holding content constant.
  • Objective quality vs motivation dissociation:
    • LLM goals were far more rubric-compliant on SMART dimensions (d = 2.26).
    • Despite higher formal quality, participants felt the LLM goals were less “theirs” and less motivating.
  • Measured outcomes and effect sizes:
    • Psychological ownership: d = 1.38 (LLM < self)
    • Commitment: d = 1.19 (LLM < self)
    • Perceived importance: d = 1.13 (LLM < self)
    • Two-week behavioral follow-through (≥2 goals acted on): 72.8% self-authored vs 46.6% LLM-authored.
  • Mechanism: psychological ownership fully mediated authorship effects on motivational outcomes and partially mediated behavior; objective goal quality did not mediate motivational or behavioral outcomes.
  • Heterogeneity: participants low in trait self-efficacy — a group more likely to seek AI help — suffered the largest decline in ownership and thus motivation.
  • Design implication emphasized by authors: preserve authorship / self-investment in AI-assisted goal tools (co-creation, scaffolding rather than replacement).

Data & Methods

  • Design: Preregistered between-subjects experiment (N = 470).
  • Conditions: (1) self-authored goals (participants write their own goals), (2) LLM-authored goals (an LLM drafts goals from the participant’s personal reflection).
  • Measures:
    • Objective goal quality via SMART scoring (likely coded against standard rubric).
    • Self-report scales: psychological ownership, commitment, goal self-efficacy, perceived importance, perceived difficulty/clarity.
    • Behavioral follow-up: self-reported action on goals measured at two-week follow-up (e.g., number of goals acted upon; threshold reported: two or more).
    • Moderator: baseline trait self-efficacy.
  • Analysis:
    • Between-group comparisons (effect sizes reported as Cohen’s d).
    • Mediation analysis testing psychological ownership and objective goal quality as mediators of authorship effects.
    • Moderation analysis by trait self-efficacy.
  • Key validity features: preregistration, holding reflective input constant to isolate authorship, and behavioral follow-up (two weeks).

Implications for AI Economics

  • Reassessing value-from-AI claims:
    • Firms and economists should not equate improved artifact quality (e.g., more structured plans) with realized productivity gains. Behavioral follow-through (the end product of goal articulation) can decline even as artifact quality rises.
    • Cost–benefit estimates of AI productivity features must incorporate an “ownership discount” (reduced take-up/implementation) rather than valuing improvements only by output quality metrics.
  • Product design and monetization:
    • AI tools that fully autogenerate goals or plans risk lowering users’ commitment and thus long-run engagement; monetization plans premised on increased user productivity may be overstated.
    • Design choices that preserve felt authorship (co-creation, mandatory edits, incremental scaffolding, visible provenance, interactive prompts that require user input) are economically valuable—they can convert formal quality gains into realized behavioral gains.
  • Market segmentation and inequality:
    • Low self-efficacy users are both more likely to adopt AI help and more harmed by authorship erosion, suggesting AI assistance could exacerbate inequality in behavior-change outcomes and human-capital accumulation.
    • Targeted product variants (e.g., guided scaffolding for low-efficacy users) may be necessary to avoid negative distributional effects.
  • Measurement and evaluation for firms and researchers:
    • Include behavioral follow-through, psychological ownership, and heterogeneity analyses in A/B tests and ROI models rather than relying solely on artifact-quality or short-term engagement metrics.
    • Incorporate mediation and moderation (ownership, self-efficacy) into causal models of AI impacts.
  • Labor and organizational implications:
    • In workplace settings, delegating goal or task formulation to AI could reduce workers’ commitment to objectives and thus blunt productivity gains. Productivity models that assume immediate transfer of formal task quality into output should be revised.
    • Training and skill-formation: overuse of generative tools for planning risks undercutting employees’ development of self-regulation and planning skills—an effect with long-term human-capital consequences.
  • Policy and disclosure:
    • Regulators and platforms may consider guidelines around authorship transparency and design standards to ensure consumers can retain agency and ownership when using AI for identity- or behavior-relevant tasks.
  • Research directions for economists:
    • Model realized benefit as: realized_gain = f(objective_quality, psychological_ownership), where ownership is endogenous to interface design and user traits.
    • Run field experiments where adoption, follow-through, and downstream productivity are jointly measured; test co-creation interventions (minimal suggestions, required edits, shared control) and estimate welfare impacts across population subgroups.
    • Quantify long-term dynamic effects: repeated delegation may reduce skill accumulation; incorporate this into dynamic human-capital models.

Practical takeaway: AI-produced improvements in plan or goal text do not automatically translate to greater real-world outcomes. For economic assessments and product strategy, preserving users’ sense of authorship and self-investment is critical to realizing the behavioral and productivity dividends AI promises.

Assessment

Paper Typerct Evidence Strengthmedium — Randomized design with preregistration and a two-week behavioral follow-up gives strong internal validity for short-term causal effects, but external validity is limited by reliance on a single experimental context, likely convenience sample, self-reported behavioral outcomes, short follow-up horizon, and use of a particular LLM/prompting setup; mediation results are suggestive but do not fully rule out alternative pathways. Methods Rigorhigh — The study is preregistered, uses random assignment, reports standardized effect sizes for multiple psychological outcomes, includes a behavioral follow-up, and conducts mediation analyses; however, rigor is tempered by reliance on self-report measures for behavior, potential demand characteristics, single LLM/prompt implementation, and limited follow-up duration. SamplePreregistered experiment with N = 470 adult participants randomized to either self-authored goals or LLM-authored goals based on a personal reflection; measured SMART criteria ratings, psychological ownership, commitment, perceived importance, trait self-efficacy, and self-reported goal-directed actions at two-week follow-up. Themeshuman_ai_collab productivity IdentificationPreregistered randomized controlled experiment assigning participants (N = 470) to either self-authored or LLM-authored goals derived from a personal reflection; causal effects identified via random assignment, with mediation analysis testing psychological ownership as the mechanism between authorship condition and downstream motivational and behavioral outcomes. GeneralizabilityLikely convenience/online sample rather than representative population, Short (two-week) follow-up—uncertain long-term effects, Self-reported behavioral outcomes rather than objective measures of goal attainment, Findings may depend on the specific LLM, prompt design, and how goals were presented, Single task/context (personal goal-setting); results may not generalize to workplace or firm-level productivity applications, Cultural or demographic differences in ownership and AI attitudes may alter effects

Claims (9)

ClaimDirectionConfidenceOutcomeDetails
LLM-generated goals scored higher on SMART criteria (specificity, measurability, achievability, relevance, and time-boundedness). Output Quality positive high SMART criteria score (objective goal quality)
n=470
d = 2.26
1.0
Participants in the LLM condition reported lower psychological ownership (d = 1.38). Worker Satisfaction negative high psychological ownership (self-reported)
n=470
d = 1.38
1.0
Participants in the LLM condition reported lower commitment (d = 1.19). Worker Satisfaction negative high commitment (self-reported)
n=470
d = 1.19
1.0
Participants in the LLM condition reported lower perceived importance (d = 1.13). Worker Satisfaction negative high perceived importance of goals (self-reported)
n=470
d = 1.13
1.0
At two-week follow-up, 72.8% of self-authored participants had acted on two or more of their goals, compared to 46.6% in the LLM condition. Task Completion Time positive high proportion of participants who acted on two or more goals within two weeks (behavioral follow-up)
72.8% vs 46.6%
1.0
Mediation analyses identified psychological ownership as the mechanism: it mediated the authorship effect on every downstream motivational and behavioral outcome, while objective goal quality did not. Worker Satisfaction positive high mediating effect of psychological ownership on authorship => motivational and behavioral outcomes
n=470
0.6
Individuals low in trait self-efficacy experienced the steepest ownership erosion (i.e., AI-authorship reduced psychological ownership most for low self-efficacy participants). Worker Satisfaction negative high change/erosion in psychological ownership as moderated by trait self-efficacy
n=470
0.6
There is a quality–motivation dissociation in AI-assisted goal-setting: AI-authored goals are objectively higher quality but produce lower motivation and worse behavioral follow-through. Output Quality mixed high divergence between objective goal quality (SMART) and motivational/behavioral outcomes
n=470
0.6
Authorship preservation should be a design priority for AI tools deployed in identity-relevant, behavior-dependent tasks. Governance And Regulation positive high design recommendation (no empirical outcome measured)
0.1

Notes