AI-written goals score higher on objective quality but demotivate people: participants given LLM-authored goals reported weaker ownership and commitment and were far less likely to act within two weeks, with ownership loss driving the drop—especially among low self-efficacy users.
As AI tools become embedded in productivity and self-improvement contexts, a pressing question emerges: what happens when AI does the goal-setting for us? While large language models can generate goals that are objectively well-formed, the motivational consequences of delegating this cognitively and emotionally significant task remain unknown. In a preregistered experiment (N = 470), we compared self-authored goals against LLM-authored goals derived from a personal reflection. Although LLM-generated goals scored higher on SMART criteria (specificity, measurability, achievability, relevance, and time-boundedness; d = 2.26), participants in the LLM condition reported lower psychological ownership (d = 1.38), commitment (d = 1.19), and perceived importance (d = 1.13). At two-week follow-up, 72.8% of self-authored participants had acted on two or more of their goals, compared to 46.6% in the LLM condition. Mediation analyses identified psychological ownership as the mechanism: it mediated the authorship effect on every downstream motivational and behavioral outcome, while objective goal quality did not. Critically, individuals low in trait self-efficacy, those most likely to seek AI assistance, experienced the steepest ownership erosion. These findings reveal a quality-motivation dissociation in AI-assisted goal-setting and identify authorship preservation as a design priority for AI tools deployed in identity-relevant, behavior-dependent tasks.
Summary
Main Finding
LLM-authored personal goals are objectively higher quality (far better on SMART criteria) but substantially undermine motivation and follow-through versus self-authored goals. In a preregistered experiment (N = 470) LLM-generated goals scored higher on SMART criteria (d = 2.26) while producing large declines in psychological ownership (d = 1.38), commitment (d = 1.19), and perceived importance (d = 1.13). At two-week follow-up, 72.8% of self-authored participants had acted on two or more goals versus 46.6% in the LLM condition. Mediation analyses identify reduced psychological ownership — not objective goal quality — as the mechanism driving the motivational and behavioral losses. People low in trait self-efficacy experienced the steepest ownership erosion.
Key Points
- Experimental manipulation: participants provided the same personal reflection; final goal text was either self-authored or generated by an LLM from that reflection, isolating authorship while holding content constant.
- Objective quality vs motivation dissociation:
- LLM goals were far more rubric-compliant on SMART dimensions (d = 2.26).
- Despite higher formal quality, participants felt the LLM goals were less “theirs” and less motivating.
- Measured outcomes and effect sizes:
- Psychological ownership: d = 1.38 (LLM < self)
- Commitment: d = 1.19 (LLM < self)
- Perceived importance: d = 1.13 (LLM < self)
- Two-week behavioral follow-through (≥2 goals acted on): 72.8% self-authored vs 46.6% LLM-authored.
- Mechanism: psychological ownership fully mediated authorship effects on motivational outcomes and partially mediated behavior; objective goal quality did not mediate motivational or behavioral outcomes.
- Heterogeneity: participants low in trait self-efficacy — a group more likely to seek AI help — suffered the largest decline in ownership and thus motivation.
- Design implication emphasized by authors: preserve authorship / self-investment in AI-assisted goal tools (co-creation, scaffolding rather than replacement).
Data & Methods
- Design: Preregistered between-subjects experiment (N = 470).
- Conditions: (1) self-authored goals (participants write their own goals), (2) LLM-authored goals (an LLM drafts goals from the participant’s personal reflection).
- Measures:
- Objective goal quality via SMART scoring (likely coded against standard rubric).
- Self-report scales: psychological ownership, commitment, goal self-efficacy, perceived importance, perceived difficulty/clarity.
- Behavioral follow-up: self-reported action on goals measured at two-week follow-up (e.g., number of goals acted upon; threshold reported: two or more).
- Moderator: baseline trait self-efficacy.
- Analysis:
- Between-group comparisons (effect sizes reported as Cohen’s d).
- Mediation analysis testing psychological ownership and objective goal quality as mediators of authorship effects.
- Moderation analysis by trait self-efficacy.
- Key validity features: preregistration, holding reflective input constant to isolate authorship, and behavioral follow-up (two weeks).
Implications for AI Economics
- Reassessing value-from-AI claims:
- Firms and economists should not equate improved artifact quality (e.g., more structured plans) with realized productivity gains. Behavioral follow-through (the end product of goal articulation) can decline even as artifact quality rises.
- Cost–benefit estimates of AI productivity features must incorporate an “ownership discount” (reduced take-up/implementation) rather than valuing improvements only by output quality metrics.
- Product design and monetization:
- AI tools that fully autogenerate goals or plans risk lowering users’ commitment and thus long-run engagement; monetization plans premised on increased user productivity may be overstated.
- Design choices that preserve felt authorship (co-creation, mandatory edits, incremental scaffolding, visible provenance, interactive prompts that require user input) are economically valuable—they can convert formal quality gains into realized behavioral gains.
- Market segmentation and inequality:
- Low self-efficacy users are both more likely to adopt AI help and more harmed by authorship erosion, suggesting AI assistance could exacerbate inequality in behavior-change outcomes and human-capital accumulation.
- Targeted product variants (e.g., guided scaffolding for low-efficacy users) may be necessary to avoid negative distributional effects.
- Measurement and evaluation for firms and researchers:
- Include behavioral follow-through, psychological ownership, and heterogeneity analyses in A/B tests and ROI models rather than relying solely on artifact-quality or short-term engagement metrics.
- Incorporate mediation and moderation (ownership, self-efficacy) into causal models of AI impacts.
- Labor and organizational implications:
- In workplace settings, delegating goal or task formulation to AI could reduce workers’ commitment to objectives and thus blunt productivity gains. Productivity models that assume immediate transfer of formal task quality into output should be revised.
- Training and skill-formation: overuse of generative tools for planning risks undercutting employees’ development of self-regulation and planning skills—an effect with long-term human-capital consequences.
- Policy and disclosure:
- Regulators and platforms may consider guidelines around authorship transparency and design standards to ensure consumers can retain agency and ownership when using AI for identity- or behavior-relevant tasks.
- Research directions for economists:
- Model realized benefit as: realized_gain = f(objective_quality, psychological_ownership), where ownership is endogenous to interface design and user traits.
- Run field experiments where adoption, follow-through, and downstream productivity are jointly measured; test co-creation interventions (minimal suggestions, required edits, shared control) and estimate welfare impacts across population subgroups.
- Quantify long-term dynamic effects: repeated delegation may reduce skill accumulation; incorporate this into dynamic human-capital models.
Practical takeaway: AI-produced improvements in plan or goal text do not automatically translate to greater real-world outcomes. For economic assessments and product strategy, preserving users’ sense of authorship and self-investment is critical to realizing the behavioral and productivity dividends AI promises.
Assessment
Claims (9)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| LLM-generated goals scored higher on SMART criteria (specificity, measurability, achievability, relevance, and time-boundedness). Output Quality | positive | high | SMART criteria score (objective goal quality) |
n=470
d = 2.26
1.0
|
| Participants in the LLM condition reported lower psychological ownership (d = 1.38). Worker Satisfaction | negative | high | psychological ownership (self-reported) |
n=470
d = 1.38
1.0
|
| Participants in the LLM condition reported lower commitment (d = 1.19). Worker Satisfaction | negative | high | commitment (self-reported) |
n=470
d = 1.19
1.0
|
| Participants in the LLM condition reported lower perceived importance (d = 1.13). Worker Satisfaction | negative | high | perceived importance of goals (self-reported) |
n=470
d = 1.13
1.0
|
| At two-week follow-up, 72.8% of self-authored participants had acted on two or more of their goals, compared to 46.6% in the LLM condition. Task Completion Time | positive | high | proportion of participants who acted on two or more goals within two weeks (behavioral follow-up) |
72.8% vs 46.6%
1.0
|
| Mediation analyses identified psychological ownership as the mechanism: it mediated the authorship effect on every downstream motivational and behavioral outcome, while objective goal quality did not. Worker Satisfaction | positive | high | mediating effect of psychological ownership on authorship => motivational and behavioral outcomes |
n=470
0.6
|
| Individuals low in trait self-efficacy experienced the steepest ownership erosion (i.e., AI-authorship reduced psychological ownership most for low self-efficacy participants). Worker Satisfaction | negative | high | change/erosion in psychological ownership as moderated by trait self-efficacy |
n=470
0.6
|
| There is a quality–motivation dissociation in AI-assisted goal-setting: AI-authored goals are objectively higher quality but produce lower motivation and worse behavioral follow-through. Output Quality | mixed | high | divergence between objective goal quality (SMART) and motivational/behavioral outcomes |
n=470
0.6
|
| Authorship preservation should be a design priority for AI tools deployed in identity-relevant, behavior-dependent tasks. Governance And Regulation | positive | high | design recommendation (no empirical outcome measured) |
0.1
|