Personality determines who benefits from AI coaching: a theory-driven conversational coach helps overcontrolled negotiators, resilient users improve more from a traditional handbook, and undercontrolled users gain little, suggesting AI coaching should adapt support intensity to individual readiness.
AI-driven conversational coaching is increasingly used to support workplace negotiation, yet prior work assumes uniform effectiveness across users. We challenge this assumption by examining how individual differences, particularly personality traits, moderate coaching outcomes. We conducted a between-subjects experiment (N=267) comparing theory-driven AI (Trucey), general-purpose AI (Control-AI), and a traditional negotiation handbook (Control-NoAI). Participants were clustered into three profiles -- resilient, overcontrolled, and undercontrolled -- based on the Big-Five personality traits and ARC typology. Resilient workers achieved broad psychological gains primarily from the handbook, overcontrolled workers showed outcome-specific improvements with theory-driven AI, and undercontrolled workers exhibited minimal effects despite engaging with the frameworks. These patterns suggest personality as a predictor of readiness beyond stage-based tailoring: vulnerable users benefit from targeted rather than comprehensive interventions. The study advances understanding of personality-determined intervention prerequisites and highlights design implications for adaptive AI coaching systems that align support intensity with individual readiness, rather than assuming universal effectiveness.
Summary
Main Finding
AI negotiation coaching is not universally effective: personality profiles predict who benefits. Resilient individuals (high extraversion/agreeableness/conscientiousness/openness; low neuroticism) gained most from a static handbook (Control-NoAI). Overcontrolled individuals (low openness/extraversion) showed outcome-specific gains—especially increased self-efficacy—from a theory-driven AI coach (Trucey). Undercontrolled individuals (high neuroticism; low agreeableness/conscientiousness) showed minimal psychological or linguistic gains from any intervention, indicating many self-directed digital supports may not reach the most vulnerable users.
Key Points
- Experiment: between-subjects randomized study (N = 267) comparing:
- Trucey — theory-driven AI coach (GPT-4o-mini; Brett & Thompson negotiation framework embedded);
- Control-AI — general-purpose conversational AI (active control);
- Control-NoAI — static negotiation handbook (passive control).
- Sample splits: Trucey n=134, Control-AI n=66, Control-NoAI n=67.
- Personality segmentation: k-means clustering on participants’ Big Five (BFI-10) plus ARC typology produced three clusters labeled resilient, overcontrolled, undercontrolled.
- Outcomes measured pre/post: occupational self-efficacy (OSS-6), psychological empowerment (PEU), negotiation preparedness (willingness/fear), plus system usability/appropriateness.
- Interaction design (Trucey): five-stage workflow — scenario assignment, personality calibration (supervisor BFI), theory-guided advice, role-play rehearsal (supervisor persona), iterative feedback. Technical stack: GPT-4o-mini, few-shot prompting, embeddings for profile matching.
- Main empirical patterns:
- Resilient: largest psychological gains from Control-NoAI (handbook) — suggests high-resource users preserve autonomy and benefit from self-directed materials; Trucey increased some linguistic engagement (e.g., verbosity/accessibility) but didn’t outperform handbook on psychological measures.
- Overcontrolled: Trucey produced significant improvements in self-efficacy vs. Control-AI — the theory-driven scaffold acted as a “novelty bypass” enabling strategy uptake.
- Undercontrolled: minimal changes across conditions despite engagement — indicates a mismatch with self-guided or AI-mediated coaching and a need for more intensive/targeted interventions.
- Linguistic and adoption findings: personality clusters showed different linguistic engagement with AI (verbosity, accessibility) and different uptake of negotiation frameworks; effects were uneven across metrics.
Data & Methods
- Recruitment: Prolific, U.S. workers with workplace experience; diverse demographic/work-domain representation reported (see Table 1 in paper).
- Power: a priori power calculation targeting f^2 = 0.15, power = 0.90, α = .05; target n ≥ 226; final N = 267.
- Design: randomized, between-subjects; pre-task survey → 15–20 minute intervention → post-task survey.
- Measures:
- Big Five (BFI-10) for personality.
- Occupational self-efficacy (OSS-6), psychological empowerment (PEU), negotiation preparedness (fear/willingness).
- System usability and perceived appropriateness.
- Log data: chat transcripts for linguistic analysis; adoption of negotiation concepts coded.
- Analysis:
- k-means clustering on Big Five to form resilient/overcontrolled/undercontrolled groups (ARC-inspired).
- Comparison of change scores across conditions and clusters to identify heterogeneity.
- Linguistic feature analysis and framework-adoption coding to probe mechanisms.
Implications for AI Economics
- Heterogeneous returns to AI coaching: average treatment effects overstate benefits for some users and understate non-response for others. Economic evaluations (ROI, cost-effectiveness) must incorporate heterogeneous treatment effects by personality or readiness.
- Distributional consequences and equity risks:
- If firms adopt uniform AI coaching at scale, resilient workers may capture most gains (or do equally well with cheap handbooks), while undercontrolled workers—who may already be disadvantaged—receive little benefit, potentially widening wage/advancement gaps.
- Overcontrolled workers can benefit from targeted theory-driven AI, suggesting a role for segmented deployment to reduce disparities.
- Optimal resource allocation:
- Firms and policymakers should consider triage/adaptive allocation: inexpensive static materials for high-readiness workers; theory-driven AI scaffolds for intermediate-readiness workers; higher-touch human coaching or more intensive interventions for low-readiness (undercontrolled) workers.
- Cost-benefit analyses should compare marginal gains per dollar across modalities (handbook, theory-driven AI, general-purpose AI, human coaching) conditional on user type.
- Design and scaling trade-offs:
- Theory-grounded prompting (as in Trucey) can be more effective than general-purpose AI for specific subpopulations, implying that investing in domain-specific prompting and calibrated scaffolds can increase effectiveness per interaction.
- However, additional engineering and content costs are needed to implement adaptive systems that detect readiness and route users appropriately.
- Labor-market impacts:
- Effective negotiation coaching can alter bargaining behavior, potentially increasing wages for beneficiaries and changing wage dispersion. Heterogeneous uptake implies uneven effects on labor costs and internal pay dynamics.
- Firms may face strategic choices: enable broad low-cost gains (handbooks), selectively invest in AI coaching for targeted cohorts, or provide human coaching for those with the largest barriers.
- Research and policy recommendations:
- Economic evaluations, field trials, and audits should report heterogeneity by personality/skill/readiness; rely less on population averages.
- Implement stratified RCTs to estimate conditional average treatment effects and inform allocation policies.
- Consider safeguards to avoid exacerbating inequality: subsidized human coaching or hybrid approaches for low-readiness workers; transparency about who benefits most.
- Future work should model long-run career dynamics (compounding effects of negotiation success) and quantify firm-level wage bill implications of heterogeneous coaching uptake.
Summary recommendation for practitioners: Don’t assume one-size-fits-all. Use simple screening (personality/readiness proxies) to route users to the most cost-effective intervention—static materials, theory-driven AI, or human coaching—so firms maximize aggregate gains while minimizing distributional harm.
Assessment
Claims (8)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| We conducted a between-subjects experiment (N=267) comparing theory-driven AI (Trucey), general-purpose AI (Control-AI), and a traditional negotiation handbook (Control-NoAI). Decision Quality | mixed | high | effectiveness of coaching modalities (psychological and negotiation performance measures) |
n=267
1.0
|
| Participants were clustered into three profiles -- resilient, overcontrolled, and undercontrolled -- based on the Big-Five personality traits and ARC typology. Skill Acquisition | mixed | high | personality profile membership (resilient, overcontrolled, undercontrolled) |
n=267
1.0
|
| Resilient workers achieved broad psychological gains primarily from the handbook. Worker Satisfaction | positive | medium | psychological gains (broad, unspecified psychological measures) |
0.36
|
| Overcontrolled workers showed outcome-specific improvements with theory-driven AI. Decision Quality | positive | medium | outcome-specific improvements (unspecified in abstract; likely negotiation-relevant performance/psychological measures) |
0.36
|
| Undercontrolled workers exhibited minimal effects despite engaging with the frameworks. Worker Satisfaction | null_result | medium | measured intervention effects (minimal/no change across reported outcomes) |
0.36
|
| These patterns suggest personality as a predictor of readiness beyond stage-based tailoring: vulnerable users benefit from targeted rather than comprehensive interventions. Training Effectiveness | positive | medium | readiness/responsiveness to interventions (i.e., likelihood of benefit from targeted vs. comprehensive interventions) |
n=267
0.06
|
| Design implication: adaptive AI coaching systems should align support intensity with individual readiness, rather than assuming universal effectiveness. Training Effectiveness | positive | high | appropriateness of intervention intensity (design recommendation) |
0.1
|
| AI-driven conversational coaching is increasingly used to support workplace negotiation, yet prior work assumes uniform effectiveness across users. Adoption Rate | mixed | high | adoption/use of AI coaching in workplace negotiation |
0.3
|