Personality determines who benefits from AI coaching: a theory-driven conversational coach helps overcontrolled negotiators, resilient users improve more from a traditional handbook, and undercontrolled users gain little, suggesting AI coaching should adapt support intensity to individual readiness.

Not My Truce: Personality Differences in AI-Mediated Workplace Negotiation

Veda Duddu, Jash Rajesh Parekh, Andy Mao, Hanyi Min, Ziang Xiao, Vedant Das Swain, Koustuv Saha · April 01, 2026

arxiv rct medium evidence 7/10 relevance Source PDF

Personality profiles moderate coaching effectiveness: resilient participants gained most from a traditional handbook, overcontrolled participants saw targeted benefits from the theory-driven conversational AI, and undercontrolled participants showed minimal improvement across interventions.

AI-driven conversational coaching is increasingly used to support workplace negotiation, yet prior work assumes uniform effectiveness across users. We challenge this assumption by examining how individual differences, particularly personality traits, moderate coaching outcomes. We conducted a between-subjects experiment (N=267) comparing theory-driven AI (Trucey), general-purpose AI (Control-AI), and a traditional negotiation handbook (Control-NoAI). Participants were clustered into three profiles -- resilient, overcontrolled, and undercontrolled -- based on the Big-Five personality traits and ARC typology. Resilient workers achieved broad psychological gains primarily from the handbook, overcontrolled workers showed outcome-specific improvements with theory-driven AI, and undercontrolled workers exhibited minimal effects despite engaging with the frameworks. These patterns suggest personality as a predictor of readiness beyond stage-based tailoring: vulnerable users benefit from targeted rather than comprehensive interventions. The study advances understanding of personality-determined intervention prerequisites and highlights design implications for adaptive AI coaching systems that align support intensity with individual readiness, rather than assuming universal effectiveness.

Summary

Main Finding

AI negotiation coaching is not universally effective: personality profiles predict who benefits. Resilient individuals (high extraversion/agreeableness/conscientiousness/openness; low neuroticism) gained most from a static handbook (Control-NoAI). Overcontrolled individuals (low openness/extraversion) showed outcome-specific gains—especially increased self-efficacy—from a theory-driven AI coach (Trucey). Undercontrolled individuals (high neuroticism; low agreeableness/conscientiousness) showed minimal psychological or linguistic gains from any intervention, indicating many self-directed digital supports may not reach the most vulnerable users.

Key Points

Experiment: between-subjects randomized study (N = 267) comparing:
- Trucey — theory-driven AI coach (GPT-4o-mini; Brett & Thompson negotiation framework embedded);
- Control-AI — general-purpose conversational AI (active control);
- Control-NoAI — static negotiation handbook (passive control).
- Sample splits: Trucey n=134, Control-AI n=66, Control-NoAI n=67.
Personality segmentation: k-means clustering on participants’ Big Five (BFI-10) plus ARC typology produced three clusters labeled resilient, overcontrolled, undercontrolled.
Outcomes measured pre/post: occupational self-efficacy (OSS-6), psychological empowerment (PEU), negotiation preparedness (willingness/fear), plus system usability/appropriateness.
Interaction design (Trucey): five-stage workflow — scenario assignment, personality calibration (supervisor BFI), theory-guided advice, role-play rehearsal (supervisor persona), iterative feedback. Technical stack: GPT-4o-mini, few-shot prompting, embeddings for profile matching.
Main empirical patterns:
- Resilient: largest psychological gains from Control-NoAI (handbook) — suggests high-resource users preserve autonomy and benefit from self-directed materials; Trucey increased some linguistic engagement (e.g., verbosity/accessibility) but didn’t outperform handbook on psychological measures.
- Overcontrolled: Trucey produced significant improvements in self-efficacy vs. Control-AI — the theory-driven scaffold acted as a “novelty bypass” enabling strategy uptake.
- Undercontrolled: minimal changes across conditions despite engagement — indicates a mismatch with self-guided or AI-mediated coaching and a need for more intensive/targeted interventions.
Linguistic and adoption findings: personality clusters showed different linguistic engagement with AI (verbosity, accessibility) and different uptake of negotiation frameworks; effects were uneven across metrics.

Data & Methods

Recruitment: Prolific, U.S. workers with workplace experience; diverse demographic/work-domain representation reported (see Table 1 in paper).
Power: a priori power calculation targeting f^2 = 0.15, power = 0.90, α = .05; target n ≥ 226; final N = 267.
Design: randomized, between-subjects; pre-task survey → 15–20 minute intervention → post-task survey.
Measures:
- Big Five (BFI-10) for personality.
- Occupational self-efficacy (OSS-6), psychological empowerment (PEU), negotiation preparedness (fear/willingness).
- System usability and perceived appropriateness.
- Log data: chat transcripts for linguistic analysis; adoption of negotiation concepts coded.
Analysis:
- k-means clustering on Big Five to form resilient/overcontrolled/undercontrolled groups (ARC-inspired).
- Comparison of change scores across conditions and clusters to identify heterogeneity.
- Linguistic feature analysis and framework-adoption coding to probe mechanisms.

Implications for AI Economics

Heterogeneous returns to AI coaching: average treatment effects overstate benefits for some users and understate non-response for others. Economic evaluations (ROI, cost-effectiveness) must incorporate heterogeneous treatment effects by personality or readiness.
Distributional consequences and equity risks:
- If firms adopt uniform AI coaching at scale, resilient workers may capture most gains (or do equally well with cheap handbooks), while undercontrolled workers—who may already be disadvantaged—receive little benefit, potentially widening wage/advancement gaps.
- Overcontrolled workers can benefit from targeted theory-driven AI, suggesting a role for segmented deployment to reduce disparities.
Optimal resource allocation:
- Firms and policymakers should consider triage/adaptive allocation: inexpensive static materials for high-readiness workers; theory-driven AI scaffolds for intermediate-readiness workers; higher-touch human coaching or more intensive interventions for low-readiness (undercontrolled) workers.
- Cost-benefit analyses should compare marginal gains per dollar across modalities (handbook, theory-driven AI, general-purpose AI, human coaching) conditional on user type.
Design and scaling trade-offs:
- Theory-grounded prompting (as in Trucey) can be more effective than general-purpose AI for specific subpopulations, implying that investing in domain-specific prompting and calibrated scaffolds can increase effectiveness per interaction.
- However, additional engineering and content costs are needed to implement adaptive systems that detect readiness and route users appropriately.
Labor-market impacts:
- Effective negotiation coaching can alter bargaining behavior, potentially increasing wages for beneficiaries and changing wage dispersion. Heterogeneous uptake implies uneven effects on labor costs and internal pay dynamics.
- Firms may face strategic choices: enable broad low-cost gains (handbooks), selectively invest in AI coaching for targeted cohorts, or provide human coaching for those with the largest barriers.
Research and policy recommendations:
- Economic evaluations, field trials, and audits should report heterogeneity by personality/skill/readiness; rely less on population averages.
- Implement stratified RCTs to estimate conditional average treatment effects and inform allocation policies.
- Consider safeguards to avoid exacerbating inequality: subsidized human coaching or hybrid approaches for low-readiness workers; transparency about who benefits most.
- Future work should model long-run career dynamics (compounding effects of negotiation success) and quantify firm-level wage bill implications of heterogeneous coaching uptake.

Summary recommendation for practitioners: Don’t assume one-size-fits-all. Use simple screening (personality/readiness proxies) to route users to the most cost-effective intervention—static materials, theory-driven AI, or human coaching—so firms maximize aggregate gains while minimizing distributional harm.

Assessment

Paper Typerct Evidence Strengthmedium — Random assignment provides credible internal validity for treatment effects, but the sample size is moderate, details on sample representativeness, pre-registration, blinding, and robustness checks are not reported here, and effects are measured in a single-task negotiation setting which limits external validity. Methods Rigormedium — Use of an RCT and theoretically grounded personality clustering are strengths, but potential concerns include post-hoc clustering decisions, unreported balance and attrition diagnostics, possible reliance on self-reported psychological outcomes, and limited information on implementation fidelity and statistical power for subgroup analyses. SampleN=267 adult participants randomized to three arms (Trucey theory-driven conversational AI, general-purpose AI, and negotiation handbook); participants were profiled into three personality clusters (resilient, overcontrolled, undercontrolled) using Big Five measures and ARC typology; outcomes include negotiation performance and psychological measures of preparedness/confidence (measured shortly after intervention). Themeshuman_ai_collab skills_training labor_markets IdentificationRandomized between-subjects experiment assigning N=267 participants to one of three conditions (theory-driven AI 'Trucey', general-purpose AI, or a traditional handbook); causal effects of the interventions are identified by random assignment, with moderation tested via pre-treatment clustering on Big Five traits/ARC typology and interaction analyses between treatment and personality profile. GeneralizabilityModerate sample size and unclear representativeness (likely convenience/online sample rather than representative workforce), Single-context: simulated negotiation task may not reflect real-world workplace negotiations or long-term outcomes, Specific AI system (Trucey) and one general-purpose AI; results may not generalize to other coaching models or interfaces, Short-term follow-up; unclear persistence of effects, Potential cultural/geographic limits if sample concentrated in one region

Claims (8)

Claim	Direction	Confidence	Outcome	Details
We conducted a between-subjects experiment (N=267) comparing theory-driven AI (Trucey), general-purpose AI (Control-AI), and a traditional negotiation handbook (Control-NoAI). Decision Quality	mixed	high	effectiveness of coaching modalities (psychological and negotiation performance measures)	n=267 1.0
Participants were clustered into three profiles -- resilient, overcontrolled, and undercontrolled -- based on the Big-Five personality traits and ARC typology. Skill Acquisition	mixed	high	personality profile membership (resilient, overcontrolled, undercontrolled)	n=267 1.0
Resilient workers achieved broad psychological gains primarily from the handbook. Worker Satisfaction	positive	medium	psychological gains (broad, unspecified psychological measures)	0.36
Overcontrolled workers showed outcome-specific improvements with theory-driven AI. Decision Quality	positive	medium	outcome-specific improvements (unspecified in abstract; likely negotiation-relevant performance/psychological measures)	0.36
Undercontrolled workers exhibited minimal effects despite engaging with the frameworks. Worker Satisfaction	null_result	medium	measured intervention effects (minimal/no change across reported outcomes)	0.36
These patterns suggest personality as a predictor of readiness beyond stage-based tailoring: vulnerable users benefit from targeted rather than comprehensive interventions. Training Effectiveness	positive	medium	readiness/responsiveness to interventions (i.e., likelihood of benefit from targeted vs. comprehensive interventions)	n=267 0.06
Design implication: adaptive AI coaching systems should align support intensity with individual readiness, rather than assuming universal effectiveness. Training Effectiveness	positive	high	appropriateness of intervention intensity (design recommendation)	0.1
AI-driven conversational coaching is increasingly used to support workplace negotiation, yet prior work assumes uniform effectiveness across users. Adoption Rate	mixed	high	adoption/use of AI coaching in workplace negotiation	0.3