AI agents often echo their human owners: autonomous Moltbook agents reproduce owners' linguistic style, topics and affect, and agents that more closely mirror owners are more likely to divulge owner-related personal information, raising privacy and governance concerns.
AI agents powered by large language models are increasingly acting on behalf of humans in social and economic environments. Prior research has focused on their task performance and effects on human outcomes, but less is known about the relationship between agents and the specific individuals who deploy them. We ask whether agents systematically reflect the behavioral characteristics of their human owners, functioning as behavioral extensions rather than producing generic outputs. We study this question using 10,659 matched human-agent pairs from Moltbook, a social media platform where each autonomous agent is publicly linked to its owner's Twitter/X account. By comparing agents' posts on Moltbook with their owners' Twitter/X activity across features spanning topics, values, affect, and linguistic style, we find systematic transfer between agents and their specific owners. This transfer persists among agents without explicit configuration, and pairs that align on one behavioral dimension tend to align on others. These patterns are consistent with transfer emerging through accumulated interaction between owners (or owners' computer environments) and their agents in everyday use. We further show that agents with stronger behavioral transfer are more likely to disclose owner-related personal information in public discourse, suggesting that the same owner-specific context that drives behavioral transfer may also create privacy risk during ordinary use. Taken together, our results indicate that AI agents do not simply generate content, but reflect owner-related context in ways that can propagate human behavioral heterogeneity into digital environments, with implications for privacy, platform design, and the governance of agentic systems.
Summary
Main Finding
AI agents deployed by individuals systematically inherit owner-specific behavioral characteristics across topics, values, affect, and linguistic style — even when owners do not explicitly configure the agents. This "behavioral transfer" is coherent across multiple dimensions and predicts privacy leakage: agents that more closely mirror their owners are more likely to disclose owner-related personal information in public agent posts.
Key Points
- Empirical scale and setting
- Analysis uses 10,659 matched human-agent pairs from Moltbook (agents built on the OpenClaw framework) and the owners’ Twitter/X activity.
- Dataset: 44,588 Moltbook posts (from these agents) and 69,559 non-verification tweets (from matched humans).
- Behavioral measurement
- Authors construct 43 text-based features in four dimensions: Topics (6), Values (7), Affect (12), and Style (18).
- Features include topic keyword rates, moral foundations, political ideology, sentiment/emotion metrics, and linguistic/style measures (e.g., TTR, pronoun rates).
- Evidence of transfer
- Strong, robust positive associations between each agent and its specific owner across many features. Agents resemble their owners more than randomly matched users.
- Transfer remains when agents lack public bios/configuration, reducing the likelihood that purely deliberate owner-specified instructions explain the effect.
- Cross-dimension coherence: pairs that align strongly on one set of features tend to align on others — consistent with accumulated exposure rather than narrowly defined, dimension-specific configuration.
- Privacy implications
- Using a validated LLM-as-judge pipeline, 34.6% of agents surfaced owner-related sensitive personal information in public posts that was not present in public agent configuration.
- A one-standard-deviation increase in a pair’s holistic transfer score is associated with a 1.32 percentage-point higher probability that the agent leaks private information.
- Results robust to simulation-based checks, classification noise, and tests intended to rule out automation/“puppet” account confounds.
- Mechanism interpretation
- Two candidate channels: (i) explicit configuration (system prompts/character files), and (ii) accumulated interaction and local-environment context (persistent chats, access to files, tools).
- Empirical patterns (transfer without bios; cross-feature coherence) favour the accumulated interaction/environment explanation, though configuration may still contribute.
Data & Methods
- Platforms and architecture
- OpenClaw: open-source agent framework enabling local deployment, persistent sessions, configuration via local character files (system prompts), and optional access to local tools/files.
- Moltbook: social platform where OpenClaw agents operate autonomously and are publicly linkable to owners’ Twitter/X accounts via verification tweets.
- Sample construction and dates
- Moltbook data scraped Feb 2, 2026 (platform activity Jan 27–Feb 2, 2026); Twitter data collected Feb 7, 2026.
- Of ~1.6M registered Moltbook agents, 20,894 posted at least once; 17,745 of those linked Twitter handles; final matched sample after cleaning: 10,659 human-agent pairs.
- Verification tweets (standardized claim tweets) removed using a conservative keyword filter.
- Feature construction
- 43 features computed separately for agents (Moltbook posts) and humans (Twitter tweets) and compared at the pair level using rank correlations and regression-based analyses.
- Topic, moral foundation, and emotion features derived from dictionaries; sentiment via VADER; political ideology scored via LLM (Gemini 2.5-Flash) and dictionary methods; style features via standard NLP metrics.
- Identification strategy
- Main identification leverages matched pairs and cross-pair variation: if an owner uses more of feature X, the agent deployed by that owner should also display more of feature X if transfer occurs.
- Authors argue reverse causality is unlikely (agents did not shape prior human tweets); robustness checks and simulation analyses address alternative explanations (classification noise, scripted/puppet accounts).
- Tests to distinguish channels: (i) evaluate transfer among agents without public bios/configuration, and (ii) examine co-movement across distinct feature dimensions.
- Privacy-detection pipeline
- LLM-as-judge method used to classify whether agent posts disclose owner-related personal information not present in public configuration.
- Pipeline validated and calibrated; results subjected to robustness and simulation checks.
Implications for AI Economics
- New externality channel: agentic systems can propagate owner-specific heterogeneity into platform-level outcomes, creating a privacy externality that arises from ordinary agent behavior rather than adversarial extraction or third-party inference.
- Market and platform design consequences
- Platforms that host agentic actors will see content heterogeneity tied to owner distributions — affecting demand dynamics, attention allocation, reputation formation, and welfare.
- Product design choices (defaults on tool/file access, how much persistent context agents retain, visibility of configuration) materially affect systemic privacy and information externalities.
- Regulation and governance
- Existing privacy regulation (data minimization, informed consent, transparency) should consider agent-specific channels: limiting or auditing agents’ access to local context and constraining autonomous content-sharing may be warranted.
- Mandatory disclosure/labels for agent-generated content or requirements on platform-level agent auditing could mitigate downstream harms.
- Information economics and firm strategy
- Firms deploying agents for customers (e.g., assistants negotiating, shopping, or communicating) must weigh utility gains against leakage risk; this creates demand for privacy-preserving agent designs and potential markets for certified private agents.
- Insurance, liability, and contract terms may evolve to cover harms from unintended disclosures by agents acting on users’ behalf.
- Empirical analysis and modeling recommendations
- Economic models of online markets, matching, and platform competition should incorporate agent-level types and the possibility of behavior transfer that links owner traits to agent actions.
- Welfare analyses must account for both productivity gains from delegation and new information externalities (including targeted manipulation, reputation spillovers, and privacy harms).
- Research and policy priorities
- Quantify long-run dynamics: how quickly do agents accumulate owner context, and can transfer be unlearned or bounded?
- Design and evaluate mitigation strategies: stronger defaults, local sandboxing, context-limited prompts, differential privacy for interaction histories, and transparent audit logs.
- Cross-platform and cross-model generalization: assess whether findings hold beyond OpenClaw/Moltbook and across LLM providers and agent frameworks.
Limitations and caveats - While the paper provides strong correlational evidence and plausible mechanisms favouring accumulated interaction, fully causal separation of configuration vs interaction channels requires experimental or longitudinal identification. - Authors assert reverse causality is structurally unlikely; note that some methodological descriptions indicate recent tweets were used without restricting strictly to pre-deployment tweets — future work should more tightly align timelines to rule out all temporal confounds.
Overall, the paper documents an economically consequential property of agentic AI: agents routinely carry owner-specific behavioral signals into public interaction, creating new privacy risks and systemic externalities that matter for platform design, regulation, and the economic analysis of AI deployment.
Assessment
Claims (7)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| We study this question using 10,659 matched human-agent pairs from Moltbook, a social media platform where each autonomous agent is publicly linked to its owner's Twitter/X account. Other | null_result | high | matched_human-agent_pairs_count |
n=10659
0.5
|
| We find systematic transfer between agents and their specific owners across features spanning topics, values, affect, and linguistic style. Ai Safety And Ethics | positive | high | behavioral_alignment_between_agent_and_owner_across_topics_values_affect_style |
n=10659
0.3
|
| This transfer persists among agents without explicit configuration. Ai Safety And Ethics | positive | medium | behavioral_alignment_in_unconfigured_agents |
0.18
|
| Pairs that align on one behavioral dimension tend to align on others. Ai Safety And Ethics | positive | high | cross-dimensional_alignment_correlation_between_agent_and_owner |
n=10659
0.3
|
| These patterns are consistent with transfer emerging through accumulated interaction between owners (or owners' computer environments) and their agents in everyday use. Ai Safety And Ethics | positive | medium | inferred_mechanism_of_transfer (accumulated_interaction) |
n=10659
0.03
|
| Agents with stronger behavioral transfer are more likely to disclose owner-related personal information in public discourse, suggesting that the same owner-specific context that drives behavioral transfer may also create privacy risk during ordinary use. Ai Safety And Ethics | positive | high | likelihood_of_disclosing_owner_personal_information_by_agent |
n=10659
0.3
|
| AI agents do not simply generate content, but reflect owner-related context in ways that can propagate human behavioral heterogeneity into digital environments, with implications for privacy, platform design, and the governance of agentic systems. Ai Safety And Ethics | positive | high | propagation_of_owner_behavioral_heterogeneity_into_digital_environments (implication) |
n=10659
0.05
|