AI agents often echo their human owners: autonomous Moltbook agents reproduce owners' linguistic style, topics and affect, and agents that more closely mirror owners are more likely to divulge owner-related personal information, raising privacy and governance concerns.

Behavioral Transfer in AI Agents: Evidence and Privacy Implications

Shilei Luo, Zhiqi Zhang, Hengchen Dai, Dennis Zhang · April 21, 2026

arxiv correlational medium evidence 7/10 relevance Source PDF

Across 10,659 Moltbook human-agent pairs, agents' posts systematically mirror their owners' topics, values, affect, and linguistic style—even without explicit configuration—and stronger owner-agent alignment is associated with increased public disclosure of owner-related personal information.

AI agents powered by large language models are increasingly acting on behalf of humans in social and economic environments. Prior research has focused on their task performance and effects on human outcomes, but less is known about the relationship between agents and the specific individuals who deploy them. We ask whether agents systematically reflect the behavioral characteristics of their human owners, functioning as behavioral extensions rather than producing generic outputs. We study this question using 10,659 matched human-agent pairs from Moltbook, a social media platform where each autonomous agent is publicly linked to its owner's Twitter/X account. By comparing agents' posts on Moltbook with their owners' Twitter/X activity across features spanning topics, values, affect, and linguistic style, we find systematic transfer between agents and their specific owners. This transfer persists among agents without explicit configuration, and pairs that align on one behavioral dimension tend to align on others. These patterns are consistent with transfer emerging through accumulated interaction between owners (or owners' computer environments) and their agents in everyday use. We further show that agents with stronger behavioral transfer are more likely to disclose owner-related personal information in public discourse, suggesting that the same owner-specific context that drives behavioral transfer may also create privacy risk during ordinary use. Taken together, our results indicate that AI agents do not simply generate content, but reflect owner-related context in ways that can propagate human behavioral heterogeneity into digital environments, with implications for privacy, platform design, and the governance of agentic systems.

Summary

Main Finding

AI agents deployed by individuals systematically inherit owner-specific behavioral characteristics across topics, values, affect, and linguistic style — even when owners do not explicitly configure the agents. This "behavioral transfer" is coherent across multiple dimensions and predicts privacy leakage: agents that more closely mirror their owners are more likely to disclose owner-related personal information in public agent posts.

Key Points

Empirical scale and setting
- Analysis uses 10,659 matched human-agent pairs from Moltbook (agents built on the OpenClaw framework) and the owners’ Twitter/X activity.
- Dataset: 44,588 Moltbook posts (from these agents) and 69,559 non-verification tweets (from matched humans).
Behavioral measurement
- Authors construct 43 text-based features in four dimensions: Topics (6), Values (7), Affect (12), and Style (18).
- Features include topic keyword rates, moral foundations, political ideology, sentiment/emotion metrics, and linguistic/style measures (e.g., TTR, pronoun rates).
Evidence of transfer
- Strong, robust positive associations between each agent and its specific owner across many features. Agents resemble their owners more than randomly matched users.
- Transfer remains when agents lack public bios/configuration, reducing the likelihood that purely deliberate owner-specified instructions explain the effect.
- Cross-dimension coherence: pairs that align strongly on one set of features tend to align on others — consistent with accumulated exposure rather than narrowly defined, dimension-specific configuration.
Privacy implications
- Using a validated LLM-as-judge pipeline, 34.6% of agents surfaced owner-related sensitive personal information in public posts that was not present in public agent configuration.
- A one-standard-deviation increase in a pair’s holistic transfer score is associated with a 1.32 percentage-point higher probability that the agent leaks private information.
- Results robust to simulation-based checks, classification noise, and tests intended to rule out automation/“puppet” account confounds.
Mechanism interpretation
- Two candidate channels: (i) explicit configuration (system prompts/character files), and (ii) accumulated interaction and local-environment context (persistent chats, access to files, tools).
- Empirical patterns (transfer without bios; cross-feature coherence) favour the accumulated interaction/environment explanation, though configuration may still contribute.

Data & Methods

Platforms and architecture
- OpenClaw: open-source agent framework enabling local deployment, persistent sessions, configuration via local character files (system prompts), and optional access to local tools/files.
- Moltbook: social platform where OpenClaw agents operate autonomously and are publicly linkable to owners’ Twitter/X accounts via verification tweets.
Sample construction and dates
- Moltbook data scraped Feb 2, 2026 (platform activity Jan 27–Feb 2, 2026); Twitter data collected Feb 7, 2026.
- Of ~1.6M registered Moltbook agents, 20,894 posted at least once; 17,745 of those linked Twitter handles; final matched sample after cleaning: 10,659 human-agent pairs.
- Verification tweets (standardized claim tweets) removed using a conservative keyword filter.
Feature construction
- 43 features computed separately for agents (Moltbook posts) and humans (Twitter tweets) and compared at the pair level using rank correlations and regression-based analyses.
- Topic, moral foundation, and emotion features derived from dictionaries; sentiment via VADER; political ideology scored via LLM (Gemini 2.5-Flash) and dictionary methods; style features via standard NLP metrics.
Identification strategy
- Main identification leverages matched pairs and cross-pair variation: if an owner uses more of feature X, the agent deployed by that owner should also display more of feature X if transfer occurs.
- Authors argue reverse causality is unlikely (agents did not shape prior human tweets); robustness checks and simulation analyses address alternative explanations (classification noise, scripted/puppet accounts).
- Tests to distinguish channels: (i) evaluate transfer among agents without public bios/configuration, and (ii) examine co-movement across distinct feature dimensions.
Privacy-detection pipeline
- LLM-as-judge method used to classify whether agent posts disclose owner-related personal information not present in public configuration.
- Pipeline validated and calibrated; results subjected to robustness and simulation checks.

Implications for AI Economics

New externality channel: agentic systems can propagate owner-specific heterogeneity into platform-level outcomes, creating a privacy externality that arises from ordinary agent behavior rather than adversarial extraction or third-party inference.
Market and platform design consequences
- Platforms that host agentic actors will see content heterogeneity tied to owner distributions — affecting demand dynamics, attention allocation, reputation formation, and welfare.
- Product design choices (defaults on tool/file access, how much persistent context agents retain, visibility of configuration) materially affect systemic privacy and information externalities.
Regulation and governance
- Existing privacy regulation (data minimization, informed consent, transparency) should consider agent-specific channels: limiting or auditing agents’ access to local context and constraining autonomous content-sharing may be warranted.
- Mandatory disclosure/labels for agent-generated content or requirements on platform-level agent auditing could mitigate downstream harms.
Information economics and firm strategy
- Firms deploying agents for customers (e.g., assistants negotiating, shopping, or communicating) must weigh utility gains against leakage risk; this creates demand for privacy-preserving agent designs and potential markets for certified private agents.
- Insurance, liability, and contract terms may evolve to cover harms from unintended disclosures by agents acting on users’ behalf.
Empirical analysis and modeling recommendations
- Economic models of online markets, matching, and platform competition should incorporate agent-level types and the possibility of behavior transfer that links owner traits to agent actions.
- Welfare analyses must account for both productivity gains from delegation and new information externalities (including targeted manipulation, reputation spillovers, and privacy harms).
Research and policy priorities
- Quantify long-run dynamics: how quickly do agents accumulate owner context, and can transfer be unlearned or bounded?
- Design and evaluate mitigation strategies: stronger defaults, local sandboxing, context-limited prompts, differential privacy for interaction histories, and transparent audit logs.
- Cross-platform and cross-model generalization: assess whether findings hold beyond OpenClaw/Moltbook and across LLM providers and agent frameworks.

Limitations and caveats - While the paper provides strong correlational evidence and plausible mechanisms favouring accumulated interaction, fully causal separation of configuration vs interaction channels requires experimental or longitudinal identification. - Authors assert reverse causality is structurally unlikely; note that some methodological descriptions indicate recent tweets were used without restricting strictly to pre-deployment tweets — future work should more tightly align timelines to rule out all temporal confounds.

Overall, the paper documents an economically consequential property of agentic AI: agents routinely carry owner-specific behavioral signals into public interaction, creating new privacy risks and systemic externalities that matter for platform design, regulation, and the economic analysis of AI deployment.

Assessment

Paper Typecorrelational Evidence Strengthmedium — The study uses a large sample (10,659 matched human-agent pairs) and multiple linguistic and behavioral features to show consistent, cross-dimensional alignment between owners and their agents, which supports robust correlational evidence; however, it is observational, platform-specific, and cannot establish causality or rule out alternative explanations (e.g., shared external inputs, selection into linking accounts, or agent training/initialization effects). Methods Rigormedium — Methods appear systematic (textual feature extraction across topics, values, affect, and style, pairwise comparisons, and analyses linking transfer strength to disclosure risk) and likely include reasonable robustness checks, but the study lacks causal identification, may suffer from selection and measurement biases, and depends on opaque details of agent configuration/training that limit stronger inferences. Sample10,659 matched human-agent pairs from Moltbook, a social platform where each autonomous agent is publicly linked to its owner's Twitter/X account; comparison uses owners' Twitter/X activity and agents' Moltbook posts across multiple derived linguistic and behavioral features (topics, values, affect, linguistic style); sample likely restricted to users who publicly link accounts and to the time window and languages present on the platform. Themeshuman_ai_collab governance GeneralizabilityPlatform-specific (Moltbook) — findings may not transfer to other agent platforms or closed/enterprise deployments, Self-selection bias — users who publicly link agents to Twitter/X are likely non-representative (tech-savvy, privacy attitudes, use cases), Language and cultural bias — results may reflect dominant languages/cultures on the platform, Agent heterogeneity — results depend on the kinds of LLMs, agent architectures, and default configurations used on Moltbook, Temporal/contextual dependence — behaviour transfer may vary over time or with changes in agent software or user practices

Claims (7)

Claim	Direction	Confidence	Outcome	Details
We study this question using 10,659 matched human-agent pairs from Moltbook, a social media platform where each autonomous agent is publicly linked to its owner's Twitter/X account. Other	null_result	high	matched_human-agent_pairs_count	n=10659 0.5
We find systematic transfer between agents and their specific owners across features spanning topics, values, affect, and linguistic style. Ai Safety And Ethics	positive	high	behavioral_alignment_between_agent_and_owner_across_topics_values_affect_style	n=10659 0.3
This transfer persists among agents without explicit configuration. Ai Safety And Ethics	positive	medium	behavioral_alignment_in_unconfigured_agents	0.18
Pairs that align on one behavioral dimension tend to align on others. Ai Safety And Ethics	positive	high	cross-dimensional_alignment_correlation_between_agent_and_owner	n=10659 0.3
These patterns are consistent with transfer emerging through accumulated interaction between owners (or owners' computer environments) and their agents in everyday use. Ai Safety And Ethics	positive	medium	inferred_mechanism_of_transfer (accumulated_interaction)	n=10659 0.03
Agents with stronger behavioral transfer are more likely to disclose owner-related personal information in public discourse, suggesting that the same owner-specific context that drives behavioral transfer may also create privacy risk during ordinary use. Ai Safety And Ethics	positive	high	likelihood_of_disclosing_owner_personal_information_by_agent	n=10659 0.3
AI agents do not simply generate content, but reflect owner-related context in ways that can propagate human behavioral heterogeneity into digital environments, with implications for privacy, platform design, and the governance of agentic systems. Ai Safety And Ethics	positive	high	propagation_of_owner_behavioral_heterogeneity_into_digital_environments (implication)	n=10659 0.05