Platform-embedded LLM assistants on Ctrip are adopted most by older, female and highly engaged users and act as a complementary exploratory interface rather than replacing search; chats appear early in purchase journeys and focus on discovery questions.
This paper provides some of the first large-scale descriptive evidence on how consumers adopt and use platform-embedded shopping AI in e-commerce. Using data on 31 million users of Ctrip, China's largest online travel platform, we study "Wendao," an LLM-based AI assistant integrated into the platform. We document three empirical regularities. First, adoption is highest among older consumers, female users, and highly engaged existing users, reversing the younger, male-dominated profile commonly documented for general-purpose AI tools. Second, AI chat appears in the same broad phase of the purchase journey as traditional search and well before order placement; among journeys containing both chat and search, the most common pattern is interleaving, with users moving back and forth between the two modalities. Third, consumers disproportionately use the assistant for exploratory, hard-to-keyword tasks: attraction queries account for 42% of observed chat requests, and chat intent varies systematically with both the timing of chat relative to search and the category of products later purchased within the same journey. These findings suggest that embedded shopping AI functions less as a substitute for conventional search than as a complementary interface for exploratory product discovery in e-commerce.
Summary
Main Finding
Platform-embedded shopping AI on Ctrip (Wendao) is adopted primarily by older, female, and highly engaged users, appears during the active search/browsing phase well before booking, and is used disproportionately for exploratory, hard-to-keyword tasks (especially attraction queries). Chat functions more as a complement to traditional search—often interleaved with it—rather than a simple substitute for keyword search.
Key Points
-
Sample & adoption
- Base sample: 31,142,353 unique users who logged in July 10–24, 2025.
- Adopters (initiated ≥1 chat by July 10, 2025): 1,904,368 users (6.1%).
- Only 188,647 adopters had observable chat transcripts in the analysis window.
-
Adoption profile (contrasts with general-purpose AI)
- Adoption increases with age (10.7% for 50+ vs. 4.6% for <24).
- Higher adoption among female users (7.1% vs. 6.0% male in gender-known subsample).
- Much higher adoption among loyal/engaged users (e.g., 12.8% for Black Diamond VIP).
- Engagement variables (login minutes, sessions, session duration) are the strongest predictors; adding engagement to demographics increased random-forest AUC from 0.635 → 0.962.
- Caveat: engagement may reflect greater exposure to the assistant, not causal propensity.
-
When chat occurs in the journey
- Journey construction: stitched login sessions (split when gaps ≥48 hours; split between order-containing sessions if inactivity gap >4 hours).
- Identified ~253 million journeys in June 10–Aug 15, 2025 window.
- Median chat event occurs at ~47% of journey progress; median hotel search at 55%; median order at 88% → chat clusters with search/clicks and precedes orders.
- Chat is predominantly pre-purchase (discovery/refinement phase), not post-booking support.
-
Modalities and patterns of interaction
- Chat-only journeys: 42% of chat journeys (shorter, lower conversion: median 19 hours; 18% conversion).
- Among all chat journeys: chat-between-search (interleaving) 31%, chat-before-search 15%, chat-after-search 12%.
- Conditional on journeys that contain both chat and search, interleaving dominates (53% interleaved; 26% chat-before; 21% chat-after).
- Interleaved journeys are longer, have many more searches (median ~150 vs ~21–24), and higher conversion (43% vs 39% chat-before and 31% chat-after).
-
What users ask the assistant
- Attraction-related queries are the largest single intent: 42% of observed chat requests.
- Chat intent aligns with downstream purchases: e.g., hotel-related chats are more likely followed by hotel orders; attraction chats by vacation/attraction orders.
- Interpretation: users delegate exploratory, difficult-to-keyword tasks to chat.
Data & Methods
-
Data sources (joined at anonymized user ID)
- User-level demographics & account attributes (age group, gender, city tier, device type, tenure, VIP level).
- Session & hotel-search logs (timestamped search queries, clicks, page views).
- Chat logs (full text of user requests and AI replies; platform-inferred intent categories).
- Booking/orders (timestamps, product categories).
-
Time windows & event definitions
- Base login window: July 10–24, 2025; behavioral window expanded to June 10–Aug 15, 2025 to capture episodic travel-planning.
- Journey construction: stitch sessions unless gap ≥48 hours; split between order-containing sessions with inactivity gap >4 hours. Robustness checks with alternative definitions (inactivity-only, 8/12-hour thresholds) reported in appendices.
-
Empirical approach
- Descriptive statistics and kernel density estimates to place chat relative to search and orders.
- Regression models (linear probability, logit, probit) for adoption correlates; marginal effects plotted.
- Machine learning: random forest for variable importance and classification (AUC reported).
- Journey-level and session-level classification to study ordering patterns (chat-before, chat-after, chat-between, chat-only).
- Topic/intention linkage using platform’s internal intent classifier; cross-tabulation of chat intent vs. downstream purchase categories.
- Robustness analyses: session-level checks, alternative journey definitions, and discussion of collinearity (login minutes × sessions × session duration).
-
Important methodological caveats
- Exposure vs. propensity: highly engaged users see the assistant more often; observed adoption gradients may partly reflect visibility.
- Selection vs. causal inference: descriptive design cannot separate selection into chat use from causal effects of AI access—authors highlight need for randomized visibility or access for causal claims.
- Mechanical interleaving bias: longer journeys mechanically allow more interleaving; authors mitigate this with session-level analyses.
Implications for AI Economics
-
Demand-side implications
- Embedded assistants can broaden AI adoption to older and less tech-savvy segments by reducing keyword/search interface frictions—changing the typical “early-adopter” demographics seen for general-purpose AI.
- Adoption concentrated among highly engaged users suggests incumbency effects: platforms with stronger user relationships and more engaged bases are likely to see faster uptake and greater impact.
-
Product design & platform strategy
- Design assistants to prioritize exploratory discovery and preference articulation (attractions, planning, comparison) rather than only transactional automation.
- Treat chat as a complement to search—optimizing transitions between chat and ranked listings, and surfacing links from conversational results to product detail pages.
- Monetization and ranking systems may need to adapt: interleaving implies AI can influence downstream search behavior and conversions, affecting ad impressions, sponsored listings, and recommender exposure.
-
Measurement & research design guidance
- Empirical work should instrument or randomize exposure/visibility (entry-point placement) to disentangle selection from treatment effects.
- Field experiments must account for the assistant’s potential to change users’ use of traditional search (spillovers, modality substitution/complementarity).
- Heterogeneous effects by tenure and loyalty suggest experiments should stratify by engagement tier and demographic groups.
-
Market structure & policy considerations
- Embedded assistants could strengthen incumbent platforms’ advantages by improving retention/engagement for existing users and increasing switching costs.
- Regulators and competition researchers should monitor whether assistants systematically route users to platform-owned inventory or biased rankings, given strong alignment between chat intent and downstream purchases.
-
Welfare and firm-side productivity
- For consumers: reduced search frictions and better articulation of preferences may raise consumer surplus, especially for those less comfortable with keyword search.
- For firms: potential increases in conversion (not uniform—interleaved, engaged journeys show higher conversion; chat-only lower conversion) and shifts in user journey dynamics affecting marketing ROI.
- Future work should quantify welfare gains, advertising/merchant revenue impacts, and cross-platform competitive effects using causal designs.
Overall, the paper provides large-scale, detailed descriptive evidence that shopping-specific, platform-embedded LLM assistants act largely as exploratory discovery tools used by older and more loyal users, with important implications for platform strategy, measurement, and regulation in AI-enabled e-commerce.
Assessment
Claims (7)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| The analysis uses data on 31 million users of Ctrip, China's largest online travel platform, to study "Wendao," an LLM-based AI assistant integrated into the platform. Other | null_result | high | other |
n=31000000
0.3
|
| Adoption of the embedded shopping AI is highest among older consumers, female users, and highly engaged existing users, reversing the younger, male-dominated profile commonly documented for general-purpose AI tools. Adoption Rate | positive | high | adoption_rate |
n=31000000
0.18
|
| AI chat appears in the same broad phase of the purchase journey as traditional search and well before order placement. Task Allocation | positive | high | task_allocation |
0.18
|
| Among journeys containing both chat and search, the most common pattern is interleaving, with users moving back and forth between the two modalities. Task Allocation | positive | high | task_allocation |
0.18
|
| Consumers disproportionately use the assistant for exploratory, hard-to-keyword tasks: attraction queries account for 42% of observed chat requests. Task Allocation | positive | high | task_allocation |
42%
0.18
|
| Chat intent varies systematically with both the timing of chat relative to search and the category of products later purchased within the same journey. Task Allocation | mixed | high | task_allocation |
0.18
|
| Embedded shopping AI functions less as a substitute for conventional search than as a complementary interface for exploratory product discovery in e-commerce. Task Allocation | positive | high | task_allocation |
n=31000000
0.18
|