Academic research has barely touched NLP for bank marketing: only 8 of 109 peer‑reviewed studies directly examine it, and the literature clusters on retention rather than acquisition, personalization or external-data integration—pointing to large unrealised economic and managerial opportunities.

Natural language processing in bank marketing: a systematic review and strategic gap analysis

Christopher Gerling, Stefan Lessmann · March 11, 2026 · International Journal of Bank Marketing

openalex review_meta n/a evidence 7/10 relevance DOI Source PDF

A systematic PRISMA review of 109 peer‑reviewed articles (2014–2024) finds NLP in bank marketing is severely under‑studied—only 8 papers directly address it—with most existing work concentrated on customer retention and notable gaps in acquisition, personalization, external text sources, and operational use cases.

Purpose Natural language processing (NLP), a subfield of artificial intelligence, allows organizations to gain insights from unstructured text, such as e-mails, documents and social media posts. The automated interpretation of human language holds considerable potential to sharpen marketing strategies, deepen customer engagement and unlock new value. Yet academic knowledge on NLP in the context of bank marketing remains scattered. This paper consolidates that knowledge and identifies research gaps, with particular attention to where NLP can be integrated into the customer journey and operational excellence. Design/methodology/approach Following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses protocol, we screened peer-reviewed journal articles published between 2014 and 2024 (n = 109). We then conducted a structured review of analytical marketing in banking and NLP applications in general marketing. Finally, we used sentence-transformer embeddings and uniform manifold approximation and projection to visualize the thematic landscape and identify under-explored areas. Findings Only 8 papers study NLP within bank marketing; 74 additional papers examine NLP in marketing more generally, and a further 27 explore broader marketing applications in banking. The existing literature concentrates primarily on customer retention, whereas other areas, including customer acquisition, personalized engagement and the use of external text data, receive comparatively little attention. Originality/value This study provides one of the first systematic reviews focused specifically on NLP in bank marketing. By aligning prior research with the customer journey and the marketing mix, it offers a structured reference for researchers and practitioners interested in applying NLP for growth, improved customer experience and innovation in the banking sector.

Summary

Main Finding

A systematic review (PRISMA protocol) of 109 peer‑reviewed articles (2014–2024) finds that NLP applications in bank marketing are severely under‑studied: only 8 papers directly examine NLP in bank marketing, while 74 study NLP in marketing broadly and 27 study marketing in banking without NLP. Existing work concentrates on customer retention; customer acquisition, personalized engagement, use of external text data, and operational excellence remain important gaps.

Key Points

Scope and sample: 109 peer‑reviewed journal articles published 2014–2024 were screened and reviewed.
Coverage breakdown: 8 papers on NLP in bank marketing; 74 on NLP in marketing (general); 27 on broader marketing in banking (non‑NLP).
Thematic concentration: literature is clustered around customer retention (e.g., churn prediction, complaint handling, relationship management).
Under‑explored areas: customer acquisition, personalization at scale, external text sources (social media, news, public reviews), operational process improvement, and cross‑channel integration.
Novelty of the study: one of the first systematic reviews focused specifically on NLP in bank marketing; organizes findings along the customer journey and the marketing mix to provide a practical taxonomy.
Visualization/analysis tools: sentence‑transformer embeddings to encode article text and UMAP (Uniform Manifold Approximation and Projection) to map the thematic landscape and identify sparsely populated research areas.

Data & Methods

Literature protocol: Preferred Reporting Items for Systematic Reviews and Meta‑Analyses (PRISMA).
Selection: peer‑reviewed journal articles between 2014 and 2024; final sample n = 109.
Analytical approach:
- Structured review of two overlapping domains: analytical marketing in banking and NLP applications in marketing.
- NLP embedding technique: sentence‑transformer models to produce dense vector representations of article abstracts/text.
- Dimensionality reduction/visualization: UMAP to project embeddings into a low‑dimensional thematic map, enabling cluster identification and gap detection.
Outcome measures: topical coverage by customer journey stage (acquisition, engagement, retention, operational excellence) and by elements of the marketing mix.

Implications for AI Economics

Market potential and diffusion
- Limited applied research on NLP for acquisition and personalization suggests unrealized value in banking: NLP could enable more efficient, targeted customer acquisition and cross‑sell, potentially lowering customer‑finding costs and increasing lifetime value.
- The concentration on retention reflects earlier, lower‑risk adoption; moving into acquisition/personalization could accelerate competitive dynamics and spur investment in NLP capabilities.
Productivity and organizational complementarities
- Realizing NLP value requires organizational investments (data pipelines, model deployment, integration with CRM), implying complementarity between AI tools and managerial/IT capabilities. Returns depend on these complementarities—an important consideration for adoption models and firm‑level productivity studies.
Data externalities, privacy, and regulation
- Under‑use of external text sources (social media, news, reviews) may owe to privacy, legal/regulatory uncertainty, or integration costs. Economic models should account for data access frictions, platform controls, and regulatory constraints when estimating NLP’s ROI in banking.
Competition, differentiation, and market structure
- Banks that operationalize NLP for personalization and acquisition can differentiate services, increasing switching costs and possibly market concentration if these advantages scale. Antitrust and market structure analyses should monitor NLP‑driven segmentation and incumbency advantages.
Labor and skill composition
- Greater NLP adoption could shift demand toward analytics, data engineering, and model‑monitoring skills while reducing routine marketing tasks. This has implications for wage structure and re‑skilling needs within financial services.
Measurement and causal inference needs
- The literature’s descriptive/engineering tilt highlights a gap in causal, field‑experimental evidence on NLP interventions’ effects on customer behavior and firm profits. AI economics should prioritize experimental or quasi‑experimental designs to estimate treatment effects, cost‑benefit, and general equilibrium impacts.
Research & policy priorities
- Empirical work quantifying the welfare impacts (consumer surplus, privacy costs), the distribution of gains across firms and consumers, and regulatory trade‑offs is needed.
- Studies that incorporate externalities (data spillovers), platform interactions (e.g., social media data access), and dynamic effects on competition will sharpen policy guidance.

Suggested immediate research directions for AI economists: - Field experiments in banks testing NLP‑driven acquisition/personalization versus standard approaches, measuring CAC, LTV, retention, and consumer welfare. - Structural/empirical models of adoption decisions that account for data access costs and organizational complementarities. - Analyses of privacy regulation impacts on the availability and economic value of external text data for marketing. - Macro‑level studies on how scaling NLP in banking affects market structure, employment, and productivity.

Assessment

Paper Typereview_meta Evidence Strengthn/a — This is a systematic literature review and mapping exercise rather than an empirical paper that estimates causal effects; it summarizes and characterizes existing studies rather than producing primary causal evidence. Methods Rigorhigh — The study follows a PRISMA protocol for systematic review, screens peer‑reviewed literature (2014–2024), and augments qualitative coding with reproducible, modern text-embedding (sentence-transformer) and dimensionality-reduction (UMAP) methods to map topical coverage and identify gaps—an appropriate and transparent approach for the stated goals, though embedding-based thematic assignments are descriptive and depend on choices of models and hyperparameters. SampleA curated sample of 109 peer‑reviewed journal articles published between 2014 and 2024, comprising (a) 8 articles that directly examine NLP applications in bank marketing, (b) 74 articles on NLP in marketing more broadly, and (c) 27 articles on marketing in banking that do not address NLP; article abstracts/text were embedded with sentence-transformers and visualized with UMAP for thematic clustering. Themesadoption productivity org_design governance labor_markets GeneralizabilityRestricted to peer‑reviewed journal articles (excludes conference papers, working papers, industry reports, and gray literature) which may omit timely or applied deployments., Timebound to 2014–2024; rapid developments in NLP since 2022–2024 may be underrepresented depending on publication lags., Possible language or regional bias if search/filtering favored English‑language journals (not specified)., Embedding + UMAP thematic mapping is sensitive to model choice and parameter settings, so cluster boundaries and 'gaps' are descriptive rather than definitive., Findings pertain specifically to banking marketing and may not generalize to other financial services or nonfinancial industries.

Claims (15)

Claim	Direction	Confidence	Outcome	Details
Only 8 peer‑reviewed papers directly examine NLP in bank marketing (out of a final sample of 109 articles published 2014–2024). Other	null_result	high	count of peer‑reviewed articles focused on NLP in bank marketing	n=109 0.04
Seventy‑four papers study NLP in marketing more broadly (not specifically banking). Other	null_result	high	count of peer‑reviewed articles on NLP in marketing (general)	n=109 0.04
Twenty‑seven papers study marketing in banking without using NLP methods. Other	null_result	high	count of peer‑reviewed articles on marketing in banking that do not use NLP	n=109 0.04
NLP applications in bank marketing are severely under‑studied. Other	negative	high	proportion and absolute count of studies at the intersection of NLP and bank marketing	n=109 0.04
Existing literature on NLP in marketing is concentrated around customer retention tasks (e.g., churn prediction, complaint handling, relationship management). Organizational Efficiency	negative	high	topical frequency/coverage by customer journey stage (retention)	n=109 0.04
Important gaps include customer acquisition, personalization at scale, use of external text sources (social media, news, reviews), operational process improvement, and cross‑channel integration. Other	negative	high	topical coverage by customer journey stage and source type (acquisition, personalization, external text usage, operations, cross‑channel integration)	n=109 0.04
The study followed a PRISMA protocol for literature selection and included peer‑reviewed journal articles published between 2014 and 2024, with a final sample size of n = 109. Other	null_result	high	methodological protocol adherence and sample size	n=109 0.04
The analysis used sentence‑transformer models to produce dense vector representations of article text and UMAP to project those embeddings into a low‑dimensional thematic map for cluster identification and gap detection. Other	null_result	high	analytic techniques applied to article abstracts/text (embedding + dimensionality reduction)	n=109 0.04
This paper is one of the first systematic reviews focused specifically on NLP in bank marketing, organizing findings along the customer journey and the marketing mix to provide a practical taxonomy. Other	null_result	medium	existence of prior systematic reviews specifically on NLP in bank marketing	n=109 0.02
Limited applied research on NLP for acquisition and personalization implies unrealized value in banking: NLP could enable more efficient, targeted customer acquisition and cross‑sell, potentially lowering customer‑acquisition cost (CAC) and increasing lifetime value (LTV). Firm Revenue	positive	speculative	customer‑acquisition cost (CAC), customer lifetime value (LTV), acquisition efficiency	0.0
Realizing NLP value in banks requires organizational investments (data pipelines, model deployment, CRM integration) and complementarity between AI tools and managerial/IT capabilities; returns will depend on these complementarities. Firm Revenue	mixed	medium	realized ROI from NLP adoption conditional on organizational investments and complementarities	0.02
The under‑use of external text sources in the reviewed literature may be due to privacy, legal/regulatory uncertainty, or integration costs. Other	negative	medium	use of external text sources in marketing research and barriers to their use	n=109 0.02
The current literature is skewed toward descriptive and engineering work; there is a lack of causal, field‑experimental evidence on NLP interventions' effects on customer behavior and firm profits. Other	negative	high	presence vs. absence of causal/experimental studies measuring effects on customer behavior and firm profits	n=109 0.04
If banks operationalize NLP for personalization and acquisition at scale, this could increase differentiation, raise switching costs, and potentially affect market concentration—warranting antitrust monitoring. Market Structure	positive	speculative	market structure indicators (differentiation, switching costs, market concentration) conditional on NLP adoption	0.0
Immediate research priorities for AI economists include: field experiments testing NLP‑driven acquisition/personalization (measuring CAC, LTV, retention, consumer welfare); structural/empirical models of adoption that include data access costs and complementarities; and analyses of privacy regulation impacts on external text data availability and value. Other	positive	medium	types of empirical/structural studies to be undertaken and the economic outcomes they should measure (CAC, LTV, retention, welfare, adoption decisions)	0.02