In Nairobi’s digital-lending ecosystem, credit scores are not neutral outputs but products of continual technical and political work; practitioners and regulators co-shape models and markets using data workarounds, multiple risk framings, and legal strategies.

Risk, Data, Alignment: Making Credit Scoring Work in Kenya

Daniel Mwesigwa, Steven J. Jackson, Christopher Csíkszentmihályi · April 13, 2026

openalex descriptive n/a evidence 7/10 relevance DOI Source PDF

Ethnographic evidence from Nairobi shows algorithmic credit scoring is actively produced through technical, legal, and political work by practitioners who shape both models and the social worlds they operate in to stabilize risk under uncertainty.

Credit scoring is an increasingly central and contested domain of data and AI governance, frequently framed as a neutral and objective method of assessing risk across diverse economic and political contexts. Based on a nine-month ethnography of credit scoring practices in Nairobi, Kenya, we examined the sociotechnical and institutional work of data science in digital lending. While established regional telcos and banks are leveraging proprietary data to develop digital loan products, algorithmic credit scoring is being transformed by new actors, techniques, and shifting regulations. Our findings show how practitioners construct alternative data using technical and legal workarounds, formulate risk through multiple interpretations, and negotiate model performance via technical and political means. We argue that algorithmic credit scoring is accomplished through the ongoing work of alignment that stabilizes risk under conditions of persistent uncertainty, taking epistemic, modeling, and contextual forms. Extending work on alignment in HCI, we show how it operates as a two-way translation, where models are made "safe for worlds" while those worlds are reshaped to be "safe for models."

Summary

Main Finding

Algorithmic credit scoring in Kenya is not a purely technical or neutral refinement of risk measurement but a sociotechnical accomplishment achieved through ongoing alignment work. Practitioners create and translate alternative data, construct multiple operationalizations of “risk,” and negotiate model performance via technical and political interventions. Alignment operates bidirectionally: models are made “safe for worlds” while social, institutional, and market practices are reshaped to be “safe for models.” This stabilizes decision-making under persistent epistemic, modeling, and contextual uncertainty but produces significant distributional, privacy, and governance consequences.

Key Points

Three interlinked forms of uncertainty disrupt credit scoring:
- Epistemic uncertainty — limits on what can be known or measured about borrowers.
- Modeling uncertainty — gaps between model claims and real-world deployment contexts.
- Contextual uncertainty — discretion, strategic behavior, and social practices that alter model inputs and outcomes.
Alternative data practices:
- Firms (startups, telcos, banks) construct “alternative data” from mobile-money flows, airtime use, SIM- and device-level signals, and other nontraditional traces.
- These data are often produced through technical and legal workarounds (data engineering, cross-party agreements, creative feature engineering) rather than through passive reuse of neutral records.
- The making of alternative data is performative: data engineering choices reshape how borrowers are represented and how risk is enacted in markets.
Negotiation of model performance:
- Teams balance explainability, regulatory compliance, and business objectives; performance claims are validated and contested across engineers, product managers, legal teams, and regulators.
- Non-technical levers (pricing, collection practices, product design) are used alongside algorithmic adjustments to manage defaults and perceived risk.
Two-way alignment:
- Alignment is not only tuning models to reflect social goals but also reshaping borrower behavior, institutional practices, and market rules to fit model affordances.
- This mutual shaping underpins the apparent success of scoring systems while embedding power asymmetries (e.g., incumbents’ data advantages).
Real-world context:
- Kenya’s fintech ecosystem (e.g., dominant telco platforms, high mobile-money penetration) creates both opportunities and risks: rapid market expansion, high default rates, aggressive debt collection practices, and emergent regulatory responses.
- Borrowers adopt strategies (SIM changes, refinancing, evasive practices) that feed back into model uncertainty and require further alignment work.

Data & Methods

Design: Nine-month ethnographic study in Nairobi (Feb–Nov 2025).
Sites and actors: Observations and engagements with digital-lending startups, established telcos and banks, product and data science teams, regulators, and borrowers/communities.
Methods: Participant observation, shadowing of data science/product teams, interviews across organizational roles, and analysis of regulatory and company practices and artifacts (models, features, data pipelines, policies).
Contextual inputs cited in the study: Kenya’s high mobile-money use (World Bank Findex 2025: ~32% borrowing via mobile-money; ~86% using mobile money for day-to-day needs), reports of high digital-loan default rates (up to ~40%), and Safaricom’s dominant market position in telecom/fintech data.
Analytical frame: Draws on HCI and science-and-technology studies concepts of alignment (Fujimura, Dourish) and distinguishes epistemic, modeling, and contextual uncertainty as analytic categories.

Implications for AI Economics

Market structure & data rents:
- Incumbent platforms with privileged data (telcos, large banks) hold competitive advantages; alternative-data production is itself an economic asset and barrier to entry.
- Economic models should treat data creation and legal/organizational work (data deals, consent engineering) as production inputs and sources of rents.
Endogeneity and performativity:
- Credit scoring is performative: the use of scores changes borrower behavior and market functioning. Econometric and theoretical models must account for feedback loops and policy endogeneity when estimating welfare or default dynamics.
Pricing, segmentation, and fairness:
- Risk-based pricing informed by alternative data may enable finer segmentation and price discrimination, raising distributional concerns. Regulators and economists should study who benefits and who is excluded.
Strategic behavior and model fragility:
- Borrowers and intermediaries adopt tactics (SIM switching, obfuscation, strategic borrowing) that reduce predictive validity and can generate arms races between firms and users. This implies higher ongoing costs for model maintenance and monitoring.
Institutional and coordination costs:
- Firms invest beyond algorithms: legal compliance, lobbying, collection practices, product redesign, and trust-building mechanisms are part of the “cost of alignment.” These non-algorithmic investments affect market equilibrium and should be included in cost–benefit assessments.
Policy and regulatory design:
- Effective regulation requires attention to how alternative data are produced/processed (not only which features are used). Transparency, auditability, limits on risky operational practices (e.g., public shaming collections), and standards for acceptable alternative-data sourcing are critical.
- Consumer protection policy should consider both model outputs and the institutional practices that make scoring workable (collection methods, pricing strategies, dispute mechanisms).
Research agenda for economists:
- Quantify welfare impacts of alternative-data scoring on different socioeconomic groups.
- Model dynamic feedbacks between scoring deployment and borrower behavior (endogenous selection and strategic responses).
- Measure transaction and governance costs of alignment (legal workarounds, data-sharing contracts, monitoring).
- Evaluate how regulatory interventions (data protection, transparency mandates, caps on interest/collections) alter incentives for data creation and model design.

Overall, the paper highlights that evaluating AI-driven financial products requires moving beyond predictive-accuracy metrics to include the economics of data production, institutional alignment work, strategic behavior, and regulatory feedbacks that collectively determine market outcomes and social welfare.

Assessment

Paper Typedescriptive Evidence Strengthn/a — The paper is a qualitative nine-month ethnography offering rich, contextualized insights rather than causal estimates; it does not attempt or provide statistical identification of causal effects. Methods Rigorhigh — Sustained nine-month ethnographic fieldwork with practitioner observation, interviews, and analysis of technical and institutional practices provides deep, triangulated qualitative evidence; however, it remains bounded to context-specific cases and interpretive analysis. SampleNine-month ethnography of credit scoring practices in Nairobi, Kenya, including field observations, interviews with practitioners at regional telcos, banks, digital lenders and startups, analysis of model artifacts and technical workflows, and engagement with regulators and policy documents. Themesgovernance adoption GeneralizabilitySingle-city, single-country study (Nairobi, Kenya) limits geographic generalizability, Focus on digital lending and specific local actors (telcos, banks, startups) may not represent other financial sectors, Qualitative, purposive sample prevents statistical generalization to broader populations, Findings may be time-bound given rapidly evolving technologies and regulatory changes

Claims (8)

Claim	Direction	Confidence	Outcome	Details
Credit scoring is an increasingly central and contested domain of data and AI governance. Governance And Regulation	mixed	high	role of credit scoring in data and AI governance (centrality and contestedness)	0.18
Established regional telcos and banks are leveraging proprietary data to develop digital loan products. Adoption Rate	positive	high	use of proprietary data by telcos and banks to create digital loan products (adoption of data-driven lending)	0.18
Algorithmic credit scoring is being transformed by new actors, techniques, and shifting regulations. Market Structure	mixed	high	structural transformation of algorithmic credit scoring (actor composition, techniques, regulatory landscape)	0.18
Practitioners construct alternative data using technical and legal workarounds. Innovation Output	mixed	high	practices for generating and using alternative data in credit models	0.18
Practitioners formulate risk through multiple interpretations. Decision Quality	mixed	high	variation in definitions and framings of risk among practitioners	0.18
Practitioners negotiate model performance via technical and political means. Output Quality	mixed	high	practices used to achieve and justify model performance (technical tuning and political negotiation)	0.18
Algorithmic credit scoring is accomplished through the ongoing work of alignment that stabilizes risk under conditions of persistent uncertainty, taking epistemic, modeling, and contextual forms. Decision Quality	mixed	high	alignment practices that stabilize risk amid uncertainty (epistemic, modeling, contextual alignment)	0.18
Alignment operates as a two-way translation, where models are made 'safe for worlds' while those worlds are reshaped to be 'safe for models.' Ai Safety And Ethics	mixed	high	reciprocal adjustments between predictive models and social/institutional environments (two-way alignment)	0.18