Exchanging generative modules, not model weights, lets siloed clients recover centralized accuracy in one communication round and supports exact deletion of a client's contribution. The method reduces reliance on pretrained foundation models, lowering coordination costs and creating opportunities (and regulatory questions) for marketplaces in certified generative priors.

FederatedFactory: Generative One-Shot Learning for Extremely Non-IID Distributed Scenarios

Andrea Moleri, Christian Internò, Ali Raza, Markus Olhofer, David Klindt, Fabio Stella, Barbara Hammer · March 17, 2026

arxiv other medium evidence 7/10 relevance Source PDF

FederatedFactory replaces weight-sharing with exchanged generative modules so clients can synthesize class-balanced datasets in one round, restoring centralized-level accuracy under extreme label heterogeneity and enabling exact modular unlearning.

Federated Learning (FL) enables distributed optimization without compromising data sovereignty. Yet, where local label distributions are mutually exclusive, standard weight aggregation fails due to conflicting optimization trajectories. Often, FL methods rely on pretrained foundation models, introducing unrealistic assumptions. We introduce FederatedFactory, a zero-dependency framework that inverts the unit of federation from discriminative parameters to generative priors. By exchanging generative modules in a single communication round, our architecture supports ex nihilo synthesis of universally class balanced datasets, eliminating gradient conflict and external prior bias entirely. Evaluations across diverse medical imagery benchmarks, including MedMNIST and ISIC2019, demonstrate that our approach recovers centralized upper-bound performance. Under pathological heterogeneity, it lifts baseline accuracy from a collapsed 11.36% to 90.57% on CIFAR-10 and restores ISIC2019 AUROC to 90.57%. Additionally, this framework facilitates exact modular unlearning through the deterministic deletion of specific generative modules.

Summary

Main Finding

FederatedFactory reframes federated learning by exchanging generative modules (priors) rather than discriminative model weights. In a single communication round it synthesizes class-balanced datasets from distributed, mutually exclusive local labels, eliminating gradient conflict and dependence on pretrained foundation models. Empirically on medical imagery benchmarks and CIFAR-10, it recovers centralized upper-bound performance and enables exact modular unlearning.

Key Points

Problem addressed: Standard FL collapses or fails when clients hold mutually exclusive label sets (severe label heterogeneity), producing conflicting optimization trajectories.
Core idea: Invert the unit of federation — share generative priors/modules instead of classifier weights. Clients use received generative modules to synthesize a universal, class-balanced dataset locally.
Zero-dependency: No reliance on external pretrained foundation models; the framework operates without that unrealistic assumption.
Communication efficiency: Achieves synthesis and alignment in a single communication round (clients exchange generative modules once).
Performance claims:
- Restores CIFAR-10 accuracy from a collapsed 11.36% baseline to 90.57% under pathological heterogeneity.
- Restores ISIC2019 AUROC to 90.57%.
- Across diverse medical imagery (including MedMNIST, ISIC2019) matches centralized upper-bound performance.
Unlearning: Supports exact modular unlearning by deterministically deleting specific generative modules (module-level removals correspond to removing their contribution to the synthesized dataset).

Data & Methods

Datasets mentioned: MedMNIST suite, ISIC2019 (dermatology images), and CIFAR-10 (used as a pathological heterogeneity stress test).
Method components:
- Each client trains or contributes a generative module encoding its local data distribution (class-specific priors).
- A single exchange of these generative modules across clients creates the ingredients for synthesizing a globally class-balanced training set at each client.
- Downstream discriminative models are trained on these synthesized, balanced datasets (so gradient conflict among clients’ discriminative updates is avoided).
- Deterministic module deletion enables exact removal of a client’s contribution (modular unlearning).
Evaluation metrics: classification accuracy (CIFAR-10) and AUROC (ISIC2019), compared against federated baselines and a centralized upper bound.
Experimental regime: Pathological label heterogeneity (mutually exclusive local labels) used to stress-test approaches; comparisons report major recovery of performance versus collapsed baselines.

Implications for AI Economics

Reduced reliance on foundation models: By eliminating the need for pretrained external priors, FederatedFactory weakens vendor lock-in around large foundation-model providers and shifts value toward locally trained generative modules and FL orchestration.
Lower communication rounds: Single-round exchange reduces networking/coordination costs, potentially lowering transaction costs for cross-silo collaboration and enabling cheaper federated offerings.
New synthetic-data markets and business models: Organizations could monetize certified generative modules or synth-data generation services, enabling data-scarce firms to access balanced training sets without raw-data sharing.
Data sovereignty and regulatory compliance: Exchanging generative modules (rather than raw data) and enabling exact modular unlearning align well with privacy regulations and make compliance and auditability easier — this could reduce legal risk and compliance costs.
Redistribution of economic value: If centralized performance can be recovered without centralized datasets, the competitive advantage from holding unique data assets may decrease; value may shift to quality of generative modules, module marketplaces, and expertise in module training.
Cost and incentives to adopt:
- Potential compute costs to train generative modules could be nontrivial; buyers/sellers must value tradeoffs between module training cost and downstream utility.
- Certification and trust frameworks will be important — buyers will need assurance that exchanged modules are non-toxic, non-leaking, and high-quality.
Risks and open economic questions:
- Potential for synthetic data to inadvertently leak private information — economic liability and insurance models will need to adapt.
- Standards and certification for synthetic-data quality and fairness will create new credentialing markets.
- Market concentration could still arise around high-quality module providers or FL orchestration platforms.
- Need for independent replication and analysis of claimed gains (e.g., identical 90.57% figures reported) before large-scale investment or regulatory reliance.

Overall, FederatedFactory suggests a shift in the economics of data collaboration: from pooling raw labeled data or depending on large pretrained models to exchanging modular generative assets that preserve sovereignty, reduce coordination costs, and open new markets — while raising new questions about certification, liability, and incentive alignment.

Assessment

Paper Typeother Evidence Strengthmedium — The paper provides strong empirical results on multiple public image benchmarks (CIFAR-10 and several medical-imaging datasets) and compares to federated baselines plus a centralized upper bound, showing dramatic recovery under extreme label heterogeneity; however, evidence is limited to benchmark experiments, simulated client partitions, a small number of datasets, and lacks real-world deployment, external replication, detailed privacy-leakage analysis, and cost/scale evaluation, so external validity is uncertain. Methods Rigormedium — Experimental design has several strengths — evaluation across multiple datasets including domain-relevant medical imagery, explicit pathological-label heterogeneity stress tests, comparison to collapsed baselines and centralized upper bounds, and demonstration of exact module-level unlearning — but the paper appears to omit or underreport important practical checks (scalability to many clients/classes, communication/computation cost accounting, privacy/leakage and membership tests, statistical robustness/uncertainty estimates, and real-world cross-silo deployments), limiting methodological rigor for economic or deployment claims. SamplePublic image benchmarks: CIFAR-10 (used as a pathological heterogeneity stress test), ISIC2019 dermatology images, and multiple datasets from the MedMNIST suite; experiments simulate federated clients holding mutually exclusive local label sets and exchange class-specific generative modules in a single communication round; downstream discriminative models are trained on locally synthesized, class-balanced datasets. Exact client counts, model sizes, and some dataset split details are not specified in the summary. Themesinnovation adoption governance GeneralizabilityEvaluated only on image classification benchmarks (CIFAR-10, MedMNIST, ISIC2019); unclear transfer to other modalities (text, speech, tabular)., Experiments simulate extreme, mutually exclusive label heterogeneity—performance under more realistic, mixed heterogeneity patterns is not shown., Scalability untested: unknown behavior with many clients, many classes, or large-resolution medical images and larger generative models., Privacy and membership-leakage risks of generative modules not fully analyzed, limiting conclusions about regulatory safety or legal compliance., Compute and economic costs of training and exchanging generative modules (vs. models or synthetic-data services) are not quantified, limiting economic applicability., Real-world deployment frictions (infrastructure, certification, trust frameworks) and adversarial or malicious-module scenarios are not evaluated.

Claims (14)

Claim	Direction	Confidence	Outcome	Details
FederatedFactory reframes federated learning by exchanging generative modules (priors) instead of exchanging discriminative model weights. Other	positive	medium	unit of federation / protocol (generative modules vs. discriminative weights)	0.07
A single communication round of generative-module exchange suffices for clients to synthesize class-balanced datasets locally and align their training data. Organizational Efficiency	positive	high	number of communication rounds required; class balance of synthesized datasets	Single communication round reported (experiments on CIFAR-10, MedMNIST, ISIC2019) 0.12
By synthesizing class-balanced datasets locally from exchanged generative modules, FederatedFactory eliminates gradient conflict among clients' discriminative updates. Organizational Efficiency	positive	medium	reduction/elimination of gradient conflict (inferred via improved downstream performance and convergence vs. collapsed federated baselines)	0.07
FederatedFactory operates without relying on external pretrained foundation models (zero-dependency). Other	positive	medium	dependency on pretrained models (binary: uses / does not use)	Zero dependency on external pretrained foundation models (binary claim) 0.07
Under pathological label heterogeneity (mutually exclusive local labels) FederatedFactory restores CIFAR-10 classification accuracy from a collapsed baseline of 11.36% to 90.57%. Output Quality	positive	high	CIFAR-10 classification accuracy (%)	Restored CIFAR-10 accuracy: 90.57% (baseline collapsed: 11.36%) 0.12
FederatedFactory restores ISIC2019 performance to AUROC = 90.57% under the tested regime. Output Quality	positive	medium	ISIC2019 AUROC	ISIC2019 AUROC = 90.57% 0.07
Across diverse medical imagery benchmarks (including MedMNIST and ISIC2019), FederatedFactory matches centralized upper-bound performance. Output Quality	positive	medium	classification performance vs. centralized upper bound (accuracy/AUROC)	Matches centralized upper-bound performance (reported across medical imagery benchmarks) 0.07
Downstream discriminative models trained on the synthesized, balanced datasets avoid conflicting optimization trajectories that cause collapse in standard federated learning under mutually exclusive labels. Organizational Efficiency	positive	medium	optimization stability / avoidance of collapsed training (measured indirectly via final performance metrics)	0.07
FederatedFactory enables exact modular unlearning: deterministic deletion of a client's generative module exactly removes that client's contribution to synthesized datasets. Regulatory Compliance	positive	medium	unlearning correctness (module-level removal effect on synthesized dataset composition)	Exact modular unlearning (deterministic deletion of a client's generative module) 0.07
FederatedFactory recovers centralized-model performance without pooling raw data or relying on a central dataset, thereby weakening dependence on foundation-model vendors and their pretrained priors. Market Structure	positive	medium	performance gap vs. centralized model; dependence on external pretrained priors	Recovers centralized performance without pooling raw data; reduces dependence on foundation-model vendors (conceptual claim) 0.07
The single-round exchange decreases communication rounds and associated coordination/network costs compared to typical iterative federated learning. Organizational Efficiency	positive	low	number of communication rounds; implied network/coordination cost (not directly measured)	Single-round exchange reduces communication/coordination rounds (qualitative) 0.04
FederatedFactory enables new economic opportunities (module marketplaces, synthetic-data services) and affects incentives by shifting value toward modular generative assets and orchestration rather than raw centralized datasets. Market Structure	positive	low	economic outcomes (market structure, incentives)—conceptual, not empirically measured	Enables module marketplaces and synthetic-data services (conceptual economic impact) 0.04
Exchanging generative modules (rather than raw data) and enabling modular unlearning improves auditability and aligns better with privacy/regulatory compliance than raw-data sharing. Regulatory Compliance	positive	low	regulatory compliance / auditability (qualitative claim)	Module exchange and modular unlearning improve auditability/privacy alignment (qualitative) 0.04
FederatedFactory's synthesized datasets allow organizations with data scarcity to obtain balanced training sets without sharing raw data, but training generative modules may incur nontrivial compute costs and require certification/trust frameworks. Training Effectiveness	mixed	low	compute/training cost (qualitative), need for certification/trust frameworks (qualitative)	Synthesized balanced datasets available to data-scarce orgs; training generative modules may incur nontrivial compute costs and require certification (qualitative) 0.04