The Commonplace

Time-series foundation models trained on low-frequency data stumble on millisecond-scale 5G network dynamics, even after fine-tuning. Firms that collect and pretrain on high-resolution telemetry will have a practical edge in latency-sensitive services, potentially concentrating rents among incumbents.

Bridging the High-Frequency Data Gap: A Millisecond-Resolution Network Dataset for Advancing Time Series Foundation Models
Subina Khanal, Seshu Tirupathi, Merim Dzaferagic, Marco Ruffini, Torben Bach Pedersen · March 17, 2026
arXiv · descriptive · medium evidence · 7/10 relevance
Time-series foundation models pretrained on low-frequency datasets generalize poorly to millisecond-resolution 5G network traces in both zero-shot and fine-tuned settings, implying the need for high-frequency, domain-diverse pretraining and adapted architectures/fine-tuning for low-latency applications.

Time series foundation models (TSFMs) require diverse, real-world datasets to adapt across varying domains and temporal frequencies. However, current large-scale datasets predominantly focus on low-frequency time series with sampling intervals, i.e., time resolution, in the range of seconds to years, hindering their ability to capture the nuances of high-frequency time series data. To address this limitation, we introduce a novel dataset that captures millisecond-resolution wireless and traffic conditions from an operational 5G wireless deployment, expanding the scope of TSFMs to incorporate high-frequency data for pre-training. Further, the dataset introduces a new domain, wireless networks, thus complementing existing more general domains like energy and finance. The dataset also provides use cases for short-term forecasting, with prediction horizons spanning from 100 milliseconds (1 step) to 9.6 seconds (96 steps). By benchmarking traditional machine learning models and TSFMs on predictive tasks using this dataset, we demonstrate that most TSFM model configurations perform poorly on this new data distribution in both zero-shot and fine-tuned settings. Our work underscores the importance of incorporating high-frequency datasets during pre-training and forecasting to enhance architectures, fine-tuning strategies, generalization, and robustness of TSFMs in real-world applications.

Summary

Main Finding

A new millisecond-resolution dataset from an operational 5G deployment shows that current time series foundation models (TSFMs) — typically pre-trained on low-frequency data — generalize poorly to high-frequency wireless and traffic data in both zero-shot and fine-tuned settings. This highlights the need to include high-frequency, domain-diverse data in pre-training and to adapt model architectures and fine-tuning strategies for high-resolution time series.

Key Points

  • Dataset novelty: Introduces millisecond-resolution measurements of wireless and traffic conditions from a real-world 5G network, adding a new domain (wireless networks) to typical TSFM pretraining corpora (which often emphasize energy, finance, etc.).
  • Temporal scale: With a forecasting step of 100 ms, horizons range from 100 ms (1 step) to 9.6 s (96 steps), supporting very short-term predictive tasks that low-frequency datasets do not cover.
  • Benchmarking result: Most TSFM configurations evaluated perform poorly on this high-frequency distribution, both in zero-shot transfer and after fine-tuning; traditional ML baselines were included for comparison.
  • Implication for pre-training: Lack of high-frequency examples in pre-training data reduces TSFM effectiveness on such tasks, indicating pretraining corpora must be broadened across temporal scales and domains.
  • Research directions: The results call for architectures, multi-scale modeling, fine-tuning protocols, and robustness evaluations tailored to high-frequency time series.
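
The horizon arithmetic in the "Temporal scale" point can be made concrete with a short sketch. It assumes only what the summary states (1 step = 100 ms, horizons of 1 to 96 steps); the function names `horizon_ms` and `make_windows`, the context length, and the toy series are hypothetical and are not taken from the paper.

```python
STEP_MS = 100  # one forecasting step, per the summary (1 step = 100 ms)


def horizon_ms(steps, step_ms=STEP_MS):
    """Wall-clock length of a forecast horizon, in milliseconds."""
    if steps < 1:
        raise ValueError("steps must be >= 1")
    return steps * step_ms


def make_windows(series, context_len, horizon):
    """Slice a 1-D sequence into (context, target) pairs.

    Each pair holds `context_len` past samples and the `horizon`
    samples that immediately follow them. Returns an empty list
    when the series is too short for a single pair.
    """
    windows = []
    last_start = len(series) - context_len - horizon
    for start in range(last_start + 1):
        split = start + context_len
        windows.append((series[start:split], series[split:split + horizon]))
    return windows


if __name__ == "__main__":
    for steps in (1, 96):
        print(steps, "step(s) ->", horizon_ms(steps) / 1000, "s")
    toy = list(range(10))  # hypothetical stand-in for one telemetry feature
    pairs = make_windows(toy, context_len=4, horizon=2)
    print(len(pairs), "windows; first:", pairs[0])
```

At this step size the longest horizon (96 steps) spans 9,600 ms, which matches the 9.6 s figure quoted above.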

Data & Methods

  • Data source: Operational 5G wireless deployment capturing wireless channel and traffic condition measurements at millisecond resolution (exact feature set per record not specified here).
  • Domain expansion: Wireless networking data complements existing TSFM domains (e.g., energy, finance) by adding network dynamics, interference, mobility, and traffic bursts characteristic of communications systems.
  • Tasks: Short-term forecasting tasks with horizons from 1 step (100 ms) to 96 steps (9.6 s) to evaluate immediate prediction performance relevant to resource allocation and control in networks.
  • Benchmarks: Compared multiple TSFM configurations against traditional machine learning models on predictive performance. Evaluations included zero-shot transfer (no fine-tuning) and fine-tuning regimes.
  • Findings from benchmarks: Most TSFM variants failed to reach adequate predictive performance on the new distribution, revealing sensitivity to temporal resolution and domain mismatch; fine-tuning provided limited recovery in many configurations.
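
A benchmark of this kind typically scores each forecaster over sliding windows at several horizons. The sketch below illustrates that loop under loud assumptions: the summary does not name the paper's metrics or baselines, so mean absolute error and a naive persistence forecaster are stand-ins, and `synthetic_trace` generates a made-up random walk rather than real 5G telemetry.

```python
import random


def persistence_forecast(context, horizon):
    """Naive baseline (an assumption, not the paper's): repeat the last value."""
    return [context[-1]] * horizon


def mean_absolute_error(predicted, actual):
    """Average absolute difference between two equal-length sequences."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


def evaluate(series, forecaster, context_len, horizon):
    """Average MAE of `forecaster` over every sliding window of `series`."""
    errors = []
    for start in range(len(series) - context_len - horizon + 1):
        split = start + context_len
        context = series[start:split]
        target = series[split:split + horizon]
        errors.append(mean_absolute_error(forecaster(context, horizon), target))
    if not errors:
        raise ValueError("series too short for this context_len and horizon")
    return sum(errors) / len(errors)


def synthetic_trace(n, seed=0):
    """Hypothetical stand-in for a telemetry feature: a bounded random walk."""
    rng = random.Random(seed)
    value, trace = 50.0, []
    for _ in range(n):
        value = min(100.0, max(0.0, value + rng.gauss(0.0, 2.0)))
        trace.append(value)
    return trace


if __name__ == "__main__":
    trace = synthetic_trace(2000)
    for horizon in (1, 12, 96):  # 100 ms, 1.2 s, and 9.6 s at 100 ms per step
        score = evaluate(trace, persistence_forecast, context_len=96, horizon=horizon)
        print(f"horizon={horizon:>2} steps  MAE={score:.3f}")
```

Any model that maps `(context, horizon)` to a list of predictions can be passed in place of `persistence_forecast`, which makes it easy to compare a foundation model's zero-shot output against a trivial baseline on the same windows.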

Implications for AI Economics

  • Value of data collection: High-frequency datasets (e.g., millisecond 5G traces) are economically valuable assets. Firms that invest in collecting domain-specific, high-resolution data can gain competitive advantages in model performance for latency-sensitive applications (telecom control, high-frequency trading analogues, real-time pricing).
  • Pretraining investment and returns: Pretraining on diverse temporal resolutions increases upfront costs (data acquisition, storage, compute) but can raise model generalization, reduce costly task-specific retraining, and speed deployment in time-critical markets — potentially improving ROI for platform providers and large enterprises.
  • Market structure and entry barriers: Organizations without access to high-frequency operational data may be disadvantaged. This could increase barriers to entry in sectors where low-latency forecasting matters, concentrating economic rents with incumbents able to collect such data.
  • Productization and monetization: Improved short-term forecasting for networks enables better resource allocation (spectrum, scheduling, edge compute), reduced service-level violations, and new latency-sensitive services — translating into revenue and cost savings.
  • Policy and regulation: Data governance, privacy, and sharing rules for network telemetry will affect who can collect/use such datasets. Policymakers may need to weigh incentives for data sharing (to foster competition and innovation) versus protecting proprietary operational data.
  • Research & deployment priorities: Economic actors should prioritize:
    • Investing in multi-scale pretraining corpora to reduce model brittleness across temporal resolutions.
    • Developing specialized architectures and fine-tuning procedures for high-frequency series to lower downstream adaptation costs.
    • Cost–benefit analyses for data collection and model re-training to inform capital allocation in AI-enabled infrastructure.
    • Standards for benchmarking high-frequency time series performance to guide procurement and contracting decisions.

Suggested next steps for practitioners and researchers: expand pretraining datasets to include high-frequency domains, explore multi-scale and temporal-resolution-aware architectures, evaluate transfer learning costs vs. benefits, and quantify economic gains from improved low-latency forecasting in telecom and other real-time industries.
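
One simple way to expose a model to several temporal resolutions is to derive coarser views of the same high-frequency series. The sketch below does this by block averaging. It is an illustration of the multi-scale idea only, not a method from the paper; `downsample_mean`, `multi_scale_views`, and the chosen factors are hypothetical.

```python
def downsample_mean(series, factor):
    """Average consecutive, non-overlapping blocks of `factor` samples.

    Trailing samples that do not fill a whole block are dropped.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    usable = len(series) - len(series) % factor
    return [sum(series[i:i + factor]) / factor for i in range(0, usable, factor)]


def multi_scale_views(series, base_step_ms, factors):
    """Map each resulting step size (in ms) to its downsampled series."""
    return {base_step_ms * f: downsample_mean(series, f) for f in factors}


if __name__ == "__main__":
    base = list(range(20))  # hypothetical 100 ms samples covering 2 s
    views = multi_scale_views(base, base_step_ms=100, factors=(1, 5, 10))
    for step_ms, values in views.items():
        print(f"{step_ms:>5} ms step: {len(values)} samples")
```

Block averaging is the crudest choice; whether coarser views of this kind actually improve transfer is an open question that the summary's suggested experiments would need to answer.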

Assessment

  • Paper Type: descriptive
  • Evidence Strength: medium. The paper provides direct empirical benchmarking on a novel, real-world millisecond-resolution 5G dataset and compares multiple TSFM configurations and traditional baselines, so results about model performance on that distribution are well supported; however, evidence is limited to a single deployment and set of model/fine-tuning choices and does not establish causal claims about economic outcomes.
  • Methods Rigor: medium. Data collection at millisecond resolution and evaluation across zero-shot and fine-tuned conditions and multiple baselines indicate careful benchmarking, but the report lacks detailed feature descriptions, information about dataset size/representativeness, sensitivity analyses across architectures and hyperparameters, and ablation studies that would strengthen methodological claims.
  • Sample: Operational 5G deployment telemetry capturing wireless channel and traffic-condition measurements at millisecond resolution (100 ms sampling step used as 1-step), used to construct short-term forecasting tasks with horizons from 1 step (100 ms) to 96 steps (9.6 s); exact feature set, geographic scope, deployment duration, and number of devices/sessions are not specified in the summary.
  • Themes: adoption, productivity, innovation
  • Generalizability:
    • Single operational deployment: results may not generalize across different operators, geographies, network hardware, traffic mixes, or 5G configurations.
    • Feature set and preprocessing not fully specified: unknown whether other telemetry modalities would change outcomes.
    • Benchmarks limited to particular TSFM variants and fine-tuning regimes: other architectures or transfer protocols might perform better.
    • Focus on very short horizons (sub-10 s): findings may not apply to lower-frequency forecasting tasks.
    • Economic implications extrapolate from technical results and assume firms can collect/monetize high-frequency data, which may not hold in all markets.

Claims (14)

  • Introduces a new millisecond-resolution dataset of wireless channel and traffic-condition measurements from an operational 5G deployment.
    • Other · Direction: positive · Confidence: high
    • Outcome: availability and characteristics of a millisecond-resolution 5G measurement dataset (sampling resolution, domain coverage)
    • Details: Introduces millisecond-resolution 5G measurement dataset (sampling resolution claim) · 0.18
  • The dataset sampling resolution is at the millisecond level, enabling forecasting horizons from 1 step (100 ms) up to 96 steps (9.6 s).
    • Other · Direction: positive · Confidence: high
    • Outcome: supported forecast horizons (temporal prediction horizon: 100 ms–9.6 s)
    • Details: Forecasting horizons supported: 100 ms to 9.6 s (1 to 96 steps) · 0.18
  • Current time-series foundation models (TSFMs), typically pretrained on low-frequency data, generalize poorly to high-frequency wireless and traffic data in zero-shot transfer.
    • Output Quality · Direction: negative · Confidence: medium
    • Outcome: predictive performance in zero-shot transfer (forecasting accuracy/error on high-frequency data)
    • Details: TSFMs generalize poorly in zero-shot to high-frequency wireless/traffic data (benchmark finding) · 0.11
  • Fine-tuning TSFMs on the high-frequency 5G data provides limited recovery; many configurations still perform poorly after fine-tuning.
    • Output Quality · Direction: mixed · Confidence: medium
    • Outcome: predictive performance after fine-tuning (forecasting accuracy/error)
    • Details: Fine-tuning provides limited recovery for many TSFM configurations (qualitative) · 0.11
  • Most TSFM configurations evaluated failed to achieve adequate predictive performance on this high-frequency distribution.
    • Output Quality · Direction: negative · Confidence: medium
    • Outcome: adequacy of predictive performance (forecasting error/accuracy relative to task requirements)
    • Details: Most TSFM configurations evaluated failed to achieve adequate predictive performance (qualitative) · 0.11
  • Traditional machine-learning baselines were included for comparison in the benchmarks.
    • Other · Direction: positive · Confidence: high
    • Outcome: inclusion of traditional ML baseline models in comparative evaluation
    • Details: Traditional ML baselines were included in benchmarks (methodological note) · 0.18
  • The poor TSFM performance is attributed to pretraining corpora lacking high-frequency, domain-diverse examples (temporal-scale and domain mismatch).
    • Output Quality · Direction: negative · Confidence: medium
    • Outcome: generalization effectiveness of TSFMs when pretrained on low-frequency corpora and evaluated on high-frequency tasks
    • Details: Poor TSFM transfer attributed to pretraining corpora lacking high-frequency examples (causal interpretation) · 0.11
  • Pretraining corpora must be broadened across temporal scales and domains (including high-frequency domains) to improve TSFM generalization.
    • Output Quality · Direction: positive · Confidence: medium
    • Outcome: expected improvement in model generalization (forecasting performance) if pretraining corpora include diverse temporal scales/domains (claimed, not experimentally verified in the summary)
    • Details: Recommendation: broaden pretraining corpora across temporal scales to improve generalization (prescriptive) · 0.11
  • Research and engineering efforts should develop architectures, multi-scale modeling, and fine-tuning protocols tailored to high-frequency time series.
    • Research Productivity · Direction: positive · Confidence: speculative
    • Outcome: anticipated improvement in high-frequency time-series performance through specialized architectures and protocols (proposal, not measured)
    • Details: Recommendation: develop architectures, multi-scale modeling, and fine-tuning protocols for high-frequency TS (prescriptive) · 0.02
  • High-frequency datasets (like millisecond 5G traces) are economically valuable; firms that collect such domain-specific, high-resolution data can gain competitive advantages in low-latency applications.
    • Market Structure · Direction: positive · Confidence: speculative
    • Outcome: economic value / competitive advantage derived from proprietary high-frequency datasets (argumentative, not empirically measured here)
    • Details: High-frequency datasets are economically valuable and confer competitive advantage (qualitative) · 0.02
  • Pretraining on diverse temporal resolutions increases upfront costs (data acquisition, storage, compute) but can raise model generalization and reduce downstream retraining costs, improving ROI for platform providers.
    • Firm Revenue · Direction: mixed · Confidence: speculative
    • Outcome: trade-off between upfront pretraining costs and downstream retraining costs / model generalization (theoretical claim)
    • Details: Pretraining on diverse temporal resolutions increases upfront costs but can improve generalization and reduce downstream retraining costs (qualitative trade-off) · 0.02
  • Organizations without access to high-frequency operational data may face increased barriers to entry in latency-sensitive markets, concentrating rents with incumbents who can collect such data.
    • Market Structure · Direction: negative · Confidence: speculative
    • Outcome: market competition / barriers to entry due to asymmetric access to high-frequency data (speculative implication)
    • Details: Organizations without access to high-frequency data may face higher barriers to entry and rent concentration (speculative implication) · 0.02
  • Improved short-term forecasting enabled by high-frequency data can translate into operational benefits such as better resource allocation (spectrum, scheduling), reduced service-level violations, and enablement of new latency-sensitive services.
    • Organizational Efficiency · Direction: positive · Confidence: speculative
    • Outcome: operational improvements (resource allocation efficiency, reduction in service-level violations, enablement of latency-sensitive services) as a function of improved forecasting (proposed, not measured)
    • Details: Improved short-term forecasting can enable better resource allocation, fewer SLA violations, and new latency-sensitive services (projected operational benefits) · 0.02
  • Benchmarks and standards are needed for evaluating high-frequency time series performance to guide procurement and contracting decisions.
    • Governance And Regulation · Direction: positive · Confidence: speculative
    • Outcome: existence and adoption of high-frequency TS benchmarking standards (recommendation)
    • Details: Recommendation: establish benchmarks and standards for high-frequency TS evaluation to guide procurement · 0.02

Notes