Cloud Repatriation

A decision framework for 2026 — the real survey data, a workload scoring matrix, a 3–5 year TCO model, and the risks. Repatriation as portfolio management, not ideology.

What repatriation is — and what it isn't

Cloud repatriation — moving workloads from public cloud back to private cloud, colocation, or on-premises — has become one of the defining infrastructure conversations of 2026. But it is not an anti-cloud backlash, and treating it as one leads you to the wrong decision. The framing I keep coming back to is portfolio management: placing each workload where its economics, performance, and compliance profile are best served. This is a workload-by-workload call, not an ideology.

The data is consistent on this. Only a small minority of organisations pursue a total cloud exit; the overwhelming majority are rebalancing — keeping variable, spiky, experimental workloads in the cloud while moving predictable, steady-state ones to owned infrastructure. The shift is from "everything in the public cloud" to "the right workload in the right place."


Why the 2026 narrative is happening

Several forces converged to turn repatriation from a fringe talking point into a board-level topic.

The invoice finally arrived

Organisations that adopted lift-and-shift cloud-first strategies between roughly 2018 and 2022 are now confronting compounding bills — egress fees, cross-region and cross-AZ transfer charges, premium managed-service markups, storage growth. FinOps and rightsizing help, but only to a point; some workloads stay structurally more expensive in the cloud even after optimisation. Flexera's 2026 data puts wasted IaaS/PaaS spend at around 29% of the total, after several years of slow decline.

The survey evidence (and the scepticism it deserves)

  • Barclays CIO Survey (Q4 2024): 83% of CIOs planned to move at least some workloads from public cloud back to private or on-premises — up from 43% in 2020, the highest the survey had recorded. (Some secondary write-ups report 86%; treat the precise number as soft.)
  • IDC: roughly 80% of enterprises expect to repatriate some compute or storage within 12 months — but only about 8% are moving entire workloads off cloud. That gap is the whole story: this is selective rebalancing, not exodus.
  • Reported drivers (IDC): cost (54%), performance (31%), data sovereignty (27%).

A note I'd insist on: these figures circulate widely across vendor blogs with slightly different numbers, and many of the loudest sources are infrastructure vendors with a commercial interest in the narrative. Treat the direction as well-evidenced and the precise percentages as soft.

The "Cost of Cloud" thesis, and AI economics

The intellectual foundation is Andreessen Horowitz's 2021 essay "The Cost of Cloud, a Trillion Dollar Paradox" by Sarah Wang and Martin Casado — its memorable line, "you're crazy if you don't start in the cloud; you're crazy if you stay on it," and its advice to have a repatriation strategy before you reach scale. Note they did not argue for blanket repatriation; they argued infrastructure spend should be a first-class metric. The thesis has credible critics, too (Corey Quinn among them).

The newer driver is AI. Continuous, high-utilisation inference is exactly the predictable profile that runs cheaper on owned hardware, and the AI budget squeeze is forcing CIOs to optimise everything else. For steady-state inference, the cloud's elasticity premium evaporates.

The egress turning point

In 2024, regulatory pressure (the EU Data Act) forced a structural change: Google Cloud announced free egress for leaving customers in January 2024, and AWS and Azure followed in March. The catch — these waivers generally require you to be exiting (cancelling, completing the move within 60–90 days), so they help one-way repatriation but not ongoing multi-cloud. They still materially lower the cost of a clean exit.


Who has actually repatriated

Named cases anchor the argument — but note the pattern: predictable, data-heavy workloads and strong engineering cultures. They are not evidence that repatriation works for everyone.

~$2M/yr
37signals
Exited AWS from 2022; ~$700k on hardware recouped within a year. In 2025 it moved ~18 PB off S3 onto its own data centres, with AWS waiving a $250k egress bill.
50%+
GEICO
The cautionary tale: a decade of migration, costs reportedly 2.5× expectations. Now repatriating ≥50% of workloads by 2029 onto OpenStack/Kubernetes — "50% per core, 60%+ per GB" reductions.
~$75M
Dropbox
The original. Moved the majority of customer data off AWS onto custom infrastructure in 2015–16, reporting roughly $74.6M saved over two years.

When repatriation makes sense — and when cloud still wins

Strong repatriation candidatesKeep in public cloud
Steady-state, high-utilisation workloads (databases, ERP, VDI)Spiky, unpredictable, or seasonal demand
Production AI inference running continuouslyAI training bursts; experimental workloads
Data-heavy workloads incurring large egress chargesEarly-stage products with uncertain scale
Regulated workloads with data-sovereignty requirementsGlobal edge delivery and burst capacity
Predictable, well-understood apps with stable growthWorkloads deeply coupled to cloud-native managed services

The single most useful heuristic: if your workload is predictable, you are paying a premium for elasticity you don't use. Cloud is priced like an insurance product — you pay for optionality. Steady-state workloads rarely exercise that option.


The scoring matrix

Score each workload 1–5 on each dimension, apply the weighting, and total. Higher = stronger repatriation candidate.

DimensionWeightScore 1 (stay in cloud)Score 5 (repatriate)
Utilisation predictability×3Highly variable / spikyFlat, 24/7, predictable
Data gravity / egress exposure×3Minimal data movementLarge datasets, heavy outbound traffic
Cost gap (3-yr TCO)×3Cloud cheaper or equalOn-prem materially cheaper
Performance / latency sensitivity×2Tolerant; cloud regions fineLatency-critical; multi-tenant variance a risk
Compliance / sovereignty×2No special requirementStrict residency, auditability, isolation
Managed-service coupling×2 (inverse)Deeply coupled (serverless, managed ML)Standard compute/storage/networking only
Internal ops maturity×2No on-prem/automation capabilityStrong IaC, monitoring, on-call, hardware skills
Growth trajectory×1Rapid / uncertain scalingStable, mature

Top third → strong candidate: proceed to detailed TCO and a pilot. Middle third → hybrid: move the steady-state core, keep burst and managed-service-coupled components in cloud. Bottom third → keep in cloud and focus on FinOps instead.

flowchart TD
    A["Workload under review"] --> B{"Predictable, high utilisation?"}
    B -->|No| Z["Stay in cloud / optimise FinOps"]
    B -->|Yes| C{"Material 3-yr TCO gap vs cloud?"}
    C -->|No| Z
    C -->|Yes| D{"Heavy data gravity or egress cost?"}
    D -->|Yes| E{"Ops maturity to run it ourselves?"}
    D -->|No| F{"Compliance or sovereignty driver?"}
    F -->|Yes| E
    F -->|No| G["Hybrid: move core, keep burst in cloud"]
    E -->|No| H["Build capability first or use managed colo"]
    E -->|Yes| I["Repatriate: pilot lowest-risk workload"]
The decision path. Most workloads exit early to "stay in cloud" — that's the point.

TCO: cloud vs on-prem vs colocation

A credible business case compares three-to-five-year fully-loaded costs — not just the compute line. The most common error I see is comparing a cloud bill against bare hardware cost while ignoring power, staff, and the value of lost elasticity.

Cost componentPublic cloudOn-premises (owned)Colocation / hybrid
ComputePay-as-you-go + reservedCapEx hardware, amortised 3–5 yrsCapEx hardware in leased racks
StoragePer-GB + request/retrieval feesDisk/flash arrays (CapEx)Arrays in colo
Data transfer / egressSignificant; one-time exit waiver possibleBandwidth commits (fixed)Fixed bandwidth, fairer egress
Power & coolingIncluded in priceYour cost / capacity commitsIncluded in colo fee
Facilities / spaceNoneDC build or leaseRack/space rental
Staff / opsLower headcount, higher service markupHardware + platform engineersReduced (provider hands-and-eyes)
Elasticity valueHigh (you pay for it)Low (you provision for peak)Medium
Migration costOne-time: hardware, refactor, parallel runOne-time

Build the model over a 3–5 year horizon, include the one-time migration cost, and — crucially — include the cost of inaction: projected cloud spend growth. Achievable savings for suitable workloads cluster in the 30–60% range; treat the high end sceptically and model your own numbers. 37signals' reported >90% reduction is an outlier driven by an exceptionally predictable, optimised workload — don't use it as a planning baseline.


Risk and compliance

Repatriation trades one risk profile for another. The honest list:

  • Operational complexity. Cloud abstracts enormous operational work. Repatriating without investing in automation, monitoring, and on-call produces a worse outcome than staying.
  • Loss of elasticity. You provision for peak. Mis-sizing means either wasted CapEx or capacity shortfalls.
  • Sovereignty cuts both ways. Regulation (GDPR, sector rules) and the US CLOUD Act push regulated workloads toward owned/local infrastructure — but you then own the compliance burden you were renting.
  • Egress as a one-time tax. Factor exit egress into the case; use the 2024 waiver programmes.
  • Reversibility. Build on portable, standardised APIs (OpenStack, Kubernetes, S3-compatible storage) so you keep the option to move back. Treating architectural flexibility as a core capability is the mature posture.

The execution path

flowchart LR
    A["1 Portfolio assessment"] --> B["2 TCO model 3-5 yr"]
    B --> C["3 Choose landing platform"]
    C --> D["4 Pilot lowest-risk"]
    D --> E["5 Parallel run and validate"]
    E --> F["6 Incremental migration"]
    F --> G["7 Operate and optimise"]
Pilot the highest-savings, lowest-risk workload first. Parallel-run before cutover — skipping it is a classic failure mode.

Start with the highest-savings, lowest-risk workload; 37signals migrated app by app, not all at once. Run both environments before cutover, validate cost and performance, then move the next tranche. Repatriation is the beginning, not the end — continuous capacity planning and density optimisation are what compound the savings.


When to repatriate: the decision framework

The bottom line

The binary "cloud vs on-prem" framing is over. The winning 2026 strategy is deliberate placement: cloud where it earns its premium (variable demand, global edge, rapid experimentation, AI training bursts), and owned or leased infrastructure where predictability, bandwidth economics, data gravity, and control dominate. Repatriation isn't a retreat from cloud — it's the maturation of cloud strategy. Make it a portfolio decision, model the full TCO honestly, pilot before you commit, and keep your exits open in both directions.

Economic SovereigntyModel Compression