TL;DR
- A 'rack' that draws 100 kW continuous is no longer exotic — every NVIDIA GB200 NVL72 rack lands in this class, and HGX-B200/H200-dense pods (12-15 SXM5 servers) approach it.
- 100 kW is the threshold above which air cooling is infeasible, busbar power distribution becomes the only practical topology, and floor loading, aisle width, and ceiling height all need to be re-engineered.
- Direct-to-chip liquid cooling is the standard thermal solution; immersion is reserved for densities above ~150 kW; rear-door heat exchangers handle only the chassis-air residual.
- Capex per 100 kW rack-position (including share of cooling plant, electrical, structural, and network) ranges from $190k to $500k — i.e. $1,875-$5,000 per installed kW, materially lower than legacy 15 kW air-cooled per-kW economics.
- Yobitel NeoCloud's reference rack design is a 100 kW DLC-cooled position used for every GB200 NVL72 deployment and for high-density H100/H200 pods, with the same envelope offered to UK and EU customers under NCSC OFFICIAL alignment.
Overview#
For two decades, 'high-density' meant 15 kW per rack and 'extreme' meant 30 kW. Both numbers were set by what conventional perforated-tile air cooling can dissipate from a 600 mm × 1200 mm cabinet at reasonable supply-air conditions (ASHRAE A1/A2 envelope). Anything beyond 30 kW required heroic engineering — cold-aisle containment plus chimney rear doors plus tightly controlled bypass — and even then it topped out around 40-50 kW.
AI infrastructure broke those numbers. An HGX-H100 baseboard with eight 700 W GPUs dissipates 5.6 kW from the GPUs alone; the full 8U server lands around 10 kW; twelve such servers in a 48U rack already hit ~120 kW. NVIDIA's GB200 NVL72 — the rack-scale Blackwell platform — packages 72 GPUs and 36 Grace CPUs in a single rack drawing roughly 120 kW continuous. AMD MI300X and MI325X HGX-class boards land in the same band. 100 kW per rack is therefore not aspirational; it is what current frontier compute requires to fit in a single cabinet without sharding a tightly-coupled NVLink/Infinity Fabric domain across multiple racks.
Every facility subsystem changes at this density: power feeds move from 32 A cords to 250-400 A busbar tap-offs, cooling moves from CRAH-fed air to DLC + CDU, floors move from 1,000 kg/m² to 1,500-2,500 kg/m², and network moves from 25/100 G ToR to 400/800 G optics with overhead NVLink Switch trays. This entry helps you size a 100 kW rack programme — power, cooling, structural, network, and capex — and understand the reference rack design Yobitel NeoCloud uses across UK and EU regions for every GB200 NVL72 and high-density H100/H200 deployment.
A 100 kW rack is the natural unit for ~12-15 HGX-H100 SXM5 servers (10 kW each) OR one GB200 NVL72 rack-scale system (~120 kW). The figure is the practical Schelling point between what utilities can feed, what dry coolers can reject, and what one cabinet can physically structure without splitting an NVLink domain.
Specifications#
The envelope below sits in the middle of published OCP Open Rack v3, NVIDIA GB200 NVL72, and the major server OEM (Supermicro, Dell, HPE, Lenovo) reference designs. Verify each parameter with the specific OEM datasheet before committing to a build.
| Parameter | Typical 100 kW class | Notes |
|---|---|---|
| Continuous rack power | 80-120 kW | NVL72 at the top of the band; 12-15 HGX-H100 servers in the middle. |
| Peak transient | 1.2× continuous | Job launches and synchronisation peaks; sized into the UPS. |
| Utility supply | 415 V 3-phase or 400/480 V | UK three-phase 415 V dominant; US 480 V common. |
| Per-rack ampacity | 250-400 A per feed | Two feeds A/B for 2N or N+1 source resilience. |
| Power topology | Overhead busbar with tap-off boxes | Alt: high-amperage point-of-rack PDU; rarely individual cords above ~50 kW. |
| Cooling supply | DLC at 25-32 °C (ASHRAE W3/W4) | Rear-door HX or in-row cooling captures the 5-15 % chassis-air residual. |
| Coolant return | 35-60 °C | Delta-T of 8-15 °C across the cold plate. |
| Liquid flow per rack | 100-180 L/min | Sized at ~1.5 L/min per kW liquid-cooled load. |
| Floor loading | 1,500-2,500 kg/m² | NVL72 ships as a single ~1,400-1,600 kg unit; pods of HGX-H100 cluster around 1,200-1,800 kg. |
| Rack frame | Standard 19-inch EIA-310 or OCP 21-inch | OCP Open Rack v3 for hyperscale; 19-inch for enterprise / colocation. |
| Rack height | 47-52U or 1OU (NVL72) | Higher than legacy 42U; ceiling clear of 4.5-6 m needed. |
| Network | 72+ × 400 G or 800 G optics per rack | Plus NVLink Switch trays for GB200; AOC/DAC + transceiver mix. |
| Cabling area | Top-of-rack + overhead trays | Power busbar overhead, liquid manifolds vertical at rear, network overhead front. |
OCP Open Rack v3 standardises 21-inch IT space, 48 V DC busbar, and unified bus-style power distribution. Yobitel NeoCloud uses 19-inch EIA-310 racks with overhead AC busbar for the enterprise-aligned variant (familiar service ergonomics, broader server vendor choice) and OCP-style for hyperscale-density partner deployments.
Architecture#
A 100 kW rack is not a 15 kW rack with bigger pipes — it is a fundamentally different topology where power, liquid, and network all enter from above and the room itself becomes thermally neutral. The block diagram below maps the production envelope used by Yobitel NeoCloud and by hyperscale operators on the same density class.
- Power enters from overhead busbar — no floor PDU islands, no per-rack whip cables. Tap-off boxes can be moved or up-rated without re-cabling the row.
- Liquid enters from overhead headers, drops vertical manifolds at the rear of each rack, and connects to each server via dripless blind-mate quick disconnects (OCP ACS spec).
- Network enters from overhead trays. 400 G and 800 G optics dominate, with NVLink Switch fabric for GB200 racks confined to the rack itself (rack-scale switch domain).
- The room is no longer organised around hot and cold aisles — DLC captures 85-95 % of heat into the liquid loop, RDHx captures the chassis-air residual, and the data hall sees near-neutral exhaust.
100 kW rack — production topology
====================================================
Overhead (above row, 4.5-6 m clear ceiling):
+----------------------------------------------+
| Busbar 400 V 3-phase, 800-1250 A run |
| -> Tap-off box per rack (250-400 A) |
| Liquid supply + return headers (DN50-DN80) |
| Network trunks (400/800 G + NVLink Switch) |
+----------------------------------------------+
| | |
power liquid network
v v v
+----------------------------------------------+
| Rack (47-52U EIA-310, or 1OU NVL72) |
| |
| Top: rPDU A + rPDU B (busbar tap), |
| quick-disconnects to vertical manifold,|
| ToR or NVLink Switch trays |
| |
| Mid: 12-15 HGX servers (H100/H200/B200) |
| or 1 GB200 NVL72 chassis stack |
| |
| Bottom: in-rack CDU (40-80 kW, optional) |
| or pass-through to in-row CDU |
+----------------------------------------------+
Adjacent to row:
+----------------------------------------------+
| In-row CDU (150-300 kW, N+1) |
| Leak detection rope along base of every rack |
| RDHx on rear of rack (for chassis-air |
| residual, on same secondary loop) |
+----------------------------------------------+
Facility loop (out to dry cooler / adiabatic):
Supply 30-35 C, return 40-55 C; free cooling
year-round in temperate UK/EU climates.Form Factor, Power, and Thermal Envelope#
100 kW racks preserve the standard 19-inch EIA-310 service ergonomics (or OCP 21-inch for hyperscale density) but reshape every adjacent envelope. The table below contrasts what a 100 kW class rack demands against the legacy 15 kW baseline and the GB200 NVL72 reference design.
| Subsystem | Legacy 15 kW | 100 kW HGX pod | GB200 NVL72 (120 kW) |
|---|---|---|---|
| Cooling | CRAH + hot-aisle containment | DLC + RDHx + CDU | Integrated DLC, vendor-shipped |
| Power feed | 2× 32 A 3-phase whip | Busbar 400 A tap-off A/B | Busbar 400 A tap-off A/B |
| Cabling area | Front + rear | Front + rear + overhead | Front + rear + overhead + side |
| Floor loading | ~1,000 kg/m² | 1,500-2,000 kg/m² | 2,000-2,500 kg/m² |
| Rack frame | Standard 42U 19-inch | 47-52U 19-inch reinforced | 1OU pre-integrated chassis |
| Plumbing at rack | None | Quick-disconnects, manifold, drip pan | Pre-plumbed; QDs to facility loop |
| Network port density | 48 × 25/100 G | 72+ × 400/800 G | 72 × 800 G + NVLink Switch |
| Service access | Front + rear | Front + rear + top | Front, rear, top (full rack swap) |
| Acoustic profile | 75-85 dB(A) | 65-75 dB(A) (DLC reduces fans) | 60-70 dB(A) (largely DLC) |
Many existing data halls advertise '100 kW-ready' but mean the power feed only. Verify floor loading, ceiling clear height, water supply/return capacity, and dry-cooler heat rejection at design stage. A 30 kW-rated hall retrofitted to 100 kW racks typically needs structural reinforcement and a new facility-water main, not just bigger breakers.
Vendor Ecosystem#
The 100 kW rack supply chain is mature in 2026 — multiple OEMs at every layer with broadly interoperable form factors. The map below names the active vendors at the rack, busbar, CDU, and cabling layers; cold plates and quick-disconnects are covered in the [Direct-to-chip](direct-to-chip-liquid-cooling) entry.
| Layer | Active vendors (2026) | Notes |
|---|---|---|
| Rack OEMs (enterprise 19-inch) | Vertiv, Schneider Electric, nVent, Eaton, Rittal | Reinforced frames rated to 2,500 kg loaded. |
| Rack OEMs (OCP 21-inch) | Wiwynn, Quanta QCT, Inspur, ZT Systems | Open Rack v3, 48 V DC busbar. |
| Pre-integrated GB200 NVL72 | Supermicro, Dell PowerEdge XE, HPE Cray XD, Lenovo SR685a, Wiwynn | NVIDIA-qualified partners; rack ships as one SKU. |
| Overhead busbar | Starline, EAE, Eaton xEnergy, Schneider Canalis, Legrand Zucchini | 400-1250 A 3-phase runs; tap-off boxes are plug-and-play. |
| In-row CDU | Vertiv XDU, nVent CDU1350, Schneider EcoStruxure, Stulz CyberCool, Motivair | 150 kW-1.5 MW; N+1 pump. |
| In-rack CDU | CoolIT CHx40/CHx80, Vertiv XDU450, Motivair RDHx-CDU | 40-80 kW for pilot/edge. |
| Rear-door HX | Motivair ChilledDoor, nVent SCHROFF, USystems ColdLogik, Vertiv Liebert | 30-60 kW residual capture; same secondary loop as DLC. |
| NVLink Switch trays | NVIDIA (rack-scale only for NVL72) | Confined to rack; cabling internal. |
| Top-of-rack switches | NVIDIA Spectrum-X / Quantum-X, Arista 7800R3, Cisco Nexus 9800 | 400/800 G port density for AI fabric. |
| Structured cabling | Belden, CommScope, Panduit, Corning | OS2 single-mode dominant for 400G+ inter-rack; AOC/DAC for in-rack. |
| Leak detection | RLE, TraceTek, RDM, Dorlen | Rope along rack base + point sensors at QDs. |
| DCIM / monitoring | Vertiv Trellis, Schneider EcoStruxure IT, Sunbird, Nlyte | Per-rack power, temperature, humidity, water flow telemetry. |
Sizing — Racks per Cluster, Racks per MW#
Sizing a 100 kW rack programme is top-down: start with the cluster scale (number of GPUs / NVLink domains), back out to rack count, then to MW of installed capacity, then to dry-cooler tonnage and utility-feed requirements. The shortcuts below match Yobitel NeoCloud's actual cluster planning method.
- Rack-to-MW ratio: 1 MW = 8-10 usable racks (not 10) — the gap accounts for power-conversion losses, cooling overhead, and non-IT loads. Use a planning factor of 1.10-1.15 (rack nameplate × racks × overhead = MW).
- Floor-plate area: a 100 kW rack with overhead busbar, CDU adjacency, and service aisles needs ~2.5-3.5 m² of data-hall floor including its share of aisle. A 5 MW build is therefore ~150-170 m² of densified white space.
- Utility-feed lead time: in the UK, 11 kV connections typically take 6-12 months from application to energisation; 33 kV is 12-24 months; 132 kV intake can be 24-48 months. Plan the substation work before the first server arrives.
- Heat-rejection sizing: dry coolers at warm-water DLC (W3/W4) need ~1.05-1.15× the IT load in nameplate cooling capacity to handle the 99 %-design wet bulb. Add adiabatic spray for sites that exceed the design wet bulb 1-2 weeks per year.
- Network sizing: each 100 kW rack typically lands 72-96 × 400 G or 800 G port equivalents into the leaf fabric. At 5 MW (40-48 racks), that is ~3,000-4,000 fabric ports — sized into Spectrum-X / Quantum-X reference designs.
| Facility scale | Racks at 100 kW | GPUs (HGX-H100) | GB200 NVL72 racks | Heat to reject | Indicative utility feed |
|---|---|---|---|---|---|
| 1 MW pod (edge / small cluster) | 8-10 usable racks | ~768-960 | 8 NVL72 | 1.0-1.15 MW | 1× 11 kV / 1.6 MVA tx |
| 5 MW (production AI training) | 40-48 racks | ~3,840-4,608 | 40-48 NVL72 | 5.0-5.75 MW | 2× 33 kV / 3 MVA tx |
| 10 MW (large cluster) | 85-95 racks | ~8,160-9,120 | 85-95 NVL72 | 10-11.5 MW | 2-3× 33 kV / 5 MVA tx |
| 50 MW (hyperscale-class) | 430-475 racks | ~41,000-45,500 | 430-475 NVL72 | 50-58 MW | Dedicated 132 kV intake |
| 100 MW+ (frontier) | 860-950 racks | ~82,000-91,000 | 860-950 NVL72 | 100-115 MW | Substation-grade intake |
Yobitel NeoCloud sizes UK clusters around 5 MW pods (40-48 racks of 100 kW each) as the unit of capacity — large enough to amortise the substation, cooling plant, and operations team, small enough to incrementally add a second pod without re-engineering the first.
Cost#
Capital cost ranges below are USD per installed kW (the figure that matters at AI scale) and per rack-position (the figure facility teams quote). They include the share of cooling plant, electrical distribution, network fabric, structural reinforcement, and project costs; they exclude the GPUs themselves and the long-haul fibre tail.
- Per-kW economics: 100 kW racks cost more per rack than 15 kW but materially less per kW of installed capacity. At AI scale (1+ MW), per-kW is the figure that matters because the cluster size is set by GPU count, not rack count.
- Capex split: roughly 35-45 % cooling (DLC + CDU + dry cooler), 25-30 % electrical (UPS, switchgear, busbar, transformer), 15-20 % structural (slab, raised floor, ceiling, BMS), 10-15 % network fabric (leaf/spine switches, optics, cabling).
- Opex baseline: at $0.18-0.32 per kWh UK/EU commercial electricity, a 100 kW rack at 80 % utilisation consumes ~$126k-$224k per year in power. PUE 1.15 vs 1.40 swings that by 22 %.
- Yobitel NeoCloud pricing reference: H100 SXM5 capacity in 100 kW DLC racks lands at the published NeoCloud per-GPU per-hour rate (Omniscient Compute exposes the live USD pricing per region) and absorbs the rack-level capex/opex into the per-GPU rate.
| Build type | Capex per cooled kW (USD) | Capex per rack-position (USD, 100 kW) | Notes |
|---|---|---|---|
| Greenfield hyperscale (5+ MW pods) | $1,875-$2,800 | $190k-$280k | Lowest end; benefit of standardisation, scale. |
| Greenfield enterprise / sovereign | $2,800-$4,200 | $280k-$420k | Tier III/IV resilience, smaller pod amortisation. |
| Brownfield retrofit (existing shell) | $3,500-$5,000 | $350k-$500k | Structural reinforcement, new water main, utility upgrade. |
| Edge / containerised | $4,000-$5,500 | $400k-$550k | Smaller scale, but pre-fabricated reduces project cost. |
| Reference: legacy 15 kW air-cooled | $1,000-$2,500 | $15k-$38k per rack (only) | Cheaper per rack — but $1,000-$2,500 per kW is similar; the density premium is largely scale, not per-kW. |
Migration — From 30 kW Air-Cooled to 100 kW DLC#
Most operators arrive at 100 kW from either a 15-30 kW air-cooled brownfield or a clean-sheet greenfield. The migration path matters as much as the steady-state design — the table below captures the dominant patterns.
| Path | Approach | Lead time | Capex impact | Notes |
|---|---|---|---|---|
| 15 kW air → 100 kW DLC, brownfield retrofit | Pull existing racks, install busbar overhead, run new facility-water main, install in-row CDU, deploy DLC-ready servers | 12-18 months | $350k-$500k per rack-position | Structural reinforcement common; utility upgrade often the critical path. |
| 30 kW RDHx → 100 kW DLC | Reuse facility-water loop, add DLC manifolds at rear of rack, swap servers | 6-9 months | $280k-$400k per rack-position | Simplest migration if RDHx loop was sized for future expansion. |
| 15 kW air → 100 kW DLC, new build | Greenfield slab, busbar from day one, plumbing risers pre-installed, dry-cooler plant | 18-36 months (incl. utility) | $190k-$420k per rack-position | Lowest per-kW, longest lead time. |
| Lift-and-shift to a NeoCloud-style multi-tenant facility | Migrate workloads to a pre-built 100 kW rack facility; no capex | 1-3 months per workload | Zero capex; per-GPU-hour opex | Trade-off: less control, faster time to capacity. Yobitel NeoCloud offers this path in UK + EU regions. |
| Edge containerised DLC pod | Drop a pre-fabricated DLC pod (1-4 racks) onto a pad next to existing facility | 3-6 months | $1.6-2.2 M for a 4-rack pod | Useful for sovereign edge, latency-bound workloads. |
If the workload is GPU training or inference and the timeline is tight, the lift-and-shift path to Yobitel NeoCloud (UK or EU regions, NCSC OFFICIAL aligned) usually beats a brownfield retrofit on both time and total cost. Reserve self-build for cases where data sovereignty, edge presence, or sustained 5+ year capacity demand justifies the capex.
Pitfalls and Operational Notes#
Most 100 kW rack failures are not silicon failures — they are facility failures concentrated at the power-feed coordination, cooling commissioning, and network top-of-rack stages. The discipline below is what separates a first-time-installer schedule slip from a stable production cluster.
- Utility-feed lead time: getting 100 kW per rack ×40 racks (a 5 MW pod) requires utility coordination that takes 6-24 months in most UK and EU markets. Apply for the connection before the rack OEM PO is placed.
- Cooling commissioning: every cold plate, every quick-disconnect, every manifold connection must be pressure-tested at 1.5× working pressure for 24 hours and chemistry-sampled before GPUs energise. Skipping commissioning is the most common first-year incident cause (see Direct-to-chip entry for full leak-handling discipline).
- Hot-spot mismanagement in mixed-density rooms: when one rack is 100 kW DLC and adjacent racks are 15 kW air, the air-side thermal model can break in surprising ways — bypass airflow goes the wrong direction, return-air temperature spikes, and the air-cooled racks throttle. Segregate by density class or use CFD modelling.
- Network top-of-rack congestion: 72-96 × 400/800 G optics per rack produce real cabling complexity. Plan structured cabling, labelling, and slack management at design time; once the rack is populated, recabling is a multi-hour outage.
- Floor loading verification: a GB200 NVL72 arrives as one ~1,500 kg unit on castors. Verify slab loading along the entire delivery path (loading dock, corridor, data-hall doorway, raised-floor stringer) before the truck arrives. Re-routing a delivery costs days.
- Service ergonomics: services that were at the back of the rack now compete with manifolds, busbar tap-offs, and rear-door HX. Document service procedures for every common operation (server swap, cold-plate replacement, optic swap) before production.
- Power-feed phasing: balance the load across the three phases at the rack PDU level, not just at the row level. Unbalanced rack-level loading causes neutral-current heating that does not show up on aggregate metering.
- Acoustic environment: 100 kW racks are quieter than equivalent 15 kW racks (DLC removes most fans) but the row-level airflow noise from RDHx and in-row cooling still requires hearing protection at sustained exposure.
- First-deployment learning curve: a customer's first 100 kW rack deployment typically runs 15-25 % over benchmark on schedule and capex due to commissioning rework, runbook development, and ops-team training. Plan a 30-day shakedown period before scheduling production workloads.
The single largest schedule slip on first-time 100 kW builds is utility-feed energisation. Treat the substation work as a parallel critical path from day one — not as a follow-on to building works.
Where 100 kW Racks Sit in the Yobitel Stack#
Yobitel NeoCloud's reference rack design is a 100 kW DLC-cooled position. Every GB200 NVL72 deployment in NeoCloud lands on this design natively; high-density H100/H200 pods (12-15 SXM5 servers per rack) use the same envelope. UK and EU sovereign regions deploy this rack class under NCSC OFFICIAL alignment, with the rack-level capex/opex absorbed into the per-GPU-hour rate published in Omniscient Compute.
For customers who want to consume the density without building it, NeoCloud is the lift-and-shift path. For customers building their own facility, our managed-ops team has commissioned 100 kW racks under both greenfield and brownfield-retrofit constraints and can hand over a complete commissioning runbook, leak-detection coverage map, and 30-day acceptance criteria as part of the Professional Services engagement.
References#
100 kW rack design crosses several authoritative reference sets. NVIDIA's GB200 NVL72 architecture brief defines the rack-scale platform; OCP Open Rack v3 standardises the 21-inch IT space and 48 V DC busbar; ASHRAE TC 9.9 governs the thermal envelope; Uptime Institute publishes high-density deployment guidance and Tier classification. A serious build cross-checks all four.