About Gigawatt Map
Last updated 2026-04-18
What this is
Gigawatt Map is a public atlas of the physical infrastructure behind the AI buildout: every datacenter we can verify, the power plants and substations feeding it, the submarine cables connecting regions, and the operators and tickers with exposure.
It is not a trading tool, a real-estate listing service, or a permit database. It is a reference map for anyone trying to understand where AI capacity is actually being built, who is paying for it, and what grid is absorbing the load.
The project is open-source and open-data. The code is MIT; the data carries the licenses of its upstream sources, listed in full below.
Who it's for
We write the page with five readers in mind. If you are not one of these, it probably still works, but these are the use cases we actively design against:
- AI infrastructure investors — locating operator capex exposure in 30 seconds and confirming a name against a site and a substation before sizing a position.
- Analysts & journalists — cross-referencing announced MW against built MW, grounding a story in a map they can cite.
- Developers & operators — scouting power and interconnect adjacency for new builds.
- Curious technologists — seeing where the model they used last night physically lives, and what it is plugged into.
- Educators & students — an on-ramp to the material reality of AI compute.
How we built it
A Python pipeline (data-pipeline/) fetches each source on its own cadence, normalizes it against Pydantic schemas, and writes merged artifacts — per-source GeoJSON for ODbL compliance, plus PMTiles for the web map. Every pipeline run emits a manifest.json with source versions, row counts, and hashes so a given map tile is traceable back to the exact upstream snapshot.
The web app (apps/web/) is a Next.js App Router build served at the edge. API routes query the artifacts by bounding box; nothing hits a live upstream at request time.
Refresh cadences vary by source and are documented in the inventory below. “Last updated” at the top of this page reflects editorial review, not the freshest tile.
Methodology
MW estimation
For facilities without a disclosed IT load, we estimate capacity from building footprint using a watts-per-square-foot band that depends on the build vintage and operator class:
- AI-dedicated campus → 300–500 W/sqft
- Modern hyperscaler (≥2020) → 200–350 W/sqft
- 2015–2019 build → 150–250 W/sqft
- Pre-2015 build → 100–200 W/sqft
We multiply the gross footprint by a 60% IT-load fraction (the remainder being cooling, electrical, and office), then round to the nearest 0.1 MW. We always surface a range; point estimates would imply a precision we do not have.
Confidence tiers
Every datacenter carries one of four confidence labels. We show this in the intelligence card so a reader can tell a regulator-filed number from our best guess:
- verified — operator-disclosed or regulator-filed MW, location matches.
- osm_only — OpenStreetMap footprint, no operator confirmation; MW estimated from sqft.
- press_release — operator announced but not yet built or permitted; MW from the release.
- estimated — no primary source; MW inferred from peer facilities of similar scale.
Substation proximity join
To associate a datacenter with the grid node feeding it, we find the nearest transmission substation within a 10 km radius, ranked first by voltage class (500 kV > 345 kV > 230 kV > 138 kV > lower) and then by distance. Substations further than 10 km are not joined; we prefer “unknown” to a false link. The join is a hint, not a delivery claim — actual service may come from a different feeder owned by a different utility.
Operator → ticker mapping
We map an operator to a public ticker only when the relationship is unambiguous: the operator is the listed parent, or a wholly-owned subsidiary whose capex rolls up cleanly. Joint ventures, private subsidiaries of public parents, and minority-invested neoclouds are left unmapped rather than guessed. The mapping table is hand-maintained and reviewed on every earnings season.
Data sources
Every layer the reader sees on the map is sourced from a public dataset or a hand-curated list maintained in this repo. License terms are honored: ODbL sources are redistributed per-source (never merged into a single download), and CC BY-NC-SA sources are used only for non-commercial display.
Datacenter geometry
| Source | Coverage | License | Refresh |
|---|---|---|---|
| OpenStreetMap (building=data_center) | Global, ~20k features | ODbL 1.0 | Weekly |
| IM3 / PNNL Datacenter Atlas | US, peer-reviewed | ODbL 1.0 | Annual |
| NewCloudAtlas | Neocloud / GPU operators | ODbL 1.0 | Rolling |
| Hand-curated AI campuses | Named flagship builds | CC BY-SA 4.0 | On announcement |
Power infrastructure
| Source | Coverage | License | Refresh |
|---|---|---|---|
| Global Energy Monitor (GEM / GIPT) | Global plants ≥20 MW | CC BY 4.0 | Semi-annual |
| WRI Global Power Plant Database | Global, 35k plants | CC BY 4.0 | Irregular (v1.3 2021) |
| EIA Form 860 | US generators | Public domain | Annual |
| Catalyst Cooperative PUDL | US utility + generator data | CC BY 4.0 | Monthly |
| OSM power=substation / power=line | Global, voltage-tagged | ODbL 1.0 | Weekly |
| GEM Gas Finance Tracker | New gas builds, financing | CC BY 4.0 | Quarterly |
Submarine cables & interconnect
| Source | Coverage | License | Refresh |
|---|---|---|---|
| TeleGeography submarine cable map | Global, landing points | CC BY-NC-SA 3.0 | Rolling |
| OSM telecom=exchange | IXPs, carrier hotels | ODbL 1.0 | Weekly |
| PeeringDB | IXPs, facilities, ASNs | CC BY 4.0 | Daily |
Cloud provider regions
| Source | Coverage | License | Refresh |
|---|---|---|---|
| AWS / Azure / GCP / Oracle / Alibaba / IBM / DigitalOcean / Vultr / Akamai | Hand-curated from vendor docs | Per-vendor ToS | Monthly |
Opposition & environmental risk
| Source | Coverage | License | Refresh |
|---|---|---|---|
| Data Center Watch | US opposition incidents | Editorial / fair use | Rolling |
| datacentertracker.org | US permits, litigation | Editorial / fair use | Rolling |
| FracTracker | Fossil-fuel proximity | CC BY-NC-SA 4.0 | Rolling |
| WRI Aqueduct Water Risk Atlas | Global water stress | CC BY 4.0 | Irregular |
| EPA FRS / ECHO | US regulated facilities | Public domain | Weekly |
Financial exposure
| Source | Coverage | License | Refresh |
|---|---|---|---|
| SEC EDGAR (10-K, 10-Q, 8-K) | US-listed operators | Public domain | On filing |
| Finnhub (free tier) | Global quotes | API ToS | Real-time (delayed) |
| Yahoo Finance (unofficial) | Global quotes, fundamentals | API ToS | Real-time (delayed) |
| Alpha Vantage | Equities, FX | API ToS | Daily |
Announcements & news
| Source | Coverage | License | Refresh |
|---|---|---|---|
| Datacenter Dynamics (DCD) | Global industry news | Editorial / fair use | Daily |
| Data Center Frontier | US industry news | Editorial / fair use | Daily |
| Reuters / Bloomberg / FT / WSJ | Financial coverage | Editorial / fair use | Daily |
Known gaps
We prefer to be loud about what the map does not cover:
- Chinese datacenter coverage is limited. OSM tagging is sparse in mainland China and regulatory filings are not public, so our Chinese footprint is best-effort from press coverage and operator disclosures.
- Indian coverage is improving as hyperscaler builds are announced, but lags the US and EU.
- Russia, Belarus, and Central Asia are poor — we would rather show gaps than guess.
- MW values for pre-2020 colo facilities are frequently estimated from footprint, not disclosed. Treat them as ranges.
- Neoclouds announce and lease capacity faster than our refresh cadence. If you see a fresh announcement, it probably beats the map.
Contribute
The repository lives on GitHub. Open a pull request against data/curated/ to add or correct a facility; every row needs a primary source_url. For larger contributions — a new source module, a methodology change — open an issue first.
Press and partnership inquiries: hello@gigawattmap.com.
Team
Sudhendra Kambhamettu, lead. The project is open to collaborators who want to own a data source, a region, or a methodology question — see the GitHub issues tagged help-wanted.
Press & citations
None yet. When coverage appears, it will be listed here with a link to the original.
Licensing
Code is released under the MIT license. Data retains the license of its upstream source, listed in the inventory above. Attribution is required when redistributing any layer — the attribution string for each source is embedded in the corresponding manifest.json entry.