GIGAWATT MAP
Skip to content

About Gigawatt Map

Last updated 2026-04-18

What this is

Gigawatt Map is a public atlas of the physical infrastructure behind the AI buildout: every datacenter we can verify, the power plants and substations feeding it, the submarine cables connecting regions, and the operators and tickers with exposure.

It is not a trading tool, a real-estate listing service, or a permit database. It is a reference map for anyone trying to understand where AI capacity is actually being built, who is paying for it, and what grid is absorbing the load.

The project is open-source and open-data. The code is MIT; the data carries the licenses of its upstream sources, listed in full below.

Who it's for

We write the page with five readers in mind. If you are not one of these, it probably still works, but these are the use cases we actively design against:

  1. AI infrastructure investors — locating operator capex exposure in 30 seconds and confirming a name against a site and a substation before sizing a position.
  2. Analysts & journalists — cross-referencing announced MW against built MW, grounding a story in a map they can cite.
  3. Developers & operators — scouting power and interconnect adjacency for new builds.
  4. Curious technologists — seeing where the model they used last night physically lives, and what it is plugged into.
  5. Educators & students — an on-ramp to the material reality of AI compute.

How we built it

A Python pipeline (data-pipeline/) fetches each source on its own cadence, normalizes it against Pydantic schemas, and writes merged artifacts — per-source GeoJSON for ODbL compliance, plus PMTiles for the web map. Every pipeline run emits a manifest.json with source versions, row counts, and hashes so a given map tile is traceable back to the exact upstream snapshot.

The web app (apps/web/) is a Next.js App Router build served at the edge. API routes query the artifacts by bounding box; nothing hits a live upstream at request time.

Refresh cadences vary by source and are documented in the inventory below. “Last updated” at the top of this page reflects editorial review, not the freshest tile.

Methodology

MW estimation

For facilities without a disclosed IT load, we estimate capacity from building footprint using a watts-per-square-foot band that depends on the build vintage and operator class:

  • AI-dedicated campus → 300–500 W/sqft
  • Modern hyperscaler (≥2020) → 200–350 W/sqft
  • 2015–2019 build → 150–250 W/sqft
  • Pre-2015 build → 100–200 W/sqft

We multiply the gross footprint by a 60% IT-load fraction (the remainder being cooling, electrical, and office), then round to the nearest 0.1 MW. We always surface a range; point estimates would imply a precision we do not have.

Worked example
1,000,000 sqft AI campus, built 2024
1,000,000 × 0.60 × (300–500 W/sqft) ÷ 1,000,000 = 180.0300.0 MW

Confidence tiers

Every datacenter carries one of four confidence labels. We show this in the intelligence card so a reader can tell a regulator-filed number from our best guess:

  • verified — operator-disclosed or regulator-filed MW, location matches.
  • osm_only — OpenStreetMap footprint, no operator confirmation; MW estimated from sqft.
  • press_release — operator announced but not yet built or permitted; MW from the release.
  • estimated — no primary source; MW inferred from peer facilities of similar scale.

Substation proximity join

To associate a datacenter with the grid node feeding it, we find the nearest transmission substation within a 10 km radius, ranked first by voltage class (500 kV > 345 kV > 230 kV > 138 kV > lower) and then by distance. Substations further than 10 km are not joined; we prefer “unknown” to a false link. The join is a hint, not a delivery claim — actual service may come from a different feeder owned by a different utility.

Operator → ticker mapping

We map an operator to a public ticker only when the relationship is unambiguous: the operator is the listed parent, or a wholly-owned subsidiary whose capex rolls up cleanly. Joint ventures, private subsidiaries of public parents, and minority-invested neoclouds are left unmapped rather than guessed. The mapping table is hand-maintained and reviewed on every earnings season.

Data sources

Every layer the reader sees on the map is sourced from a public dataset or a hand-curated list maintained in this repo. License terms are honored: ODbL sources are redistributed per-source (never merged into a single download), and CC BY-NC-SA sources are used only for non-commercial display.

Datacenter geometry

SourceCoverageLicenseRefresh
OpenStreetMap (building=data_center)Global, ~20k featuresODbL 1.0Weekly
IM3 / PNNL Datacenter AtlasUS, peer-reviewedODbL 1.0Annual
NewCloudAtlasNeocloud / GPU operatorsODbL 1.0Rolling
Hand-curated AI campusesNamed flagship buildsCC BY-SA 4.0On announcement

Power infrastructure

SourceCoverageLicenseRefresh
Global Energy Monitor (GEM / GIPT)Global plants ≥20 MWCC BY 4.0Semi-annual
WRI Global Power Plant DatabaseGlobal, 35k plantsCC BY 4.0Irregular (v1.3 2021)
EIA Form 860US generatorsPublic domainAnnual
Catalyst Cooperative PUDLUS utility + generator dataCC BY 4.0Monthly
OSM power=substation / power=lineGlobal, voltage-taggedODbL 1.0Weekly
GEM Gas Finance TrackerNew gas builds, financingCC BY 4.0Quarterly

Submarine cables & interconnect

SourceCoverageLicenseRefresh
TeleGeography submarine cable mapGlobal, landing pointsCC BY-NC-SA 3.0Rolling
OSM telecom=exchangeIXPs, carrier hotelsODbL 1.0Weekly
PeeringDBIXPs, facilities, ASNsCC BY 4.0Daily

Cloud provider regions

SourceCoverageLicenseRefresh
AWS / Azure / GCP / Oracle / Alibaba / IBM / DigitalOcean / Vultr / AkamaiHand-curated from vendor docsPer-vendor ToSMonthly

Opposition & environmental risk

SourceCoverageLicenseRefresh
Data Center WatchUS opposition incidentsEditorial / fair useRolling
datacentertracker.orgUS permits, litigationEditorial / fair useRolling
FracTrackerFossil-fuel proximityCC BY-NC-SA 4.0Rolling
WRI Aqueduct Water Risk AtlasGlobal water stressCC BY 4.0Irregular
EPA FRS / ECHOUS regulated facilitiesPublic domainWeekly

Financial exposure

SourceCoverageLicenseRefresh
SEC EDGAR (10-K, 10-Q, 8-K)US-listed operatorsPublic domainOn filing
Finnhub (free tier)Global quotesAPI ToSReal-time (delayed)
Yahoo Finance (unofficial)Global quotes, fundamentalsAPI ToSReal-time (delayed)
Alpha VantageEquities, FXAPI ToSDaily

Announcements & news

SourceCoverageLicenseRefresh
Datacenter Dynamics (DCD)Global industry newsEditorial / fair useDaily
Data Center FrontierUS industry newsEditorial / fair useDaily
Reuters / Bloomberg / FT / WSJFinancial coverageEditorial / fair useDaily

Known gaps

We prefer to be loud about what the map does not cover:

  • Chinese datacenter coverage is limited. OSM tagging is sparse in mainland China and regulatory filings are not public, so our Chinese footprint is best-effort from press coverage and operator disclosures.
  • Indian coverage is improving as hyperscaler builds are announced, but lags the US and EU.
  • Russia, Belarus, and Central Asia are poor — we would rather show gaps than guess.
  • MW values for pre-2020 colo facilities are frequently estimated from footprint, not disclosed. Treat them as ranges.
  • Neoclouds announce and lease capacity faster than our refresh cadence. If you see a fresh announcement, it probably beats the map.

Contribute

The repository lives on GitHub. Open a pull request against data/curated/ to add or correct a facility; every row needs a primary source_url. For larger contributions — a new source module, a methodology change — open an issue first.

Press and partnership inquiries: hello@gigawattmap.com.

Team

Sudhendra Kambhamettu, lead. The project is open to collaborators who want to own a data source, a region, or a methodology question — see the GitHub issues tagged help-wanted.

Press & citations

None yet. When coverage appears, it will be listed here with a link to the original.

Licensing

Code is released under the MIT license. Data retains the license of its upstream source, listed in the inventory above. Attribution is required when redistributing any layer — the attribution string for each source is embedded in the corresponding manifest.json entry.