AI·Infra·Map

About Us

Our mission: AI-native content distribution and app development.

We map AI infrastructure end to end — sourced and layered — then put AI at the core of the workflow: generating the content and building the applications on top of that map, not just assisting.

What we're building

aiinframap is two things at once: an AI Infrastructure Map — the engineering reference for the physical layer behind artificial intelligence — and an AI-native engine that turns that map into content and applications.

The map covers what every AI workload actually runs on: the electrons, optics, memory, silicon, systems, and models. Most coverage stops at the apps; we map the stack underneath them, bottleneck by bottleneck, and keep every claim traceable to a source. On top of that map, AI does the work — generating the analysis, distributing it, and building the tools — grounded in the same sourced data, not in hype.

Our mission: AI-native content distribution and app development, built on a rigorous AI Infrastructure Map.

How we layer the AI industry — the 8-layer stack

Every company, topic, product, and signal is placed on one canonical model, L1 → L8, running from the power plant to the prompt:

Layer	What it covers
L1 · Energy & DC Infrastructure	Power generation (incl. nuclear / SMR), electrical equipment, cooling, construction, datacenter REITs, storage & microgrids — everything that powers, cools, and houses AI datacenters.
L2 · Optical & Networking	800G / 1.6T optics, CPO & silicon photonics, InfiniBand vs Ethernet, coherent DCI, AEC & retimers — the fabric that moves data inside and between clusters.
L3 · HBM & Advanced Packaging	HBM / DRAM, CXL, CoWoS / SoIC, glass substrates — the memory-bandwidth and packaging bottleneck of every training cluster.
L4 · Chips	GPUs, custom ASICs, semicap equipment, foundry, EDA & IP, power semiconductors (SiC / GaN) — the silicon and the design + manufacturing stack behind it.
L5 · Training & Inference	GPU cloud & neocloud, inference serving, training-data services, cluster storage, server & rack OEMs, hyperscaler compute — where compute actually gets operated.
L6 · Foundation Models	Frontier & open-weights labs, robotics foundation models — the labs that train the weights everyone else builds on.
L7 · Middleware	Vector databases, agent frameworks & MCP, eval & observability, AI gateways, data + AI platforms — the plumbing between models and applications.
L8 · Applications	Coding tools, enterprise adoption, vertical apps, and event-anchored entities — what end users touch.

Topics — 33 focused landscapes

Inside those layers we run 33 topic hubs, each a deep landscape on one part of the stack. A selection:

L1 — Datacenter Power · Cooling · Buildout · Datacenter REITs · Nuclear & SMR Power · On-Site Power · Gas-Turbine Supply Crunch
L2 — Optical Modules · AI Networking · InfiniBand vs Ethernet · CPO & Silicon Photonics · AEC & Retimers
L3 — HBM Memory · Advanced Packaging (CoWoS / SoIC) · CXL Memory Fabric
L4 — AI ASICs · AI Accelerator Startups · EDA & Chip-Design IP · Semiconductor Equipment · Power Semiconductors (SiC / GaN)
L5 — GPU Cloud & NeoCloud · AI Data Stack · AI Servers & Rack Systems
L6 — Frontier Model Labs · Robotics Foundation Models & Humanoids
L7 — Vector Databases · AI Agent Frameworks & MCP · AI Eval & Observability · AI Gateways
L8 — AI Coding Tools · Enterprise AI Adoption · AI Voice Agents

Companies — 100+ tracked

Each company is classified to its primary layer (with a watchlist note when it spans several). A sample of what we track: NVIDIA, AMD, Broadcom, Marvell, Groq, Cerebras, ASML, Cadence, Synopsys, Arm (L4) — SK hynix, Samsung, Micron, TSMC, ASE, Amkor (L3) — Coherent, Lumentum, Arista, Credo, Astera Labs, Ayar Labs (L2) — GE Vernova, Bloom Energy, Vertiv, Digital Realty, Oklo, Constellation Energy (L1) — CoreWeave, Lambda, Nebius, Supermicro, Dell, Scale AI (L5) — OpenAI, Anthropic, Google DeepMind, xAI, Mistral, DeepSeek, Physical Intelligence (L6) — Pinecone, Weaviate, LangChain, Arize, OpenRouter (L7) — Cursor, Cognition, ElevenLabs, Snowflake (L8). …and many more, each on its own profile page.

Products — the things they actually ship

At each layer we track the products that gate everything above them — and tie each one back to the companies and the sourced facts behind it: NVIDIA Blackwell B200 & GB200 NVL72 racks, Google TPU and AWS Trainium custom silicon, SK hynix HBM3e / HBM4, TSMC CoWoS-L 2.5D packaging, 800G → 1.6T optical transceivers, PCIe / CXL / UALink retimers, small modular reactors (Aurora, NuScale), heavy-duty gas turbines, and EUV lithography.

What the site gives you

8-Layer Map — the entire stack on one page.
Topic Hubs — per-topic company rosters, sourced verified facts, and market briefings.
Companies Database — every tracked company, layer-classified, with a profile.
Leaderboards — ranked suppliers per layer, every row with sourced provenance.
Direction Indices — commitment vs attention signals per layer and topic, plus forward-looking forecasts.
Insights — cross-cutting industry synthesis and grounded long-form articles.
Tools — engineering calculators: datacenter power, PUE, HBM lead-time, GPU TCO, inference cost, and more.
Weekly, by layer — subscribe to exactly the layers you care about; AI-written briefings, delivered per layer.
Machine-readable — a JSON API and an llms.txt so agents and LLMs can read the map too.

The next 10 years — where the stack is heading

The defining pattern of this decade is that AI's binding constraint keeps migrating down the stack — and it is increasingly made of atoms, not bits. Our working view of 2025–2035:

Power becomes the constraint, not chips. The bottleneck has already moved from GPUs to megawatts. Grid-interconnect queues run years long, so behind-the-meter gas, fuel cells, and nuclear / SMR move from novelty to default. Single training sites cross the gigawatt line, and energy procurement becomes a core competency for anyone training at the frontier.
Custom silicon overtakes merchant GPUs — by volume. Hyperscaler ASICs (TPU, Trainium, MTIA, Maia) grow faster than merchant GPUs, and a healthy ecosystem of accelerator startups carves out inference. NVIDIA stays dominant in capability; the unit mix diversifies.
Memory and packaging stay the real chokepoint. HBM (HBM4 and beyond) and advanced packaging (CoWoS / SoIC, then glass substrates) gate how many accelerators actually ship. Whoever controls packaging capacity controls the ramp.
Optics goes in-package. Pluggable transceivers hit their reach and power limits; co-packaged optics and silicon photonics become the default interconnect as fabrics scale past 100k-GPU clusters and 1.6T → 3.2T.
Inference dwarfs training. As models deploy, inference becomes the majority of compute spend — pushing specialized inference silicon, gateways / routing, and cost-per-token to the center of the economics. Open-weight models close most of the gap with closed frontiers for everyday work.
Embodied AI leaves the demo stage. Vision-language-action models plus humanoid platforms move from lab videos to real deployments — the first AI layer that moves atoms, not just tokens.
The stack verticalizes and goes sovereign. Hyperscalers own more layers end to end, and nations stand up sovereign compute. The map of who controls which layer becomes as important as the technology in it.

The through-line: value and risk keep moving down the stack, toward the physical layer. That is exactly the layer aiinframap is built to map.

The idea

AI's binding constraint keeps moving — from GPUs to memory, to advanced packaging, to power and grid interconnects. aiinframap exists to map that supply chain end to end — with engineering rigor instead of hype — and to build, on top of it, an AI-native layer of content and applications. Every number is sourced, every company sits on the same stack, every claim traces to a citation. It is the reference map — and the engine — for the people who build, supply, finance, and analyze AI infrastructure.

Engineering reference, not investment advice.