The AI supercycle transformed memory from a commodity into the semiconductor industry's most strategically valuable resource. From HBM4 to DDR6, from supply shortages to advanced packaging โ this is the definitive technical guide.
Modern computing has shifted from compute-centric to data-centric. The bottleneck is no longer how fast chips calculate โ it is how fast they can move data.
For decades, Moore's Law drove the semiconductor industry: transistors shrank, CPUs ran faster, and compute throughput scaled predictably. But today, every dominant workload โ large language model inference, real-time video analytics, autonomous driving, cloud-scale databases โ is memory-bandwidth-bound. These systems exhaust memory throughput before they exhaust processor cores.
This shift elevated memory chips to the most critically constrained and commercially valuable component in the global supply chain. In 2026, memory accounts for $594.7 billion of the projected $1.29 trillion global semiconductor market โ nearly half of the entire industry.
A fundamental asymmetry explains memory's dominance by volume: processors ship one-to-one with systems, while memory chips ship in multiples. Consider a single AI inference server:
| Component | Count per Server | Memory Chips Involved |
|---|---|---|
| CPU | 2 sockets | 16โ32 DDR5 DIMMs (8 chips per DIMM) |
| GPU/AI Accelerator | 8 cards | 48 HBM3E stacks (6 per card) |
| NVMe SSD | 8โ16 drives | Dozens of NAND dies per drive |
| L1/L2/L3 Cache | Embedded in CPU/GPU | SRAM arrays, hundreds of MB total |
| Total silicon dies | Hundreds of memory dies per server | |
Every processor relies on a carefully designed hierarchy of memory types, each with different speed, size, and cost characteristics:
Memory hierarchy from fastest registers to cloud storage โ each level trades speed for capacity. AI workloads are constrained by the HBM and DDR5 layers.
Analysts coined a new term for this era: a supercycle โ not a normal demand peak, but a structural, multi-year repricing of the entire memory market.
The AI infrastructure buildout that began accelerating in 2023 reached a new intensity in 2025โ2026. Every major hyperscaler โ Microsoft, Google, Amazon, Meta, Oracle โ embarked on massive data center expansion programs costing hundreds of billions of dollars. The key constraint in all of them was not land, power, or network bandwidth. It was memory.
Training a frontier model like GPT-4 class requires petabytes of memory bandwidth. Parameters, gradients, optimizer states, and activations must all reside in fast memory simultaneously.
A 70B-parameter model requires ~140 GB of memory just to load. Serving millions of simultaneous requests multiplies this across thousands of accelerators, each with HBM stacks.
Long-context AI conversations require storing the key-value attention cache in memory. For a 128K-token context window, the KV cache alone can exceed a full GPU's HBM capacity.
Each hyperscaler deploys tens of thousands of AI servers. Each server holds hundreds of memory dies. The aggregate demand is unlike anything the memory industry has faced before.
AI features in smartphones (camera processing, real-time translation, voice recognition) pushed flagship RAM from 8 GB in 2021 to 16โ24 GB in 2026, all using LPDDR5X.
ADAS and autonomous driving systems require multiple high-speed memory subsystems for lidar point cloud processing, camera fusion, and real-time decision making.
| Memory Type | 2026 Revenue (Est.) | YoY Growth | Primary Driver |
|---|---|---|---|
| DRAM Total | $418.6 B | ~3ร YoY | AI HBM demand + DDR5 cloud |
| โ HBM (subset) | ~$50โ80 B | +70% YoY | H200, B200, MI300X GPU systems |
| โ DDR5 / LPDDR5X | Large majority of DRAM | +40% YoY | Server DIMM + smartphone |
| NAND Flash | $174.1 B | +138.5% YoY | AI data center SSD, enterprise |
| Total Memory | $594.7 B | Supercycle | AI infrastructure buildout |
| Model Class | Parameters | FP16 Memory | KV Cache (128K context) | Hardware Required |
|---|---|---|---|---|
| Small LLM (3B) | 3 billion | ~6 GB | +8 GB | 1 GPU (24 GB VRAM) |
| Medium LLM (8B) | 8 billion | ~16 GB | +20 GB | 1โ2 GPUs with HBM |
| Large LLM (70B) | 70 billion | ~140 GB | +40 GB+ | 4โ8 GPUs (HBM3E) |
| Frontier MoE (400B+) | 400B+ estimated | ~800 GB+ | Hundreds of GB | Multi-GPU cluster |
HBM is not just a faster DRAM โ it is a fundamentally different architectural concept that stacks dies vertically and shortens the electrical path to the processor.
Traditional DRAM (like DDR5) sits on a PCB some distance from the processor. Signals must travel centimeters through PCB traces, through connectors, and through long wires โ consuming power and limiting bandwidth.
HBM stacks multiple DRAM dies vertically โ like floors in a building โ connected by Through-Silicon Vias (TSVs): vertical electrical connections drilled through the silicon itself. This stack is then mounted directly on an interposer next to the processor die, bringing memory within millimeters of the compute logic.
The result is a memory bus 1024 bits wide (vs. 64 bits for a single DDR5 channel) and electrical paths so short that bandwidth reaches over 1 terabyte per second while consuming dramatically less power per bit.
HBM cross-section: DRAM dies stacked via TSVs, mounted on base logic die, connected to interposer via microbumps.
| Generation | Bandwidth | Bus Width | Dies | Max Capacity | Status (2026) |
|---|---|---|---|---|---|
| HBM2E | ~460 GB/s | 1024-bit | 8-Hi | 16 GB | Legacy |
| HBM3 | ~819 GB/s | 1024-bit | 8-Hi | 24 GB | H100, MI300X |
| HBM3E | ~1.18 TB/s | 1024-bit | 12-Hi | 36โ48 GB | Current Mainstream |
| HBM4 | 1.5+ TB/s | 2048-bit | 16-Hi | 48 GB+ | Entering Production |
| HBM4E | 2+ TB/s (target) | 2048-bit | 16-Hi+ | 64 GB+ | 2027+ Roadmap |
A critical and counterintuitive dynamic in the 2025โ2026 memory market: producing HBM actually reduces the total supply of conventional memory. Because HBM requires far more silicon area, more processing steps (TSV drilling, stacking, bonding), and achieves lower bit yield per wafer, each wafer converted to HBM production removes the capacity to manufacture roughly three equivalent commodity DRAM chips.
The 3-to-1 Rule: Converting one wafer to HBM production removes the capacity of approximately three DDR5 DRAM wafers from the commodity pool โ tightening the entire market.
The modern memory protocol landscape spans JEDEC standards, chiplet interconnects, and coherent memory fabrics โ each serving different performance and topology requirements.
| Protocol | Peak BW | Bus Width | Use Case | Key Feature | Standard |
|---|---|---|---|---|---|
| DDR5 | 89 GB/s/ch | 64-bit | Server, workstation, desktop | On-die ECC, Decision Feedback EQ | JEDEC JESD79-5 |
| DDR6 | 192+ GB/s/ch | 64-bit | Next-gen servers (2027+) | PAM4 signaling, doubled pins | JEDEC JESD79-6 (draft) |
| LPDDR5X | 68 GB/s/ch | 32-bit | Smartphones, mobile AI | Sub-1V operation, low leakage | JEDEC JESD209-5 |
| LPDDR6 | 136+ GB/s/ch | 32-bit | Next-gen mobile (2027+) | Improved power efficiency | JEDEC (upcoming) |
| HBM3E | 1,180 GB/s | 1024-bit | AI accelerators, HPC | 3D stacking, TSV, CoWoS | JEDEC JESD238 |
| HBM4 | 1,500+ GB/s | 2048-bit | Next-gen AI GPUs | Wider bus, hybrid bonding | JEDEC JESD238B (2026) |
| GDDR7 | 1,792 GB/s (GPU) | Multiple channels | Gaming GPUs, inference GPUs | 32 Gbps/pin, PAM4 | JEDEC JESD232 |
| CXL 3.0 | ~256 GB/s | PCIe-based | Memory pooling, disaggregation | Cache coherence, multi-head | CXL Consortium 3.0 |
| PCIe Gen 6 | 256 GB/s (x16) | x16 lanes | CPU-GPU, CPU-SSD | PAM4, 64 GT/s/lane | PCI-SIG Gen 6.0 |
| UCIe 2.0 | ~25 Tbps/mmยฒ | Die-to-die | Chiplet interconnect | Open standard, sub-mm pitch | UCIe Consortium |
| NVMe 2.0 | ~14 GB/s | PCIe lanes | Data center SSD | ZNS, CMB, FDP | NVM Express Inc. |
| UFS 4.0 / 5.0 | 4.2โ8+ GB/s | Serial lanes | Smartphone storage | Low power, M-PHY | JEDEC / MIPI |
Compute Express Link (CXL) is a cache-coherent interconnect built on top of PCIe Gen 5/6 physical layer. Unlike traditional memory which is tightly coupled to the CPU, CXL enables memory disaggregation โ pooling memory across multiple processors and dynamically allocating it to workloads on demand.
In an AI data center, CXL 3.0 allows a memory pool of terabytes of DDR5 or LPDDR to be shared across many accelerator nodes, significantly improving utilization and reducing the total memory footprint required.
CXL also enables coherent attachment of AI accelerators: a GPU or NPU connected via CXL can access the host CPU's memory with full cache coherence, eliminating expensive data copies that previously wasted bandwidth and latency.
CXL enables coherent memory pooling and accelerator attachment over standard PCIe physical layer.
The shortage is not a single bottleneck โ it is a cascade of interconnected supply constraints that compound each other across the entire value chain.
HBM production requires specialized equipment for TSV drilling, die stacking, thermocompression bonding, and advanced packaging. These tools cannot be repurposed from standard DRAM production lines. New HBM capacity requires purpose-built facilities with 18โ36 months lead time.
Through-Silicon Via formation is a slow, precision-intensive process. Each die requires thousands of vertical holes drilled and filled with conductive material. Yield loss during TSV formation compounds as stack height increases (12-Hi HBM3E, 16-Hi HBM4).
CoWoS (Chip-on-Wafer-on-Substrate) packaging from TSMC has been the dominant platform for HBM+GPU integration. TSMC's CoWoS capacity became so constrained in 2024โ2025 that it directly limited the number of H100/H200 GPUs NVIDIA could ship, regardless of GPU die availability.
Only SK Hynix, Samsung, and Micron produce HBM. SK Hynix alone holds 50โ62% of HBM capacity. This concentration means any quality, yield, or production issue at a single supplier ripples through the entire industry. There is no alternative supplier to absorb the gap.
HBM4 and advanced DRAM nodes require EUV lithography systems, which are produced exclusively by ASML. With each EUV system costing $150โ200 million and delivery lead times of 12โ18 months, memory fabs cannot rapidly scale EUV-enabled production lines.
US export controls restrict the sale of advanced memory chips and semiconductor manufacturing equipment to certain markets. This reshapes supply chain flows and adds uncertainty to capacity planning, further limiting the speed at which new production can be deployed.
Three companies control the entire global HBM supply. SK Hynix's early relationship with NVIDIA locked in the majority share through at least 2026โ2027.
Designing and integrating advanced memory systems involves some of the hardest problems in semiconductor engineering โ spanning signal integrity, power, thermal, reliability, and yield.
A 4-GPU AI training server: host CPU with DDR5, connected via PCIe Gen 5/6 to AI accelerators, each with 6 HBM3E stacks providing ~141 GB and ~6 TB/s bandwidth per GPU.
From kilobytes of SRAM to terabytes of stacked HBM โ five decades of continuous innovation in how computers store and access data.
Static RAM (6-transistor cells) developed for processor registers and caches. Intel's 1103 DRAM (1970) was the first commercially successful dynamic RAM.
Single Data Rate DRAM used in early PCs. FPM DRAM, EDO DRAM improved access times for burst transfers.
Synchronous DRAM synchronized to the system clock, enabling pipelined burst transfers. Became the standard for the next decade.
DDR SDRAM transfers data on both rising and falling clock edges, doubling bandwidth without increasing clock frequency. Fundamental innovation.
Each generation roughly doubled bandwidth while reducing supply voltage: 1.8V (DDR2) โ 1.5V (DDR3) โ 1.2V (DDR4). DDR4 reached 3200 MT/s.
JEDEC publishes the first High Bandwidth Memory specification. AMD uses HBM1 in the Fiji GPU (Radeon R9 Fury X) in 2015 โ first commercial HBM deployment.
DDR5 doubles data rate to 6400 MT/s, adds on-die ECC, and improves power management. LPDDR5 brings similar improvements to mobile devices.
HBM3 ships in NVIDIA H100. HBM3E (12-Hi stacking) achieves 1.18 TB/s per stack, deployed in H200 and AMD MI300X. The AI memory race accelerates.
HBM4 enters mass production with a 2048-bit bus and 1.5+ TB/s bandwidth. Hybrid bonding begins replacing microbumps. DDR6 specification finalized.
| Type | Primary Use | Key Property |
|---|---|---|
| SRAM | CPU/GPU caches | Fastest, no refresh, expensive |
| DRAM/DDR5 | Server/PC main memory | Dense, needs refresh |
| LPDDR5X | Mobile AI devices | Low power, wide bus |
| HBM3E | AI accelerators | Extreme bandwidth, 3D stack |
| GDDR7 | Gaming/inference GPUs | High BW, graphics optimized |
| NAND Flash | SSD storage | Non-volatile, high density |
| NOR Flash | Firmware/code storage | Random-access, small |
| MRAM | Embedded (automotive, IoT) | Non-volatile SRAM speed |
| ReRAM / PCM | Emerging storage-class | High density, non-volatile |
The next decade will see memory evolve from a separate component into an increasingly integrated part of compute โ bringing processing closer to where data lives.
DDR6 will use PAM4 signaling to achieve over 12,800 MT/s per channel โ approximately twice DDR5 peak speed โ on the same 64-bit bus width. JEDEC specification work is ongoing, with first silicon expected 2027.
LPDDR6 will bring similar bandwidth improvements to mobile with sub-0.5V operation for the most power-sensitive applications. On-device AI will require this level of memory performance for next-generation AI phones.
HBM4E is expected to reach 2+ TB/s per stack. HBM5 is likely to bring full hybrid bonding (die-to-die direct copper bonding at sub-5 ยตm pitch) and potentially Logic-in-Memory capabilities.
CXL 3.x will enable fabric-attached memory pools shared across racks of servers. AI workloads will dynamically allocate terabytes of disaggregated memory over low-latency coherent interconnects.
PIM embeds simple compute units directly inside DRAM arrays โ performing addition, multiplication, and activation functions without moving data to the processor. Samsung's HBM-PIM and SK Hynix's AiM demonstrate this approach.
Silicon photonics interconnects between processors and memory modules could eventually exceed the bandwidth limits of electrical interconnects, operating at terabits per second over optical waveguides on-package.
UCIe 2.0 and advanced packaging will allow mixing memory dies, compute dies, and I/O dies from different foundries and vendors in a single package โ enabling custom memory architectures optimized per workload.
For quantum computing and ultra-high-performance classical computing, superconducting memory operating at cryogenic temperatures is under research at IBM, Google, and academic institutions โ still many years from commercial deployment.
HBM bandwidth doubles roughly every 2 years, far outpacing DDR channel bandwidth growth. By 2029 HBM5 is projected to reach ~2.8 TB/s per stack.
Key questions engineers, students, and technologists ask about the 2025โ2026 memory landscape.
Memory controller and protocol IP engineering are among the highest-value specializations in the global ASIC job market โ and demand is accelerating with the AI supercycle.
Designs the digital logic that arbitrates, schedules, and optimizes DRAM access patterns. Must understand DDR5/LPDDR5X protocol timing, refresh scheduling, rank interleaving, and Power-Down entry/exit. Highest demand in AI chip startups and hyperscaler ASICs.
Designs or integrates the analog/digital Physical Layer (PHY) that implements the electrical interface to DRAM. Must understand PLL design, DLL, signal equalization, calibration algorithms, and power delivery.
Builds UVM-based verification environments that model the behavior of memory devices and protocol endpoints โ enabling design teams to verify memory controllers and interconnect IP without physical hardware.
Develops RTL and verification for PCIe Gen 5/6, CXL, or UCIe โ the interconnects that link AI accelerators to memory, storage, and each other. Critical for chiplet-based AI systems.
A newer role at the intersection of AI system design and memory engineering โ optimizing the memory subsystem for LLM inference kernels, KV cache management, tensor memory layouts, and memory-bandwidth-bound operation.
Designs RISC-V-based SoCs integrating custom memory controllers, cache hierarchies, and protocol bridges. Strong demand in India's semiconductor ecosystem through programs like DLI/ChipIN.
This article draws on publicly available technical specifications and industry analysis.
Publisher of DDR5, LPDDR5X, HBM, GDDR7, UFS standards. jedec.org
Defines CXL 1.0 / 2.0 / 3.0 specifications for cache-coherent interconnects. computeexpresslink.org
Universal Chiplet Interconnect Express โ open die-to-die interface standard. uciexpress.org
PCIe Gen 5 / Gen 6 specifications and compliance standards. pcisig.com
NVMe 2.0 and ZNS specifications for storage interface. nvmexpress.org
Semiconductor market size estimates (April 2026 forecast referenced for revenue figures). idc.com
LPDDR and mobile interface specifications including MIPI CSI-2, DSI. mipi.org
Open-source RISC-V ISA specification. riscv.org
L1, L2, and L3 caches inside GPUs and CPUs are not DRAM or HBM. They are on-chip SRAM structures โ and they operate under an entirely different design paradigm.
Static RAM (SRAM) uses a 6-transistor (6T) bitcell โ two cross-coupled inverters and two access transistors โ to store one bit of data without needing a refresh cycle. This makes it dramatically faster than DRAM (which stores charge on a capacitor and must be refreshed thousands of times per second), but also far more area-intensive and expensive per bit.
Every L1, L2, and L3 cache in every modern CPU and GPU is built from SRAM bitcells supplied by the foundry process design kit (PDK) and assembled into cache macros using EDA memory compilers from Synopsys, Cadence, or Siemens.
Inside an NVIDIA H100 GPU: L1/L2/Shared Memory are on-die SRAM (proprietary). HBM3E is external DRAM governed by JEDEC JESD238.
DRAM/HBM/GDDR must be interoperable across vendors โ a Samsung HBM3E stack must work with an NVIDIA controller and a Micron stack. That interoperability requirement is what makes JEDEC standards essential for DRAM.
SRAM caches have no such requirement. They are permanently embedded inside a single chip die, designed by one company, manufactured at one foundry, and never expected to connect to another vendor's cache. There is no interoperability problem to solve โ and therefore no consortium has ever standardized cache architectures.
| Property | SRAM (Cache) | DRAM / HBM / GDDR |
|---|---|---|
| Location | On-chip, inside CPU/GPU die | Off-chip, separate package or stack |
| Technology | 6T or 8T SRAM bitcell | 1T1C DRAM capacitor cell |
| Refresh needed? | No โ data held by transistors | Yes โ capacitors leak charge |
| Speed | 1โ5 ns (L1), 5โ30 ns (L2/L3) | 100โ500 ns (DRAM); ~100 ns (HBM) |
| Density | Low (6ร area vs DRAM per bit) | Very high (billions of bits per mmยฒ) |
| Who defines it? | Chip company + foundry PDK | JEDEC standards consortium |
| Standardized? | No โ proprietary | Yes โ JEDEC JESD specs |
| Examples | H100 L1/L2 caches, Apple M4 L3 | DDR5, HBM3E, LPDDR5X, GDDR7 |
TSMC, Samsung, Intel Foundry, and GlobalFoundries supply the physical SRAM bitcell libraries embedded in their Process Design Kits (PDKs). The foundry determines the minimum bitcell area, read/write stability margins, and power characteristics at each process node (5nm, 3nm, 2nm).
Synopsys, Cadence, and Siemens EDA provide memory compilers โ software tools that generate custom SRAM macros of any specified size, aspect ratio, word width, and number of ports. The chip designer specifies parameters; the compiler generates RTL, GDSII, timing models, and power models.
NVIDIA, AMD, Intel, Qualcomm, Arm, and Apple make all the architectural decisions: L1/L2/L3 sizes, number of cache ways (associativity), replacement policies (LRU, pseudo-LRU, random), cache coherence protocols (MESI, MOESI, CHI), and how caches integrate with pipeline stages and prefetchers.
| Parameter | Options | Impact |
|---|---|---|
| Cache Size | 32 KB โ 128 MB per level | Hit rate, area, power |
| Associativity | Direct-mapped, 4-way, 8-way, fully associative | Conflict miss rate vs access time |
| Replacement Policy | LRU, pseudo-LRU, RRIP, random | Hit rate under real workloads |
| Write Policy | Write-back, write-through | Memory traffic, coherence overhead |
| Coherence Protocol | MESI, MOESI, MESIF, ARM CHI | Multi-core correctness and performance |
| Prefetching | Stream prefetch, stride, ML-driven | Effective bandwidth utilization |
| ECC | SECDED, Chipkill (for LLC) | Reliability, silicon area overhead |
| Banked vs Unified | Multiple independent banks | Parallelism, access conflicts |
NVIDIA, AMD, Intel, Qualcomm, and Arm all build entirely different internal architectures โ yet their chips all speak the same memory and interconnect languages. Here's why.
One of the most important conceptual frameworks for any semiconductor engineer or IP vendor to understand is the split between what chip companies design privately and what they inherit from open consortium standards. These two domains coexist in every modern chip โ and knowing the boundary tells you exactly where to focus your IP development.
Every chip is a blend: proprietary compute logic + standardized external interfaces. IP/VIP vendors target the right side โ where JEDEC, PCI-SIG, UCIe, and IEEE define the rules.
NVIDIA's SM architecture, AMD's CDNA compute units, Apple's Neural Engine, and Google's TPU Matrix Multiply Units are competitive weapons. Companies invest billions to differentiate on internal microarchitecture โ execution throughput, power efficiency, scheduling intelligence. No consortium can or should standardize these.
A server board must accommodate DIMMs from Samsung, SK Hynix, or Micron interchangeably. A GPU must plug into any PCIe slot regardless of motherboard vendor. This interoperability is only possible because JEDEC, PCI-SIG, and USB-IF define the electrical and protocol rules that every implementer must follow.
As an IP or VIP seller, you cannot sell a replacement for NVIDIA's SM core. But you can sell a DDR5 controller, an HBM PHY, a PCIe Gen 6 verification IP, or a UCIe die-to-die interface block โ because these are defined by open standards that any chip team must implement, regardless of their proprietary compute architecture.
| Chip | Proprietary Internal Logic | Standardized Interfaces Used |
|---|---|---|
| NVIDIA H100 | CUDA SMs, Transformer Engine, NVLink 4.0 | HBM3 (JEDEC), PCIe Gen 5 (PCI-SIG) |
| AMD MI300X | CDNA3 compute dies, Unified Memory | HBM3 (JEDEC), PCIe Gen 5, UCIe chiplets |
| Intel Gaudi 3 | Matrix Multiply Units, MME | HBM2E (JEDEC), PCIe Gen 5 (PCI-SIG) |
| Qualcomm Snapdragon X Elite | Oryon CPU, Hexagon NPU | LPDDR5X (JEDEC), USB4 (USB-IF), MIPI CSI-2/DSI |
| Apple M4 | Firestorm/Icestorm cores, Neural Engine | LPDDR5X (JEDEC), USB4 (USB-IF), PCIe (PCI-SIG) |
| Google TPU v7 | Systolic array, HBM controller (custom) | HBM3E (JEDEC), PCIe/ICI interconnect |
HBM is JEDEC's standard. PCIe is PCI-SIG's. USB is USB-IF's. Knowing who publishes what โ and who are the members โ is foundational knowledge for chip design and IP development.
| Standards Body | Technology Domain | Key Specifications | Notable Members |
|---|---|---|---|
| JEDEC jedec.org |
DRAM, Flash, Packaging | DDR5 (JESD79-5), LPDDR5X, HBM3 (JESD238B.01), GDDR7, UFS 4.0 | Samsung, SK Hynix, Micron, NVIDIA, AMD, Intel, Qualcomm, Google |
| PCI-SIG pcisig.com |
PCIe Interconnect | PCIe 5.0, PCIe 6.0, PCIe 7.0 | Intel, AMD, NVIDIA, Arm, Qualcomm, Broadcom, Marvell |
| UCIe Consortium uciexpress.org |
Chiplet Interconnect | UCIe 1.0, UCIe 1.1, UCIe 2.0 | Intel, AMD, NVIDIA, TSMC, Samsung, Arm, Qualcomm, ASML |
| USB-IF usb.org |
USB / USB-C | USB4, USB 3.2, USB-C, USB Power Delivery | Apple, Intel, Qualcomm, Google, Microsoft, Texas Instruments |
| MIPI Alliance mipi.org |
Mobile Interfaces | MIPI CSI-2 (camera), DSI (display), D-PHY, C-PHY, I3C | Qualcomm, MediaTek, Samsung, Sony, ARM, Apple |
| IEEE 802 ieee.org |
Networking / Ethernet | Ethernet 802.3 (10G/100G/400G/800G), Wi-Fi 802.11be (Wi-Fi 7) | Broadcom, Marvell, Cisco, Intel, NVIDIA (Mellanox), Juniper |
| NVM Express Consortium nvmexpress.org |
Storage Interfaces | NVMe 2.0, ZNS (Zoned Namespace), CMB, FDP | Samsung, Western Digital, Seagate, Intel, Micron, KIOXIA |
| HDMI Forum hdmi.org |
Display / AV | HDMI 2.1, HDMI 2.1a (48 Gbps) | Sony, Panasonic, Toshiba, Silicon Optix |
| VESA vesa.org |
Display / DisplayPort | DisplayPort 2.1 (80 Gbps), eDP, DSC | AMD, NVIDIA, Intel, Dell, Samsung, LG, Apple |
| Accellera / IEEE accellera.org |
EDA / Verification Standards | SystemVerilog (IEEE 1800), UVM, JTAG (IEEE 1149.1), IJTAG (IEEE 1687), PSS | Synopsys, Cadence, Siemens EDA, Arm, Intel, NVIDIA, AMD |
| MLCommons mlcommons.org |
AI/ML Hardware Benchmarks | MLPerf Training, MLPerf Inference, MLPerf Tiny | Google, NVIDIA, AMD, Intel, Qualcomm, Microsoft, Meta |
| OCP (Open Compute) opencompute.org |
Open Accelerator Hardware | OAI (Open Accelerator Infrastructure), OCP-TAP | Meta, Microsoft, Google, Intel, AMD, NVIDIA |
| RISC-V International riscv.org |
Open ISA | RV32/RV64, Vector Extension, Hypervisor Extension | SiFive, Western Digital, NVIDIA, Google, Qualcomm, Arm (observer) |
| AUTOSAR / ISO | Automotive / Functional Safety | ISO 26262 (functional safety), AUTOSAR Classic/Adaptive | Bosch, NXP, Renesas, Continental, BMW, Toyota |
| Wi-Fi Alliance / Bluetooth SIG | Wireless | Wi-Fi 7 (802.11be), Bluetooth 5.4 | Qualcomm, MediaTek, Broadcom, Apple, Samsung, Intel |
For a semiconductor IP or Verification IP company, not every consortium is equally important on day one. Here is a practical priority ranking based on market impact and relevance to AI/memory IP development: