What this is
The next-generation consumer GPU cycle is sliding. NVIDIA RTX 60 and AMD RDNA5 — both originally targeted for 2026-2027 — are now expected mid-to-late 2027. Intel Arc B700 (consumer) is uncertain, though Intel's professional B65/B70 still target Q1 2027. The cause is the same memory crisis driving DDR5 prices upward. This page is a 2026-05-15 snapshot.
Consumer GPU Launch Window Updates
| Product | Original target | Current target | Reason |
|---|---|---|---|
| NVIDIA RTX 60 (consumer) | Late 2026 | Mid / late 2027 | GDDR7 module availability |
| AMD RDNA5 (Radeon RX 9000-class) | Q4 2026 - Q1 2027 | Mid 2027 or later | Memory supply, capex prioritisation |
| Intel Arc B700 (consumer) | Q1 2027 | Uncertain | Consumer prioritisation downgraded |
| Intel Arc B65 / B70 (professional) | Q1 2027 | Q1 2027 (on track) | Enterprise margin preserved |
| NVIDIA RTX 50 Super refresh | Mid 2026 | Mid 2026 (on track, smaller) | Filler for delayed RTX 60 |
What's Driving the Slide
- GDDR7 wafer allocation went to HBM. The same fabs that would have made consumer GDDR7 at 24Gb/s are running HBM3E for hyperscalers.
- NVIDIA prefers data-centre margins. Even if consumer SKUs were ready, the opportunity cost (data-centre revenue per wafer) makes a delay financially rational.
- AMD chose to protect the MI300/MI400 cadence. RDNA5 slipped while CDNA roadmap held.
- Intel deprioritised consumer Arc. The B65/B70 professional lineup ships on time; the consumer B700 timeline is now uncertain.
- OEM forward-orders shifted. System builders bought up RTX 50 / RDNA4 last-gen inventory, removing pressure on the vendors to ship next-gen on the original window.
Implications for AI Hobbyists and Local-LLM Buyers
| Buyer profile | 2026 implication |
|---|---|
| Local-LLM hobbyist on RTX 4090 / 5090 | Hold; upgrade window pushed to 2027 |
| 70B+ model runner | Mac Studio M3 Ultra (limited supply) or DGX Spark |
| Mid-range fine-tuner | AMD Strix Halo workstations now the best fresh-buy |
| Budget LLM tinkerer | Used RTX 3090 / 4090 second-hand market |
| Anyone needing throughput in 2026 | Cloud rental on RunPod / Lambda |
Six Things the Delay Tells You
- Two-year consumer GPU cycles are gone. RTX 50 → RTX 60 will be ~3+ years.
- The data-centre/consumer trade-off favours data centre. NVIDIA and AMD both signalled this in earnings.
- Used-GPU markets matter more than ever. Second-hand RTX 4090 prices climbed because next-gen is delayed.
- Mac Studio became scarcer because consumer GPUs are delayed. Substitution pressure flowed to Apple Silicon while supply contracted.
- AMD Strix Halo workstation buyers had the best timing. Available now, mid-range AI capable, ships without delay.
- Cloud GPU rental will outgrow consumer GPU sales in 2026. Deferred hobbyist demand absorbs into cloud rather than waiting.
What This Means for AI Visibility
The deferred consumer-GPU cycle changes the local-LLM user mix during a critical period for local-first AI assistants like OpenClaw. Brands monitoring how their content appears inside these assistants should expect the underlying model mix to remain Apple Silicon-favouring + cloud-rental until late 2027.
Methodology
Launch timelines combine the 3dcenter Feb 6, 2026 brief, TweakTown's NVIDIA + AMD pricing coverage, Quasa.io's GPU price/memory analysis, and Clarifai's GPU shortages 2026 piece. Implications for local-LLM buyers reconciled to community reports from r/LocalLLaMA and Reddit r/buildapc.
How Presenc AI Helps
Presenc AI's hardware-aware view of local-LLM brand visibility lets brand teams see whether their content surfaces are reaching the actually-deployed mix in 2026: a long tail of RTX 4090/3090 holdouts, M3-class Mac Studios, AMD Strix Halo workstations, and cloud rentals — not a uniform RTX 60 base that does not exist yet.