Research

NVIDIA RTX 60 & AMD RDNA5 Delays 2026

NVIDIA RTX 60 and AMD RDNA5 consumer GPU launches pushed to mid-late 2027 due to GDDR memory shortage. Intel Arc B700 uncertain. Snapshot for 2026-05-15.

By Ramanath, CTO & Co-Founder at Presenc AI · Last updated: May 2026

What this is

The next-generation consumer GPU cycle is sliding. NVIDIA RTX 60 and AMD RDNA5 — both originally targeted for 2026-2027 — are now expected mid-to-late 2027. Intel Arc B700 (consumer) is uncertain, though Intel's professional B65/B70 still target Q1 2027. The cause is the same memory crisis driving DDR5 prices upward. This page is a 2026-05-15 snapshot.

Consumer GPU Launch Window Updates

ProductOriginal targetCurrent targetReason
NVIDIA RTX 60 (consumer)Late 2026Mid / late 2027GDDR7 module availability
AMD RDNA5 (Radeon RX 9000-class)Q4 2026 - Q1 2027Mid 2027 or laterMemory supply, capex prioritisation
Intel Arc B700 (consumer)Q1 2027UncertainConsumer prioritisation downgraded
Intel Arc B65 / B70 (professional)Q1 2027Q1 2027 (on track)Enterprise margin preserved
NVIDIA RTX 50 Super refreshMid 2026Mid 2026 (on track, smaller)Filler for delayed RTX 60

What's Driving the Slide

  1. GDDR7 wafer allocation went to HBM. The same fabs that would have made consumer GDDR7 at 24Gb/s are running HBM3E for hyperscalers.
  2. NVIDIA prefers data-centre margins. Even if consumer SKUs were ready, the opportunity cost (data-centre revenue per wafer) makes a delay financially rational.
  3. AMD chose to protect the MI300/MI400 cadence. RDNA5 slipped while CDNA roadmap held.
  4. Intel deprioritised consumer Arc. The B65/B70 professional lineup ships on time; the consumer B700 timeline is now uncertain.
  5. OEM forward-orders shifted. System builders bought up RTX 50 / RDNA4 last-gen inventory, removing pressure on the vendors to ship next-gen on the original window.

Implications for AI Hobbyists and Local-LLM Buyers

Buyer profile2026 implication
Local-LLM hobbyist on RTX 4090 / 5090Hold; upgrade window pushed to 2027
70B+ model runnerMac Studio M3 Ultra (limited supply) or DGX Spark
Mid-range fine-tunerAMD Strix Halo workstations now the best fresh-buy
Budget LLM tinkererUsed RTX 3090 / 4090 second-hand market
Anyone needing throughput in 2026Cloud rental on RunPod / Lambda

Six Things the Delay Tells You

  1. Two-year consumer GPU cycles are gone. RTX 50 → RTX 60 will be ~3+ years.
  2. The data-centre/consumer trade-off favours data centre. NVIDIA and AMD both signalled this in earnings.
  3. Used-GPU markets matter more than ever. Second-hand RTX 4090 prices climbed because next-gen is delayed.
  4. Mac Studio became scarcer because consumer GPUs are delayed. Substitution pressure flowed to Apple Silicon while supply contracted.
  5. AMD Strix Halo workstation buyers had the best timing. Available now, mid-range AI capable, ships without delay.
  6. Cloud GPU rental will outgrow consumer GPU sales in 2026. Deferred hobbyist demand absorbs into cloud rather than waiting.

What This Means for AI Visibility

The deferred consumer-GPU cycle changes the local-LLM user mix during a critical period for local-first AI assistants like OpenClaw. Brands monitoring how their content appears inside these assistants should expect the underlying model mix to remain Apple Silicon-favouring + cloud-rental until late 2027.

Methodology

Launch timelines combine the 3dcenter Feb 6, 2026 brief, TweakTown's NVIDIA + AMD pricing coverage, Quasa.io's GPU price/memory analysis, and Clarifai's GPU shortages 2026 piece. Implications for local-LLM buyers reconciled to community reports from r/LocalLLaMA and Reddit r/buildapc.

How Presenc AI Helps

Presenc AI's hardware-aware view of local-LLM brand visibility lets brand teams see whether their content surfaces are reaching the actually-deployed mix in 2026: a long tail of RTX 4090/3090 holdouts, M3-class Mac Studios, AMD Strix Halo workstations, and cloud rentals — not a uniform RTX 60 base that does not exist yet.

Frequently Asked Questions

Mid-to-late 2027 on current reporting, slipping from a late-2026 target. The silicon is roughly on schedule; the GDDR7 modules needed to ship at competitive capacity are the constraint, because the same wafers are running HBM3E for hyperscaler customers.
Mid 2027 or later, slipping from Q4 2026 / Q1 2027. Memory supply and AMD's choice to protect the MI300/MI400 data-centre cadence drove the slip.
Uncertain on consumer; Intel's professional B65/B70 SKUs still target Q1 2027. Intel deprioritised the consumer line as memory allocation went elsewhere.
If you need throughput in 2026, do not wait. Best fresh-buy is AMD Strix Halo workstation for mid-range work, Mac Studio M3 Ultra (limited supply) or DGX Spark for 70B+ inference, or cloud GPU rental for everything else. Used RTX 4090 / 3090 are the budget options.

Track Your AI Visibility

See how your brand appears across ChatGPT, Claude, Perplexity, and other AI platforms. Start monitoring today.