ROCm Status in mid-2026 [D]
Hey folks
I'm starting to hear that ROCm works fine for inference now, but I haven't seen any reports on how viable it is for training. I have a couple of RTX 3090s that I use for prototyping models, and I'm considering switching to a pair of RX 7900 XTXs instead. On paper at least, the RX 7900 XTX offers about 4 times the FP16 throughput at a similar power draw, VRAM capacity, and cost.
Based on the PyTorch docs, it seems like ROCm is now fully supported, but I'm struggling to find user reports on how well PyTorch runs on ROCm compared with CUDA.
How viable is it to switch over to ROCm at the moment? Is it at the "it just works" stage yet? Or is the AMD ecosystem still significantly behind CUDA?