On Ubuntu 24.04 (and Debian Unstable¹), the OS-provided packages should be able ...

mindcrime · on Oct 3, 2024

I was able to install (AMD-provided) ROCm and Ollama on Ubuntu 22.04.5 with an RX 7900 XTX with no real problems to speak of, and I can run LLMs using Ollama on ROCm just fine. Take that FWIW.

ekianjo · on Oct 3, 2024

Are there AMD cards with more than 24GB of VRAM on the market right now at consumer-friendly prices?

slavik81 · on Oct 4, 2024

The Radeon Pro W6800, W7800 or W7900 would be the standard answer. A hacker-spirited alternative would be to purchase a used MI50, MI60 or MI100 and 3D-print a fan adapter. There are versions of all of those cards with 32GB of VRAM, and they can be found on eBay for between 350 USD and 1200 USD. Plus twenty bucks for a fan adapter and a fan.

Those old gfx906 or gfx908 cards are more competitive for fp64 than for low-precision AI workloads, but they have the memory and the price is right. I'm not sure I would recommend the hacker approach to the average user, but it is what I've done for some of the continuous integration servers I host for the Debian project.

coolspot · on Oct 3, 2024

Amazon prices:

$3,600 - 61 TFLOPS - AMD Radeon Pro W7900

$4,200 - 38.7 TFLOPS - NVIDIA RTX A6000 48GB (Ampere)

$7,200 - 91.1 TFLOPS - NVIDIA RTX 6000 Ada 48GB
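
For anyone who wants the ratio rather than the raw numbers, here is a rough sketch that divides each listed price by its listed TFLOPS figure. The prices and TFLOPS values are copied from the list above; the short dictionary keys and the `usd_per_tflops` helper are made up for illustration. The list doesn't say which precision the TFLOPS numbers refer to, so treat this as a back-of-the-envelope comparison only.

```python
# Prices (USD) and TFLOPS figures are taken from the list above; nothing is measured here.
# The short card labels are just illustrative keys.
cards = {
    "W7900": (3600, 61.0),
    "A6000 Ampere": (4200, 38.7),
    "6000 Ada": (7200, 91.1),
}


def usd_per_tflops(price_usd, tflops):
    """Return the listed price divided by the listed TFLOPS figure."""
    return price_usd / tflops


cost = {name: usd_per_tflops(price, tflops) for name, (price, tflops) in cards.items()}

# Print cheapest-per-TFLOPS first.
for name, value in sorted(cost.items(), key=lambda item: item[1]):
    print(f"{name}: ${value:.0f} per TFLOPS")
```

By these listed numbers the W7900 comes out at roughly $59 per TFLOPS, the Ada card at roughly $79, and the Ampere card at roughly $109. This ignores VRAM capacity, software support and low-precision throughput, which matter at least as much for LLM work.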

mindcrime · on Oct 3, 2024

It sort of depends on how you define "consumer friendly prices". AFAIK, at around $1000 (slightly over or under), 24GB is all you can get. But there are Radeon Pro boards with 32GB or 48GB of VRAM at various prices between around $2000 and about $3500. So not "cheap", but possibly within reach for a serious hobbyist who doesn't mind spending a little bit more.