• 0 Posts
  • 10 Comments
Joined 3 years ago
cake
Cake day: July 4th, 2023

help-circle
  • If you are having trouble getting the 6800xt to work with pi.dev I’d be surprised if the V620 would be any different, but I haven’t tried that tool. I can attempt it and get back to ya in a couple days if you’d like.

    I ended up getting it purely as it seemed like the cheapest option for 32GB VRAM that didnt have discontinued driver support. Around Jan/Feb 2026 the MI60’s had recently blown up in price but the V620 still seemed niche/slept on partially because AMD hasn’t released an SR-IOV driver for this. Servethehome forums had a big thread about how these aren’t particularly useful for home server/virtual machines as a result. I think it’s still possible to pass it through to docker containers but I haven’t tried it yet.

    This guy accepted a $350 offer for mine:
    www.ebay.com/itm/157133307609
    Then you’ll need a shroud:
    www.ebay.com/itm/286347509481
    The optional included fan works well, pushes 60CFM but is LOUD. I ended up replacing it with an Arctic P8 Max which is much quieter but only pushes 40CFM, but cools it fine with -100mV undervolt in LACT.


  • mierdabird@lemmy.dbzer0.comtoSelfhosted@lemmy.worldDo you host your own AI?
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    1
    ·
    edit-2
    6 days ago

    I started out playing around with code generation using Ollama/open-webui and qwen 2.5 coder 14b on a 3060 12GB, but ended up on a winding journey with an ex datacenter card called the AMD V620. Its roughly equivalent to an RX 6800XT, but with double the VRAM. At this point i’ve really done nothing productive with it but learned a lot about bios settings, GPU/ROCm drivers, and custom fan solutions/PWM controls trying to get it setup and optimized haha.

    It’s pretty sick though, that amount of VRAM with 512GB/s bandwidth can run Qwen 3.6 27B dense with 100k context window at 20 tokens/sec in LM studio. Draws 300 watts at the wall on my ITX chassis (idling about 30w).

    I’ve been dabbling in building an aviation weather and field condition report application using this, but my next step is to rebuild my VS Code environment into a new machine. I’m kinda enjoying just fucking around with building the hardware too though



  • It’s interesting to me that nowhere in the article and really nowhere in the comments does anyone raise the question of why do the Faroe Islanders do these hunts in the first place?

    Obviously people living close to the sea have always gotten large portions of their diet from seafood, so are they actually preserving and eating all of the whale/dolphin meat they are harvesting?

    If so then it doesn’t seem to me like a question of should the hunt be banned, unless we also want to discuss banning all other forms of hunting and animal husbandry.
    It seems instead that we should be asking if they are doing it less humanely than other forms of hunting or animal husbandry, and if so what can be done to reduce the whales’ suffering?



  • It can vary a lot based on what qwen model you want to run, but generally the 27b dense or 35b MoE are currently the best balance between size and capability afaik.

    If you can run two 16GB cards you can pretty much max out the context on the 27b model, but a single card like the 3060 12gb could still work well on the 35b MoE model with the excess spilling into system memory.

    I saw in another comment you have cards from the 2010’s but if they don’t have at least 8gb I wouldn’t even bother


  • It’s hard to say what exactly your requirements are in terms of VRAM/RAM from what you described here, but as a general recommendation whether AMD or Intel, I’d stick with DDR4 generation hardware. DDR5 is extremely expensive, but any non-MoE model that spills into system memory will still be frustratingly slow.

    For GPU’s the best bang for your buck if you want Nvidia is probably the 3060 12GB, it has 360GB/s memory bandwidth and one or more of those is a very reasonable starting point for local AI.
    If you’re okay with AMD there are some really unique cards floating around, I recently picked up a V620 off ebay for $350, it’s an ex-datacenter card with 32GB GDDR6 @ 512GB/s bandwidth. It’s a bit of a power hog but in my early testing it was running Qwen coder 3 30B at like 100 tokens/sec.

    I run it on an ASUS X570 PRO board which is the cheapest AM4 board I could find with an optimal PCI-E setup: three x16 slots running 4.0x8, 4.0x8, 3.0x4. I have successfully tested it with the V620, a 9060XT, and a 3060 for 60 GB total VRAM, though the third x16 is only single slot so I had to borrow a pci extender cable to try it. I’ve found 48gb VRAM is plenty for me so I doubt I’ll actually run a third card unless I find a good deal on a single slot one.

    Kinda turned into a ramble but let me know if you got questions