
Strix Halo can only allocate 96GB RAM to the GPU. So GPT-OSS 120B can be run at Q6 at best (and even then, activations would need to partially spill into CPU memory).
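The arithmetic behind that claim, as a rough sketch (assuming all ~120B parameters are quantized uniformly, and ignoring activations, KV cache, and any mixed-precision layers):

```python
def model_size_gb(params_billions: float, bits_per_param: int) -> float:
    """Approximate weight footprint: params * bits / 8 bits-per-byte."""
    return params_billions * 1e9 * bits_per_param / 8 / 1e9

# Rough weight sizes for a 120B-parameter model at common quantizations.
for bits in (4, 6, 8, 16):
    print(f"{bits}-bit: ~{model_size_gb(120, bits):.0f} GB")
```

At 6 bits that's ~90GB of weights alone, which is why a 96GB ceiling leaves little room for activations; at 4 bits it's ~60GB and fits comfortably.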


The 96GB limit applies only on Windows; on Linux people have allocated up to 120GB. Here's one source: https://www.reddit.com/r/LocalLLaMA/comments/1nmlluu/comment...


GPT-OSS 120B uses a native 4-bit representation, so it fits fine.


I bet you're confusing VRAM (the old fixed carve-out) with GTT (dynamically allocated) memory. The Linux amdgpu driver handles GTT just fine; amdgpu_top is one monitoring app that shows the two pools separately.
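You can also read the two pools directly from sysfs without a monitoring app. A minimal sketch, assuming the GPU is card0 and the amdgpu driver exposes its usual mem_info_* files there (adjust the card number for your system):

```shell
#!/bin/sh
# Report the fixed VRAM carve-out and the dynamic GTT pool, in MiB.
# Falls back gracefully if card0 is not an amdgpu device.
for f in mem_info_vram_total mem_info_gtt_total; do
  p="/sys/class/drm/card0/device/$f"
  if [ -r "$p" ]; then
    printf '%s: %s MiB\n' "$f" "$(( $(cat "$p") / 1048576 ))"
  else
    printf '%s: not available (no amdgpu at card0?)\n' "$f"
  fi
done
```

The matching mem_info_vram_used / mem_info_gtt_used files give current usage, which is what amdgpu_top is reading under the hood.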

More: https://news.ycombinator.com/item?id=44859582


>Strix Halo can only allocate 96GB RAM to the GPU.

Are you referring to an exclusive or a shared allocation? I think shared allocation allows using all available memory.



