Trying to understand Halo Strix memory

The Hobbyist · edit-2 11 days ago

Trying to understand Halo Strix memory

The Hobbyist · 10 days ago

I was trying to reason from how GPUs occasionally use a so called clamshell design where, if I understand correctly, they split their bus to reach double the number of memory chips. The chips are paired and respond to the same addresses but then each provide part of the data which is then combined.

Your example for vehicles got me confused, because as you point out, if you double the number of lanes while keeping the speed the same, you do effectively double the number of vehicles passing per unit of time, which is the bandwidth we are trying to achieve.

I’m sorry if I’m missing some important details but I am still rather confused.

PS: as per the specific framework memory speed specs, the halo strix chip maxes out at 8000, so 8533 is not supported, as per the specs I linked in the the post.

sp3ctr4l · edit-2 10 days ago

I was trying to reason from how GPUs occasionally use a so called clamshell design…

https://usercomp.com/news/1238286/gddr5-clamshell-mode-and-bandwidth

Clamshell mode is relevant to GDDR5 memory.

Memory that is a part of the GPU, part of a GPU board.

LPCAMM/2 and LPDDR5(x) memory is system memory, aka, RAM sticks.

Completely different kinds of memory.

…

… if I understand correctly, they split their bus to reach double the number of memory chips…

You do not understand correctly.

There are many different kinds of memory chips, and they are not all directly compatible with all other kinds of processors.

You know how pcpartpicker or sites like that will tell you hey, you can’t plug DDR4 RAM into a mobo that only has slots for DDR5 RAM?

Imagine that but 1000x more complicated for trying to mix and match things at not the ‘hardware component’ level, but the ‘hardware subcomponents of hardware components’ level.

Clamshell mode only works on GPUs because the processor(s) on a GPU is only ever going to directly interface with the GDDR memory that is part of the the GPU, and is intentionally designed to specifically work with that kind of GDDR memory… so you don’t need to worry about making GDDR memory being able to directly interface with other parts of the system.

A GPU is an add-in-board. Its much closer to an entire miniature PC itself, on one board, and everything it communicates to the rest of the system is through the PCIE 16 slot on the motherboard.

That allows a GPU to have more specialized tech within itself, as you’re not going to be customizing, manually modifying the memory or other components of your GPU.

System memory directly connects to a motherboard, and thus has to be directly compatible with anything else that could be plugged in to any thing else directly to the motherboard.

Because system memory is directly connected to potentially a much greater variety of other hardware via the mobo, it cannot be so specialized, otherwise less advanced components that directly connect to the mobo wouldn’t be able to interface with it.

…

The Halo Strix is technically not a CPU.

It is an APU.

That means its a single hardware component that is a hyrbrid of CPU style processor cores, and GPU style processor cores.

That means the system its plugged into has to use a kind of system memory that is both generally compatible with other hardware components on the mobo, but is also compatible with the more specialized GPU style processor core demands.

This is why, in general, there are architecture differences between laptop/smartphone style memory, and pc style system memory, why you can’t usually plug laptop RAM into a PC, or visa versa.

…

Your example for vehicles got me confused…

I’m sorry if this is complicated and confusing, there may be a more straightforward way to explain it, but computer hardware really is quite complicated at this level of detail.

If my example/analogies are still confusing and inadequate, then abandon them and try this:

Bus widths are data transfer standards and protocols… in that sense they are kind of like a language.

A 256 bit ‘language’ speaking 256 ‘words’ at the same speed as a 128 bit ‘language’ may be able to push twice as much meaningful informatiom in the same amount of time… but this is useless if there isn’t an instantaneous translation between the 128 bit and 256 bit ‘speakers’.

If your standard is expecting ‘words’ that are all 128 ‘letters’ long, and then you try to send a 256 letter word… you’re gonna have a problem. Half the letters won’t get through.

You would have to have some specialization on the sender side, that breaks its 256 letter words into 2 seperate 128 letter words, and some specialization on the reciever side that takes 2 concurrent 128 letter words, combines them into the 256 letter word, and then decodes that into the actual intended meaning (instruction set) of the 256 letter word.

At that point, your memory now also has its own ‘translator’ or, less metaphorically, processor of some kind… which is silly and wasteful.

It makes more sense to just use memory that can ‘speak the same language’ as the processor.

… If this still doesn’t make sense, then go to wikipedia or find an instructional course or video series that actually, properly explains compuyer hardware design, including what a bus actually is and how it works.

…

PS: as per the specific framework memory speed specs, the halo strix chip maxes out at 8000, so 8533 is not supported, as per the specs I linked in the the post.

The AMD specs you link actually don’t specify what you are saying here.

They say:

256-bit LPDDR5x

LPDDR5x is a blanket term. A container term.

The ‘x’ is a placeholder for all the specific throughput speeds of all different kinds of LPDDR5 - (number) speeds.

It certainly does support LPDDR5 - 8000.

It likely also supports LPDDR5 - 8533…

…though AMD does not actually specifically confirm nor deny this on your cited source page, as they are using a more vague, catch-all term.