The Hobbyist

Just a stranger trying things.

  • 24 Posts
  • 596 Comments
Joined 2 years ago
Cake day: July 16th, 2023




  • I was trying to reason from how GPUs occasionally use a so-called clamshell design where, if I understand correctly, they split their bus to reach double the number of memory chips. The chips are paired and respond to the same addresses, but each provides only part of the data, which is then combined.

    Your vehicle example confused me because, as you point out, if you double the number of lanes while keeping the speed the same, you effectively double the number of vehicles passing per unit of time, which is exactly the bandwidth increase we are trying to achieve.

    I’m sorry if I’m missing some important details but I am still rather confused.

    PS: regarding the Framework memory speed specifically, the Strix Halo chip maxes out at 8000 MT/s, so 8533 is not supported, as per the specs I linked in the post.
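
To make the lane analogy concrete, here is a rough sketch of the arithmetic: theoretical peak bandwidth is total bus width times transfer rate. The 256-bit bus width and the chip counts below are assumptions for illustration and do not come from the linked specs; only the 8000 MT/s figure does. The clamshell case assumes each chip of a pair drives half of its usual data width, which is my understanding of how the bus gets "split".

```python
def total_bus_width_bits(chips: int, bits_per_chip: int) -> int:
    """Total data bus width when every chip drives its own slice of the bus."""
    return chips * bits_per_chip


def peak_bandwidth_gb_s(bus_width_bits: int, transfer_rate_mt_s: int) -> float:
    """Theoretical peak bandwidth in GB/s (1 GB = 10**9 bytes)."""
    # bits per transfer * million transfers per second -> Mbit/s,
    # then / 8 -> MB/s, then / 1000 -> GB/s
    return bus_width_bits * transfer_rate_mt_s / 8 / 1000


RATE = 8000  # MT/s, the maximum speed mentioned above

# Hypothetical layouts (chip counts and widths are illustrative assumptions).
normal = total_bus_width_bits(chips=8, bits_per_chip=32)      # 8 full-width chips
clamshell = total_bus_width_bits(chips=16, bits_per_chip=16)  # 16 half-width chips
doubled = total_bus_width_bits(chips=16, bits_per_chip=32)    # a genuinely wider bus

print(peak_bandwidth_gb_s(normal, RATE))     # same lanes, 8 chips
print(peak_bandwidth_gb_s(clamshell, RATE))  # same lanes, 16 chips
print(peak_bandwidth_gb_s(doubled, RATE))    # twice the lanes
```

Under these assumptions, clamshell doubles the number of chips (and so the capacity) but not the number of lanes, so the peak bandwidth stays the same. Only a physically wider bus would double it.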




  • This computer is not a gaming machine (as they falsely advertised), because you can definitely get something both more powerful and more upgradeable for less.

    It is exclusively an AI computer. Its whole advantage is that it provides a massive pool of high-bandwidth memory. It serves only that purpose and it serves it amazingly well. For anyone else, it is not a good value proposition. But in the AI space, this machine is a fantastic value. There is nothing else out there with 128GB that even comes close in price. An RTX 5090 has 32GB and costs 2000 USD alone, without any other component. The Nvidia Digits probably won’t have 128GB and certainly not at that price.

    This competes against Apple’s computers with their high-bandwidth unified memory, on which people run LLMs locally, and it does so using many more standardized components and at a much more reasonable price.

    This machine serves a niche exclusively and I don’t blame anyone for dismissing it, but that is because it serves a very specific use case for which there are few to no alternatives.

    Edit: yes, the Nvidia Digits will have 128GB of shared memory, with 1 petaflop of FP4 compute, starting at 3000 USD. Source (Nvidia): https://nvidianews.nvidia.com/news/nvidia-puts-grace-blackwell-on-every-desk-and-at-every-ai-developers-fingertips
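
As a back-of-the-envelope illustration of why capacity matters here, the sketch below estimates the memory needed just to hold a model's weights. The 70B parameter count and the bit widths are hypothetical examples, not figures from the comment above. The estimate ignores the KV cache, activations, and runtime overhead, and it assumes the whole memory pool is usable by the model, which in practice it will not be.

```python
def weights_size_gb(params_billions: float, bits_per_weight: int) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 10**9 bytes)."""
    return params_billions * bits_per_weight / 8


def fits(params_billions: float, bits_per_weight: int, memory_gb: float) -> bool:
    """True if the weights alone fit in the given memory (overheads ignored)."""
    return weights_size_gb(params_billions, bits_per_weight) <= memory_gb


MODEL_B = 70  # hypothetical 70-billion-parameter model

for bits in (4, 8, 16):
    size = weights_size_gb(MODEL_B, bits)
    print(f"{bits}-bit: {size} GB | fits in 32GB: {fits(MODEL_B, bits, 32)}"
          f" | fits in 128GB: {fits(MODEL_B, bits, 128)}")
```

With these assumed numbers, even the 4-bit version is too large for a single 32GB card, while the 4-bit and 8-bit versions fit in 128GB and the 16-bit version does not.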




  • I don’t understand in what circumstances anyone would want to use link shorteners. I can only find reasons not to use them:

    • they subject visitors to surveillance
    • the destination is under the control of a third party (potential for ransom for the author, and potential for ads for visitors, both of which we see here)
    • they obfuscate the actual destination
    • how long will the redirect remain valid? The company could decide it is no longer viable to keep supporting the redirect, rendering the destination inaccessible from every place where the shortened link is used.
    • more…?