You'd have to specify which quantization you find acceptable and what context size you require. The most affordable way to run large models locally is probably still multiple RTX 3090s; depending on quantization and context length, you'd likely need three or four of them.
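As a rough sanity check, here's a back-of-envelope sketch of why quantization and context matter so much. The layer/head counts are illustrative assumptions (roughly in the ballpark of a 70B Llama-style model with grouped-query attention), not exact figures for any specific model or runtime:

```python
# Rough VRAM estimate: quantized weights + fp16 KV cache.
# All architecture numbers below are illustrative assumptions.

def estimate_vram_gb(params_b, bits_per_weight, ctx_len,
                     n_layers=80, n_kv_heads=8, head_dim=128):
    # Weights: params (billions) * bits per weight / 8 -> GB
    weights_gb = params_b * bits_per_weight / 8
    # KV cache: 2 (K and V) * layers * kv_heads * head_dim * ctx * 2 bytes (fp16)
    kv_gb = 2 * n_layers * n_kv_heads * head_dim * ctx_len * 2 / 1e9
    return weights_gb + kv_gb

# e.g. 70B params at ~4.5 bits/weight (Q4 with overhead), 32k context:
print(estimate_vram_gb(70, 4.5, 32_768))  # ~50 GB
```

At ~50 GB you've already outgrown two 24 GB 3090s once you account for activations and runtime overhead, which is why three or four cards is the realistic floor for bigger models or longer contexts; drop to Q3 or shrink the context and two can be enough.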