Do i need industry grade gpu’s or can i scrape by getring decent tps with a consumer level gpu.

  • red
    link
    fedilink
    English
    arrow-up
    1
    ·
    3 hours ago

    this is useless, llama.cpp already does that airllm does (offloading to CPU) but its actually faster. so just use ollama