• gens@programming.dev
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    22 days ago

    LLMs are limited by memory bandwidth much more then calculating power. You need HBM. Dedicated accelerators only lower power usage.