• humanspiral@lemmy.ca
    21 days ago

    3.6 27b is probably the most powerful/efficient model for its size out there. Qwen also has a history of leveraging DeepSeek's power (DeepSeek has created small models with Qwen as the base), and Alibaba is the main hosting service for DeepSeek. Alibaba/Qwen is in talks to invest in DeepSeek, atm.