• jacksilver@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    5 hours ago

    So LLMs can trace their origin back to the 2017 paper “Attention is all you need”, they with diffusion models have enabled prompt based image generation at an impressive quality.

    However, looking at just image generation you have GANs as far back as 2014 with style GANs (ones that you could more easily influence) dating back to 2018. While diffusion models also date back to 2015, I don’t see any mention of use in images until early 2020’s.

    Thats also ignoring that all of these technologies go back further to lstms and CNNs, which go back further into other NLP/CV technologies. So there has been a lot of progress here, but progress isn’t also always linear.