Can we discuss how it’s possible that the paid model got worse and the free one got better? Is it because the free one is being trained on a larger pool of users or what?

  • blue_zephyr@lemmy.world
    1 year ago

    It’s because the research in question used a really small and unrepresentative dataset. I want to see these findings reproduced on a proper task collection.

    • Gsus4@lemmy.one (OP)
      1 year ago

      True, checking whether a number is prime is a very narrow test of ChatGPT, but this is in line with other reports of progressive dumbing down.