Avram Piltch is the editor in chief of Tom’s Hardware, and he’s written a thoroughly researched article breaking down the promises and failures of LLM AIs.

  • RickRussell_CA@beehaw.orgOP
    link
    fedilink
    English
    arrow-up
    1
    ·
    9 months ago

    it’s basically impossible to tell where parts of the model came from

    AIs are deterministic.

    1. Train the AI on data without the copyrighted work.

    2. Train the same AI on data with the copyrighted work.

    3. Ask the two instances the same question.

    4. The difference is the contribution of the copyrighted work.

    There may be larger questions of precisely how an AI produces one answer when trained with a copyrighted work, and another answer when not trained with the copyrighted work. But we know why the answers are different, and we can show precisely what contribution the copyrighted work makes to the response to any prompt, just by running the AI twice.