• luciole (he/him)@beehaw.org
    link
    fedilink
    arrow-up
    26
    ·
    8 个月前

    The actual research page is so awkward. The TLDR at the top goes:

    single portrait photo + speech audio = hyper-realistic talking face video

    Then a little lower comes the big red warning:

    We are exploring visual affective skill generation for virtual, interactive characters, NOT impersonating any person in the real world.

    No siree! Big “not what it looks like” vibes.