Lugh@futurology.today (M) to Futurology@futurology.today · English · 1 year ago

Two-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.


Possibly linux
1 year ago
Sorry, too late for that
- mateomaui@reddthat.com
  1 year ago
  Alright, I’ll be out back digging the bomb shelter.
  - Possibly linux
    1 year ago
    It's too late for that, honestly
    - mateomaui@reddthat.com
      1 year ago
      Alright, I’ll switch to digging holes for the family burial ground.