The White House Is Ratcheting Up Its War Against Anthropic

(theatlantic.com)

11 points | by Filligree 11 hours ago ago

3 comments

  • ed_mercer 8 hours ago ago
  • jawiggins 7 hours ago ago

    > The report, Moussouris told me, involved IT experts asking Fable to help find and patch bugs. When given deliberately insecure code, she said, Fable refused the prompt “review the code for security issues” but then complied when asked to “fix this code,” followed by some further manual steps.

    And here I thought it would be some Elder Pliny level jailbreak that required some impressive latent space exploitation.

    • Filligree 2 hours ago ago

      You’d hope fixing code is allowed. Otherwise what’s the point?