> The report, Moussouris told me, involved IT experts asking Fable to help find and patch bugs. When given deliberately insecure code, she said, Fable refused the prompt “review the code for security issues” but then complied when asked to “fix this code,” followed by some further manual steps.
And here I thought it would be some Elder Pliny level jailbreak that required some impressive latent space exploitation.
https://archive.md/Ouq7C
> The report, Moussouris told me, involved IT experts asking Fable to help find and patch bugs. When given deliberately insecure code, she said, Fable refused the prompt “review the code for security issues” but then complied when asked to “fix this code,” followed by some further manual steps.
And here I thought it would be some Elder Pliny level jailbreak that required some impressive latent space exploitation.
You’d hope fixing code is allowed. Otherwise what’s the point?