OpenClaw security assessment [pdf]

(zeroleaks.ai)

60 points | by nreece 20 hours ago ago

19 comments

simonw 19 hours ago ago

Almost all of this report is about leaking system prompts.
The OpenClaw system prompt has no measures in it at all to prevent leaking, because trying to protect your system prompt is almost entirely a waste of time and actually makes your product less useful.
As a result, I do not think this is a credible report.
Here's the system prompt right now: https://github.com/openclaw/openclaw/blob/b4e2e746b32f70f8fb...
DeepYogurt 20 hours ago ago

Zeroleaks.ai is a 13 day old registration. Cool.
https://whois.domaintools.com/zeroleaks.ai

[-]
- rovr138 20 hours ago ago
  
  More interesting, looks to be from this 16yo, https://github.com/x1xhlol, https://www.lucknite.dev/
  
  [-]
  - arcfour 19 hours ago ago
    
    Explains why it reads like AI slop. "CRITICAL BREACH..."
    
    [-]
    - edoceo 19 hours ago ago
      
      Can we call slop in two words? I didn't feel that. Is my radar off? /me taps screen
      
      [-]
      - Uehreka 18 hours ago ago
        
        I frequently push back on people being hair-trigger about calling things AI, but even I’ve gotta admit, that’s exactly what Claude code says if you ask it to do a security review and it finds something. I’ve seen this numerous times.
      - arcfour 19 hours ago ago
        
        I can detect it pretty well, but that was just one example.
        No person starts a summary that way, it's over-the-top and meaningless. I have seen AI do that many times when summarizing something related to security, though. Claude often says "CRITICAL:" or "CRITICAL VULNERABILITY:" or similar, especially when you jam the context window full of junk.
  - cyrusradfar 19 hours ago ago
    
    Yes, with 128K GH stars. Impressive if true.
    
    [-]
    - jasonjmcghee 19 hours ago ago
      
      The account's stars are mostly a "system prompts" collection repo fwiw.
    - kristopolous 19 hours ago ago
      
      Trying to hustle online and writing high quality software aren't the same
- rovr138 20 hours ago ago
  
  Looks interesting, https://github.com/ZeroLeaks/zeroleaks
  At least, I am curious about the tool
rodrigosetti 19 hours ago ago

It's a moltbook agent tasked to get HN attention

[-]
- AstroBen 18 hours ago ago
  
  seems it worked. We've been outsmarted by the lobster
alan_sass 19 hours ago ago

Is this a CC generated .md report formatted as a .pdf? Looks familiar.
jonrcooper 20 hours ago ago

Zero mention of specific models that are being compromised makes it hard to take the numbers in this report seriously.
I do understand there's a lot of people running openclaw that don't really understand it and know what models are actually running. But we've known for a while that there are tons of older models that are pretty vulnerable, and you can hook up any model to OpenClaw, so, this data is not really that useful. Even though I totally agree that there are plenty of security risks here

[-]
- adam_arthur 19 hours ago ago
  
  Relying on the model for security is not security at all.
  No amount of hardening or fine-tuning will make them immune to takeover via untrusted context
K0IN 18 hours ago ago

Can someone give me context on why leaking the system prompt of a open source tool, I run on my machine is a problem?

[-]
- ottah 18 hours ago ago
  
  Only if you write a custom prompt with information you don't want to disclose.
bhewes 20 hours ago ago

Ha this moltbook gone crazy.