Show HN: Katakate – Dozens of VMs per node for safe code exec

(github.com)

122 points | by gbxk 5 days ago ago

51 comments

mentalgear 5 days ago ago

I would really like to see a good local sandboxing solution in this space, something that is truly local-first. This is especially important since many coding models / agentic builders will eventually become lightweight enough to run them on-device instead of having to buy tokens and share user data with big LLM cloud providers.

[-]
- dloss 4 days ago ago
  
  Anthropic recently released a sandboxing tool based on bubblewrap (Linux, quite lightweight) and sandbox-exec (macOS). https://github.com/anthropic-experimental/sandbox-runtime
  I wonder if nsjails or gVisor may be useful as well. Here's a more comprehensive list of sandboxing solutions: https://github.com/restyler/awesome-sandbox
  
  [-]
  - gbxk 4 days ago ago
    
    wow that's super new! Thanks for that, will look deeply into it and compare
- mkagenius 4 days ago ago
  
  > something that is truly local-first
  Hey, we built coderunner[1] exactly for this purpose. It's completely local. We use apple containers for this (which are 1:1 mapped to a lightweight VM).
  1. Coderunner - https://github.com/instavm/coderunner
  
  [-]
  - gbxk 4 days ago ago
    
    Very cool! Apple containers run on Apple ARM so it's complimentary to my stack which doesn't support ARM yet (but soon will when extending to Qemu which supports ARM). Thanks for sharing!
- gbxk 4 days ago ago
  
  (sorry I didn't reply in-thread, I'm new to HN, re-posting response here:)
  Exactly! The main local requirement is to have hardware virtualization available (e.g. /dev/kvm) but that should be fine on your local linux machine. Won't work in cloud machines or on Mac ARM in current form but maybe if I extend
  
  [-]
  - ofrzeta 4 days ago ago
    
    There are some providers that offer KVM nested virtualization, I think Google Cloud, Digital Ocean ... any others?
    
    [-]
    - gbxk 4 days ago ago
      
      True! GCP does. I haven't tested it yet. I didn't know D.O does. If anyone knows others, I'm interested too!
      
      [-]
      - eyberg 4 days ago ago
        
        We (NanoVMs) can run (both unikernel and normal linux) virtualized workloads on plain old ec2 instances (eg: t2.small).
        
        [-]
        
        gbxk 4 days ago ago
        
        Interesting, thanks for sharing!
- elric 4 days ago ago
  
  Are there any such solutions that can adequately protect against side-channel attacks (à la rowhammer, meltdown, spectre, ...)? I mean protecting local file access and network access is pretty easy, but side-channels and VM escaping attacks seem like a bigger concern.
  
  [-]
  - gbxk 4 days ago ago
    
    That's an interesting direction! TEE support would be relatively straightforward with current stack (and it's on my roadmap), so that could be a first step forward.
  - ATechGuy 4 days ago ago
    
    Side-channel attacks apply to multi-tenant cloud environments, not local.
    
    [-]
    - elric 4 days ago ago
      
      That seems like a naive take. If any of your local VMs are internet connected and are compromised, side channel attacks could be used to exfiltrate data from other VMs or the host.
      
      [-]
      - ATechGuy 4 days ago ago
        
        Then why only apply to VMs, why not apps?
- sshine 4 days ago ago
  
  https://rstrict.cloud/ is a CLI built in Rust on top of the Landlock API for the Linux kernel.
  It lets you narrow the permission scope of an executable using simple command line wrappers.
  
  [-]
  - gbxk 4 days ago ago
    
    Thanks, will study that one too!
- kernc 4 days ago ago
  
  Local-first (on Lunix), POSIX shell: https://github.com/sandbox-utils/sandbox-run
  
  [-]
  - gbxk 4 days ago ago
    
    Thanks for sharing, adding it to my list.
- _false 4 days ago ago
  
  What about this: https://github.com/apple/container
  
  [-]
  - gbxk 4 days ago ago
    
    Very cool one. That's dedicated to Apple ARM which I don't currently support so the two are complimentary. Apple containers shares some primitives with Kata. I'll investigate if it's possible to use Apple containers as a VMM inside Kata, or creating an Apple Containers runtime class in Kubernetes. If either is possible, we could then potentially use Apple containers as a backend in Katakate. I need more time to study that.
srcreigh 4 days ago ago
```
    name: project-build
    image: alpine:latest
    namespace: default
    egress_whitelist:
      - "1.1.1.1/32"      # Cloudflare DNS
      - "8.8.8.8/32"      # Google DNS
```
This is basically a wide-open network policy as far as data exfiltration goes, right?
Malicious code just has to resolve <secret>.evil.com and Google/CF will forward that query to evil resolver.
[-]
- gbxk 4 days ago ago
  
  That's a config example.
  Yes, blocking DNS exfiltration requires DNS filtering at cluster level. This is what will be added with the Cilium integration which is top-3 on the roadmap (top of readme).
  DNS resolution is required for basic Kubernetes functionality and hostname resolution within the cluster.
  That's said explicitly in several places in the docs: "DNS to CoreDNS allowed"
  One thing I could do is make it exposed in config, to allow the user to block all DNS resolutions until Cilium is integrated. LMK if desired!
  
  [-]
  - srcreigh 4 days ago ago
    
    > One thing I could do is make it exposed in config, to allow the user to block all DNS resolutions until Cilium is integrated. LMK if desired!
    Yes, but it's not great for it to be an optional config option. Trivially easy to use data exfiltration methods shouldn't be possible at all in a tool like this, let alone enabled by default.
    I want to recommend ppl to try this out and not have to tell them about the 5 different options they need to configure in order for it to actually be safe. It ends up defeating the purpose of the tool in my opinion.
    Some use cases will require mitmproxy whitelists as well, eg default deny pulling container image except matching the container whitelist.
    
    [-]
    - gbxk 4 days ago ago
      
      This is an excellent point. I moved this to #1 on the TODO list. I'll deny all DNS resolution by default until Cilium is integrated, if that passes the basic functionality tests.
      I'll also add to the roadmap whilelist/deny for container pulling.
      Thanks!
    - gbxk 2 days ago ago
      
      As promised: https://github.com/Katakate/k7/tree/fix/no-dns-res-in-lockdo...
      Will merge that in after it passes all network tests on a clean/wiped instance.
empath75 4 days ago ago

This seems like an amazing stack that ticks a lot of boxes for me, but I really dislike cli or a custom api as the UX for this and would prefer to manage all of this with CRDs so i can just use the k8s client for everything.

[-]
- gbxk 4 days ago ago
  
  Actually you can! After you run "k7 install" you'll have a k3s cluster up and running, with Kata as a runtime class, and Firecracker specified in Kata config. So nothing prevents you from hitting the Kubernetes API! kubectl will work.
  Note: I use k3s' internal kubectl and containerd, to avoid messing with your own if you have some already installed. That means you can run commands like "k3s kubectl ..."
  And thank you for the compliments on the stack.
- gbxk 4 days ago ago
  
  If you have any suggestion on how I can make this more friendly UX-wise to your personal usage, I am most interested to hear! And this will shape my roadmap.
alexeldeib 4 days ago ago

as someone in the space this ticks a lot of boxes: kubernetes-native, strong isolation, python sdk (ideal for ML scenarios). devmapper is a nice ootb approach.
Glancing at the readme, is your business model technical support? Or what's your plan with this?
Anything interesting to share around startup time for large artifacts, scaling, passing through persistent storage (or GPUs) to these sandboxes?
Curious what things like 'Multi-node cluster capabilities for distributed workloads' mean exactly? inter-VM networking?

[-]
- gbxk 4 days ago ago
  
  No business model short-term. My goal is broad adoption, 100% open-source.
  By multi-node I mean so far I only support 1 k8s node, i.e. 1 machine, but soon adding support for multiple. Still, on 20 CPUs I can run +50 VM pods with fractional vCPU limits.
  For GPU passthrough: not possible today because I use Firecracker as VMM. On roadmap: Add support for Qemu, then GPU passthrough possible.
  Inter-VM networking: it's already possible on single-node: 1 VM = 1 pod. Can have multiple pods per node (have a look at utils/stress-test.sh). Right now I default deny-all ingress for safety (because by default k8s allows inter pod communication), but can make ingress configurable.
  Startup time: a second, or a few seconds, depending on which base image (alpine, ubuntu, etc...) and whether you use a before_script or not (what I execute before the network lockdown)
  Large artifacts: you can configure resource allocated to a VM pod in the sandbox config and it basically uses k8s resource limits.
  Let me know if any other question! Happy to help
  
  [-]
  - yjftsjthsd-h 4 days ago ago
    
    > No business model short-term. My goal is broad adoption, 100% open-source.
    IMHO that's kind of a red flag. There's a happy path here where it's successful but stays low-maintenance enough that you just work on it in your spare time, or it takes of and gets community support, or you get sponsorships or such. But there's also an option where in a year or two it becomes your job and you decide to monetize by rug-pulling and announce that actually paying the bills is more important than staying 100% open source. Not a dig at you, just something that's happened enough times that I get nervous when people don't have a plan and therefore don't have a plan to avoid the outcome that creates problems for users.
    
    [-]
    - gbxk 4 days ago ago
      
      Sure one day if it really kicks off I could think of offering additionally a SaaS solution with paid enterprise features like SOC 2 compliance, RBAC, multiple clouds supported, etc. Why not. But I strongly believe that for it to be successful, it needs a strong open-source base. Then, billing huge companies for compliance features or huge usage makes sense. That would support development of the open-source part too.
      I like the Docker model, for instance: free for companies under 250 employees or $10m/y revenue.
      In any case, it will always be open-source.
      Those paid enterprise features wouldn't come from closed-source: they would come from compliance of a particular SaaS-offered infra setup, that anybody else could reproduce. Just like HuggingFace.
gbxk 4 days ago ago

Thanks everyone for the amazing feedback and discussion!
For anyone curious:
– Docs: https://docs.katakate.org
- LangChain Agent tutorial: https://docs.katakate.org/guides/langchain-agent
It's getting late where I am, so I'm heading to bed — looking forward to replying to any new comments tomorrow!
Bnjoroge 4 days ago ago

Great project. There's been multiple approaches/tools in the space(top of my head I can think of e2b, arrakis, claude's new tool). how is this different?

[-]
- gbxk 4 days ago ago
  
  Thanks! I'll review Arrakis and come back. E2B is often considered harder to setup and less AI engineers friendly for direct stack contributions, as Katakate is the only alternative fully implemented in Python (core modules, Typer CLI, FastAPI, Python SDK).
  Our native K8s support and exposition of K8s API also makes it friendly to devops.
  Finally, our deploy/infra stack is lean and tightly fits in a single Ansible playbook, which makes it easy to understand and contribute to, letting you rapidly gain full understanding and ownership of the stack.
re_spond 4 days ago ago

Looks like an interesting project. Do you have any comments on how it is different from running gVisor?

[-]
- gbxk 4 days ago ago
  
  Thanks! Yes: Katakate provides much stronger isolation, since it uses hardware virtualization (via Kata Containers and Firecracker) while gVisor relies purely on software sandboxing in user space.
  gVisor isolates containers by intercepting system calls in a user-space kernel, so it can still be vulnerable to sandbox escape via gVisor bugs, though not directly through Linux kernel exploits (since gVisor doesn’t expose the host kernel to the container).
  Katakate also provides more than isolation: it offers orchestration through Kubernetes (K3s)
  You could create a gVisor RuntimeClass in Kubernetes to orchestrate gVisor sandboxes, but that would require extra setup.
bigwheels 4 days ago ago

Is this basically an open-source DIY version of E2B?
If so, cool! AFAICT E2B is open-source licensed but tricky to setup.

[-]
- ushakov 4 days ago ago
  
  hey, I work at E2B, anything we can do to improve the setup for you?
  
  [-]
  - bigwheels 4 days ago ago
    
    I dig E2B, it's a great service and very cost effective. Thanks for all your hard work!
ygouzerh 4 days ago ago

Nice, seems way cheaper to use than the new Cloudflare Sandbox SDK solution

[-]
- gbxk 4 days ago ago
  
  Thanks, I'll review that one too and compare.
ed_mercer 4 days ago ago

Why do I need this if I already have containers and k8s for running agents?

[-]
- gbxk 4 days ago ago
  
  It is well known that containers do not provide you safe isolation. It is not their purpose. They share kernel and page cache with the host. Any kernel exploit gives to someone in a container potential root control of the host (see DirtyPipe, DirtyCow). That's why you need VM-level isolation.
  
  [-]
  - innanet-worker 4 days ago ago
    
    today i'm one of the lucky 10k https://xkcd.com/1053/
    
    [-]
    - gbxk 4 days ago ago
      
      Lucky you! And lucky me for sharing the info :)
ushakov 4 days ago ago

what does Katakana add on top of Kata?

[-]
- gbxk 4 days ago ago
  
  Katakate is built on top of Kata, and sets up a stack combining Kubernetes (K3s), Kata, Firecracker, and devmapper snapshotter for thin pool provisioning. Combining these tools together is highly non-trivial and can be a headache for many, especially for AI engs who are often more comfortable with Python workflows. The stack gets deployed with an Ansible playbook. It implements a CLI, API and Python SDK to make it super easy to use. A lot of defense in depth settings are also baked-in so that you don't need to understand those systems at a low level to get a secure setup.
whalesalad 4 days ago ago

From an outside perspective, this looks silly. Like fitting a square peg in a round hole. But I do ack "what if we could run vm's as easily as we run containers" use case and atm it seems like things like this (and katacontainers) are the only ways to do it. Wondering a few things: do all the layers of abstraction make things brittle and how is performance impacted?

[-]
- gbxk 4 days ago ago
  
  It uses Kata with Firecracker which gives you as light of a boot as it gets. Subsecond booting for instance is accessible with a lighter rootfs, which is also on the roadmap (one of the easiest items, actually). The k8s layer doesn't add overhead either compared to any other VM. If you want to compare to bare containers, depending on workload, you could see a 5% overhead due to virtualization. Exact overhead would depend on workload.