HN
New
Show
Ask
Jobs
Built with Svelte
We used sparse autoencoders to explain LLM moderation flags of violent threats
(variance.co)
6 points | by
karinemellata
13 hours ago ago
No comments yet.
No comments yet.