HN
New
Show
Ask
Jobs
Built with Svelte
Real-time LLM Inference on Standard GPUs (3k tokens/s per request)
(blog.kog.ai)
7 points | by
morgangiraud
6 hours ago ago
No comments yet.
No comments yet.