Show HN: Reducing LLM input tokens by 70%

(adola.app)

5 points | by Jbunga 4 hours ago ago

5 comments

archleaf 4 hours ago ago

Nice idea. Can I choose a strategy for token reduction, based on what I'm optimzing for? I might be ok with a quality drop for a great cost savings, for example.

[-]
- Jbunga 4 hours ago ago
  
  Yeah, you can set roughly the target ratio through the api (for example, target_ratio=.3), though our api will try to maximize the quality given the this target ratio (and it might add a couple more tokens to do so)
zahlekhan an hour ago ago

why is the landing page straight up taken from wisprflow?
jubilee88 4 hours ago ago

[dead]
bingbong06 4 hours ago ago

[dead]