Show HN: Reducing LLM input tokens by 70%

(adola.app)

5 points | by Jbunga 4 hours ago ago

5 comments

  • archleaf 4 hours ago ago

    Nice idea. Can I choose a strategy for token reduction, based on what I'm optimzing for? I might be ok with a quality drop for a great cost savings, for example.

    • Jbunga 4 hours ago ago

      Yeah, you can set roughly the target ratio through the api (for example, target_ratio=.3), though our api will try to maximize the quality given the this target ratio (and it might add a couple more tokens to do so)

  • zahlekhan an hour ago ago

    why is the landing page straight up taken from wisprflow?

  • jubilee88 4 hours ago ago

    [dead]

  • bingbong06 4 hours ago ago

    [dead]