Save Claude Code Tokens with Smart Routing

(github.com)

11 points | by FrancescoMassa a day ago ago

4 comments

  • nithiink a day ago ago

    How do you handle prompt caching? A lot of cost savings for a single model chat come from cache hits on the conversation context, and switching models invalidates that cache — the new model has to reprocess everything at full input price.

  • undefined a day ago ago
    [deleted]
  • patch_dev a day ago ago

    What does this solve that well used subagents doesn't solve already?

    • FrancescoMassa 21 hours ago ago

      On our tests subagents & well used workflows are 20-30% more expensive for context & token efficiency