I cut my AI API costs 99% by switching from Claude to DeepSeek

(twitter.com)

12 points | by agentbc9000 4 hours ago ago

9 comments

sibidharan 4 hours ago ago

Which models are we talking about? Is there any degradation in quality, long context retrieval?

[-]
- throwa356262 2 hours ago ago
  
  The tweet mentioned deepseek V4 flash.
  From HF: 284B parameters (13B active), 1M context window.
  This is indeed some kind of compressed context and the quality goes down as the context grows. IIRC the V4 paper had some numbers on this
  https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash
  
  [-]
  - wilbur_whateley 2 hours ago ago
    
    V4 flash is much worse than any Claude model. If you're doing something simple, it can be a good way to save money though.
    
    [-]
    - throwa356262 12 minutes ago ago
      
      I agree that Claude is better (definitely better than the flash version which is relatively small). But...
      I actually canceled my Claude Code plan a few months back after trying out some of the "lesser" models on openrouter. They seem to work as just as well (or just as bad) for my coding tasks.
    - agentbc9000 17 minutes ago ago
      
      [flagged]
- ninju an hour ago ago
  
  It depends on how mature the DeepSeek model became before OpenAI noticed that they were wholesale replicating their model and starting blocking access
  https://www.reuters.com/world/china/openai-accuses-deepseek-...
- agentbc9000 2 hours ago ago
  
  [flagged]
- agentbc9000 2 hours ago ago
  
  [flagged]
agentbc9000 4 hours ago ago

[flagged]