3 comments

  • balousek 2 days ago

    Maybe I'm missing something, but I'm confused why you compared gemini 2.5 against opus 4.6 and gpt 5.3-codex? I assume you used sonnet 4.5 because 4.6 was released in the last 2 days, but opus 4.6 and 5.3-codex were released 2 weeks ago, so gemini 3 was definitely around then.

  • ctmnt 2 days ago

    Very interesting analysis and great write-up. Gemini the semi.

    Regarding Opus’s use of the git history: I run Opus and Codex in parallel a lot, or have them go back and forth with each other. For interesting problems I’ll give them both the same prompt at the same moment and see what they each come up with. My guess is that if you hadn’t told Opus that “it worked fine recently” it wouldn’t have gone first to git like that. Both Opus 4.6 and Codex 5.3 are incredibly good at reading code and diagnosing bugs.

  • matthewmueller 2 days ago

    Context lens is a great name!