3 comments

  • crionuke 5 hours ago ago

    For me unit or even int tests are more reliable signs to get agent is done or still not, and you ?

    • LeoStehlik 5 hours ago ago

      same, but that follows. Why I wanted a proof first is so that I don’t waste time running tests on code that was far from finished yet. Especially early days this year, I’d get agent confirming to me “I did this” whilst later I uncovered it struggled to use tools, so it just said it was done. When I recieve the evidence of “I’ve done it” (iterate if anything is missing), only then I trigger the round of unit tests. I know this may sound like a bit of too much careful handholding, but got burned so many times this pays off.

  • jasondillingham 2 hours ago ago

    [flagged]