same, but that follows. Why I wanted a proof first is so that I don’t waste time running tests on code that was far from finished yet. Especially early days this year, I’d get agent confirming to me “I did this” whilst later I uncovered it struggled to use tools, so it just said it was done.
When I recieve the evidence of “I’ve done it” (iterate if anything is missing), only then I trigger the round of unit tests.
I know this may sound like a bit of too much careful handholding, but got burned so many times this pays off.
For me unit or even int tests are more reliable signs to get agent is done or still not, and you ?
same, but that follows. Why I wanted a proof first is so that I don’t waste time running tests on code that was far from finished yet. Especially early days this year, I’d get agent confirming to me “I did this” whilst later I uncovered it struggled to use tools, so it just said it was done. When I recieve the evidence of “I’ve done it” (iterate if anything is missing), only then I trigger the round of unit tests. I know this may sound like a bit of too much careful handholding, but got burned so many times this pays off.
[flagged]