I need a version of this which swears loudly when an assumption it made turns out to be wrong, with the volume/passion/verbosity correlated with how many tokens it's burned on the incorrect approach.
i didn't realize i needed the volume scaling with tokens burned as much as i do now xD. imagine the screaming when it confidently refactors something for 40k tokens and then finds out the thing it deleted was load-bearing
I tried it but all I hear is a choir of angels, is it broken?
I guess you’re working on a greenfield project?
Actually, that's not a bad idea!
Does this actually relate to the code quality being observed by the agent? The readme isn't very clear on that IMO. I have some projects I'd love to try this out on, but only if I get an accurate representation of the LLM's suffering.
https://github.com/AndrewVos/endless-toil/blob/main/plugins/...
So it is left up to the agent to decide.
The agent is instructed to execute this Python script: https://github.com/AndrewVos/endless-toil/blob/main/plugins/...
So it looks like it's mainly looking for FIXME/TODO etc. comments, deep nesting, large files, broad catches, stuff like that.
I'm very open to suggestions, but currently it's a very simple scan of the code. Check the Python scripts.
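For anyone wondering what a "very simple scan" of this kind could look like, here is a rough sketch. To be clear, this is not the script from the repo: the function name `scan_source`, the regexes, and every threshold below are made up for illustration, and it only understands space-indented, Python-style `except` clauses.

```python
import re

# Everything below is a hypothetical illustration, not the repo's actual script.
MARKER_RE = re.compile(r"\b(FIXME|TODO|HACK|XXX)\b")
# Matches bare "except:", "except Exception:" and "except Exception as e:".
BROAD_CATCH_RE = re.compile(
    r"^\s*except\s*(Exception|BaseException)?\s*(as\s+\w+\s*)?:"
)
MAX_LINES = 500        # assumed cutoff for a "large file"
MAX_INDENT_LEVELS = 5  # assumed cutoff for "deep nesting"
INDENT_WIDTH = 4       # assumes 4-space indentation; tabs are not counted


def scan_source(text):
    """Count simple code-smell signals in a single source file's text."""
    lines = text.splitlines()
    findings = {
        "markers": 0,
        "broad_catches": 0,
        "deep_nesting": 0,
        "large_file": len(lines) > MAX_LINES,
    }
    for line in lines:
        if MARKER_RE.search(line):
            findings["markers"] += 1
        if BROAD_CATCH_RE.match(line):
            findings["broad_catches"] += 1
        stripped = line.lstrip(" ")
        indent_levels = (len(line) - len(stripped)) // INDENT_WIDTH
        # Skip blank lines so trailing whitespace is not counted as nesting.
        if stripped and indent_levels >= MAX_INDENT_LEVELS:
            findings["deep_nesting"] += 1
    return findings
```

You would call `scan_source(open(path).read())` per file and let the agent turn the counts into however much groaning it sees fit. A line-based scan like this is cheap but crude: it will happily count a TODO inside a string literal.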
This desperately needs a demo video in the repo.
hear hear!!!
I really want this! Any chance of a Cursor version?
I just added a Cursor plugin to the repo, let me know how it goes!
Honestly, I don't care about Opus 4.7. This is the true evolution of agentic coding.