I’ve been testing on a large open source Java project that I contribute to.
It took a lot of trial and error to get the setup script working. If commands were too verbose, everything would crash, and if the script didn't output anything for a while, it got cancelled because it was treated as stalled.
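For anyone hitting the same two failure modes, here is a rough sketch of a wrapper that sends a command's output to a log file and prints a short heartbeat line while it runs. It assumes a bash setup script. `run_quiet`, `HEARTBEAT_SECS`, and the 40-line tail are made-up names and values for illustration, not anything Codex provides, and the real output and idle limits would need to be found by experiment.

```shell
# Hypothetical helper (not part of Codex): run a command with its output
# redirected to a log file, printing a heartbeat so the script never goes silent.
run_quiet() {
  local log="$1"; shift
  local interval="${HEARTBEAT_SECS:-30}"   # assumed value; tune to the real idle limit
  local elapsed=0 status=0

  "$@" >"$log" 2>&1 &
  local pid=$!

  # Poll once per second; print one short line every $interval seconds.
  while kill -0 "$pid" 2>/dev/null; do
    sleep 1
    elapsed=$((elapsed + 1))
    if [ $((elapsed % interval)) -eq 0 ]; then
      echo "still running (${elapsed}s): $1"
    fi
  done

  # Collect the exit code; on failure show only the end of the log.
  wait "$pid" || status=$?
  if [ "$status" -ne 0 ]; then
    echo "command failed with exit code $status; last lines of $log:"
    tail -n 40 "$log"
  fi
  return "$status"
}
```

Something like `run_quiet /tmp/mvn.log mvn -B -q dependency:go-offline` would then keep the console quiet during the download while still showing signs of life.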
There was also an issue with Maven not detecting the default Linux HTTP proxy.
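Maven reads its proxy configuration from `settings.xml` rather than from the `http_proxy` environment variable, so one possible workaround is to generate that file in the setup script. The sketch below assumes the proxy URL has the form `http://host:port` with no credentials. `maven_proxy_settings` and the `env-proxy` id are hypothetical names chosen for the example.

```shell
# Hypothetical helper: print a minimal Maven settings.xml with one proxy entry
# derived from a URL of the form http://host:port (port required, no credentials).
maven_proxy_settings() {
  local url="${1#*://}"    # drop the scheme
  url="${url%%/*}"         # drop any trailing path
  local host="${url%%:*}"  # text before the first colon
  local port="${url##*:}"  # text after the last colon
  cat <<EOF
<settings>
  <proxies>
    <proxy>
      <id>env-proxy</id>
      <active>true</active>
      <protocol>http</protocol>
      <host>${host}</host>
      <port>${port}</port>
    </proxy>
  </proxies>
</settings>
EOF
}
```

It could be used as `maven_proxy_settings "$http_proxy" > ~/.m2/settings.xml`. Note that this overwrites any existing settings file, so it only suits a throwaway environment.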
I’ve also put a lot of effort into trying to create a good AGENTS.md file.
So far it’s been very slow going, but I have managed to get it to start porting some of my unit tests to use Testcontainers for spinning up Elasticsearch.
One thing that would be great is if Codex would check the CI status for the PR it created and see if something didn’t pass. That way it could actually be a bit more like a human.
Thanks for sharing your experience!