Repo: https://github.com/Significant-Gravitas/AutoGPT
Forge Setup
./run setup
./run agent start forge
Benchmark Setup
Agents: some agents have side effects and some do not.
Among the agents with side effects, some may cause harm (e.g. sending an email, deleting a file). For those agents, reliability needs to be much higher than 99.9%: the action has to work every time.
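A rough way to see why 99.9% is not enough: if each harmful action succeeds independently with probability 0.999, the chance of at least one failure grows quickly with the number of actions. The sketch below is only illustrative arithmetic. The function name and the independence assumption are not from the AutoGPT repo or its benchmark.

```python
def failure_probability(per_action_reliability: float, num_actions: int) -> float:
    """Probability that at least one of num_actions actions fails.

    Assumes each action succeeds independently with the same probability
    (a simplifying assumption made for this sketch).
    """
    return 1.0 - per_action_reliability ** num_actions


if __name__ == "__main__":
    # Compare how the risk of at least one failure grows with usage.
    for n in (10, 100, 1000):
        p = failure_probability(0.999, n)
        print(f"{n} actions at 99.9% reliability: P(at least one failure) = {p:.3f}")
```

Under these assumptions, an agent that sends 1000 emails at 99.9% per-action reliability is more likely than not to get at least one of them wrong.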
Followup