Create an automated 'agent-loop' debugger that traces each step of the multi-step tasks run by the new Opus model and flags hallucination loops and unnecessary tool calls.
Suggested repo: loop-watch
"Don't let your agents wander off into infinite loops."
Estimated effort: 70h
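
A minimal sketch of the core check, assuming the agent's tool calls can be observed as a (tool name, arguments) pair per step. It treats repeated identical calls as a proxy for a loop, and any call already made with the same arguments as potentially unnecessary. The names LoopWatch, call_key, window and max_repeats are hypothetical, the defaults are arbitrary, and this does not inspect model output for hallucinated content.

```python
import json
from collections import Counter, deque
from dataclasses import dataclass, field


def call_key(tool: str, args: dict) -> str:
    """Fingerprint a tool call: tool name plus its arguments as sorted-key JSON."""
    return tool + ":" + json.dumps(args, sort_keys=True, default=str)


@dataclass
class LoopWatch:
    # Hypothetical defaults; tune them for the agent being debugged.
    window: int = 6        # number of most recent calls to inspect
    max_repeats: int = 3   # identical calls inside the window that count as a loop
    recent: deque = field(default_factory=deque)
    seen: Counter = field(default_factory=Counter)

    def record(self, tool: str, args: dict) -> list[str]:
        """Record one tool call and return the warnings it triggers, if any."""
        key = call_key(tool, args)
        warnings = []

        # The same call was made earlier in this task: likely unnecessary.
        if self.seen[key] > 0:
            warnings.append(f"redundant call: {tool} already ran with these arguments")
        self.seen[key] += 1

        # Keep only the last `window` calls, then count repeats among them.
        self.recent.append(key)
        if len(self.recent) > self.window:
            self.recent.popleft()
        repeats = self.recent.count(key)
        if repeats >= self.max_repeats:
            warnings.append(
                f"possible loop: {tool} repeated {repeats} times "
                f"in the last {len(self.recent)} calls"
            )
        return warnings


if __name__ == "__main__":
    watch = LoopWatch()
    for step in range(1, 4):
        for message in watch.record("search", {"q": "release notes"}):
            print(f"step {step}: {message}")
```

Keying on the exact arguments keeps the check cheap and deterministic, but it misses loops where the agent varies its arguments slightly on each pass; catching those would need a fuzzier comparison.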