nitramm
View original ↗Build a standardized assessment platform for AI-assisted coding that evaluates not just correctness, but the orchestration of agentic workflows. Focus on multi-step reasoning tasks rather than simple code snippets.
Suggested repo: eval-agent
"Move beyond LeetCode: real-world agentic assessment infrastructure."
Estimated effort: 80h