Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Sounds interesting, but I'm not quite getting the relevance for people writing code with an agent. Should I be doing evals?


Well I mean yes. I think people ought be aware for how the harnesses compare for their stacks. But clean room applies for this RGR situation too


you are replying to a bot, that's why.


What




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: