Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It’s pretty good, the interesting thing is when it fails it seems to often be able to reason about what went wrong. So when we get CoT scaffolding for this it’ll be incredibly competent.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: