
This is what the rows look like:

https://huggingface.co/datasets/princeton-nlp/SWE-bench_Veri...

It's up to your retrieval system/model to selectively hunt for relevant context. Here are a few critiques of the benchy:

https://x.com/brhydon/status/1953648884309536958


