Hacker Newsnew | past | comments | ask | show | jobs | submit | cpeterso's commentslogin

Looks like LLMs also find Dafny easier to write than Lean. This study, “A benchmark for vericoding: formally verified program synthesis”, reports:

> We present and test the largest benchmark for vericoding, LLM-generation of formally verified code from formal specifications … We find vericoding success rates of 27% in Lean, 44% in Verus/Rust and 82% in Dafny using off-the-shelf LLMs.

https://arxiv.org/html/2509.22908v1


Not surprising, as Dafny is a bit less expressive (refinement instead of dependent types) and therefore easier to write. IMHO, it hits a very nice sweet spot. The disadvantage of Dafny is the lack of manual tactics to prove things when SAT/SMT automation fails. But this is getting fixed.

And they can get “too build to fail” bailouts from the US government after they are dependent on ChatGPT.

Good PR for Anthropic: the DoD already has contracts with OpenAI and xAI, but is still so eager to use Claude that they must threaten Anthropic.

> Version two: hide foes?

That's a good idea.

Here's my bad idea: the extension auto downvotes foes and auto upvotes friends. :)


automations get your account in trouble, it's against the rules

It already happens today

If you haven't already, check out Microsoft's "The Windows® 95 User Interface: A Case Study in Usability Engineering" report summarizing some of the Windows 95 designers' user research:

https://dl.acm.org/doi/fullHtml/10.1145/238386.238611


I read it and it partially inspired the entire project. It made me realise how inaccessible modern design is despite being held up as best in class and easy to use

Can the app run on smart glasses, warning you of other smart glasses users nearby? You might not see the notification on your phone.


That would be like antropic and google crying about china stealing the weights that were originally built by scraping as fuck stolen content :-)


> That would be like antropic and google crying about china stealing the weights that were originally built by scraping as fuck stolen content :-)

do you really see a relation between the two, or are you just willfully 'buying an advertisement' by trying to shape a metaphor from the social qualms that you wish to rebroadcast to people?

in other words, no -- this isn't at all similar to the companies that steal media in order to train models only to complain about similar theft from other companies targetted towards them -- but I agree with the motivation, fuck em; they're crooks...

but don't weaken metaphors simply to advertise a social injustice. If you want to do that, don't hijack conversations abroad.


Well, at least Chinese paid for api access and all the tokens.


"Glasses detected within 3 inches."


Cheeky

There are a few tools to compile COBOL code to .NET without needing to rewrite in a different programming language.


This is the first thing that occurred to me. The people above suggesting a cobol to python or go update confuse the heck out of me. Why not just convert to vanilla jacascript at that point? Bizarre


You usually want a language that provides compile-time check and you already use and know.


GitHub recently added new repository settings to turn off pull requests or limit them to approved contributors. The announcement doesn't mention AI agents, but that's certain relevant.

https://github.com/orgs/community/discussions/187038


GH also needs to find a way to stop AI scraping of IP.

(Or not. It might be lucrative to host some novel algorithm on GH under a license permitting its use in generative LLM results, at a reasonable per-impression fee.)


Or they could write native client apps using Qt to handle that “last 10%” of native app polish.


> So I gave up.

Try your mobile browser’s reader view mode.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: