Hacker Newsnew | past | comments | ask | show | jobs | submit | irthomasthomas's commentslogin

That still leaves open the possibility that they reduce model quality due to profit. ;p

Apparently they came with 75 meters of bombing an nuclear power plant already. A plant with 10x the material Chernobyl had, and in vulnerable above-ground storage. https://www.ndtv.com/world-news/middle-east-war-why-attacks-...

Is this incident not reason enough? Astronauts in space are needing remote support to debug it, and taking up priceless mission time.

Sure, but bespoke software isn't necessarily going to be more reliable.

https://www.joelonsoftware.com/2000/04/06/things-you-should-...

> The idea that new code is better than old is patently absurd. Old code has been used. It has been tested. Lots of bugs have been found, and they’ve been fixed.


This quote is completely and totally irrelevant. Nobody is saying they should code a new Outlook. If they did code something, it would be significantly smaller in scope and rigorously tested like spacebound programs in the past were. "New space-engineering-grade code created with actual engineering practices" is absolutely going to be more reliable than "old bloated commercial shitware". But I guess software engineering is a lost art, so it can't be helped.

It's also going to take a hell of a lot longer and cost more than buying an Outlook license. If I was lead on that project, you'd have an uphill battle trying to convince me that spending $100k+ on an email solution unless you can point to specific, serious deficiencies in the existing off the shelf solutions.

Software Engineering is far from a lost art: part of the practice is intelligently making cost-benefit decisions.


The current solution is literally causing problems in space. Space-grade engineering is expensive, but having things go wrong on your already very expensive mission is even more expensive.

Until we've had this failure, I do agree that using COTS software was the logical choice. And now we know better.

Sure, but people who didn't know better until this particular incident do not deserve the title "engineer". Being able to classify and manage risks before they happen is engineering 101.

Engineering requires working around constraints as well - and a major constraint of any project I've worked on was budget. If they wrote a new email client and it had some bug, we'd be laughing about why they didn't use one of the COTS email clients.

It’s a personal communication device. It’s not mission critical.

Alpine and mutt are about as far from bespoke as it gets. Both are far less likely to suffer from bugs than outlook.

Alpine and Mutt are about 20 and 30 years old, respectively.

And that problem would go away with a 30 year-old solution?

That problem would be much less likely with a minimalist battle tested OSS solution whose maintainers and users have decidedly different priorities than those governing something like outlook or even thunderbird.

The higher the stakes the more valuable minimalism becomes.


This just proves its vibe coded because LLMs love writing solutions like that. I probably have a hundred examples just like it in my history.

Actually, this could be a case where its useful. Even it only catches half the complaints, that's still a lot of data, far more than ordinary telemetry used to collect.

Efficiency gains can be used to make existing models more profitable, or to make new larger and more intelligent models.


Some yes, others no. Distillation and quantization can't be used to make new base models since they require a preexisting one.


it enables models larger than was previously possible.


No because the base model from which the distilled or quantized models are derived is larger.


Its possibly just an SEO trick. People have been calling Thiel the antichrist for a long time.


A friend made a cli tool, ideal for agents, which does this and can aggregate intelligence across multiple platforms.

https://github.com/bm-github/owasp-social-osint-agent


Have you tried meta-prompts e.g. "Rewrite the prompt to improve the perceived taste and expertise of the author"


Opus doubled in speed with version 4.5, leading me to speculate that they had promoted a sonnet size model. The new faster opus was the same speed as Gemini 3 flash running on the same TPUs. I think anthropics margins are probably the highest in the industry, but they have to chop that up with google by renting their TPUs.


The conspiracy theorist side of me whispers "instead of the rumored Sonnet 5.0 you got Opus 4.6...suspicious"


They will rename it The Free Democratic Republic of America.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: