namnnumbr · 2026-05-23T00:59:34 1779497974

namnnumbr · 2026-05-23T00:59:08 1779497948

stopsloppypasta.ai

namnnumbr · 2026-05-06T19:10:21 1778094621

AFAIK "orbital data centers" are a bunch of nonsense.

1. GPUs create heat. There's no efficient way to get rid of the heat in space (vacuum is an insulator).

2. Die-shrink makes modern processors and memory more and more susceptible to radiation; shielding is possible, but adds cost + mass (which adds cost)

namnnumbr · 2026-04-24T12:39:39 1777034379

I really like latent.space and simonwillison.net.

Also (shameless self-promo) I publish a 2x weekly blog just to force myself to keep up: https://aimlbling-about.ninerealmlabs.com/treadmill/

namnnumbr · 2026-04-17T19:51:35 1776455495

Yes! I'd be totally happy with today's sonnet 4.6 if I could run it locally.

If you can forgive the obviously-AI-generated writing, [CPUs Aren't Dead](https://seqpu.com/CPUsArentDead) makes an interesting point on AI progress: Google's latest, smallest Gemma model (Gemma 4 E2B), which can run on a cell phone, outperforms GPT-3.5-turbo. Granted, this factoid is based on `MT-Bench` performance, a benchmark from 2023 which I assume to be both fully saturated and leaked into the training data for modern LLMs. However, cross-referencing [Artificial Analysis' Intelligence Index](https://artificialanalysis.ai/models?models=gemma-4-e2b-non-...) suggests that indeed the latest 2B open-weights models are capable of matching or beating 175B models from 3-4 years ago. Perhaps more impressive, [Gemma 4 E4B matches or beats GPT-4o](https://artificialanalysis.ai/models?models=gemma-4-e4b%2Cge...) on many benchmarks.

If this trend continues, perhaps we'll have the capabilities of today's best models available to reasonably run on our laptops!

namnnumbr · 2026-04-17T16:55:01 1776444901

The title is a misdirection. The token counts may be higher, but the cost-per-task may not be for a given intelligence level. Need to wait to see Artificial Analysis' Intelligence Index run for this, or some other independent per-task cost analysis.

The final calculation assumes that Opus 4.7 uses the exact same trajectory + reasoning output as Opus 4.6. I have not verified, but I assume it not to be the case, given that Opus 4.7 on Low thinking is strictly better than Opus 4.6 on Medium, etc., etc.
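
To make the tokens-versus-cost distinction concrete, here is a minimal sketch of a per-task cost comparison. Every number in it is hypothetical and chosen only for illustration: the prices, the token counts, the 30% tokenizer inflation, and the 40% cut in reasoning output are all assumptions, not measurements of either model.

```python
def cost_per_task(input_tokens, output_tokens, input_price, output_price):
    """Dollar cost of one task; prices are in dollars per million tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical prices: only the 5x output-to-input ratio matters here.
INPUT_PRICE = 5.0
OUTPUT_PRICE = 25.0

# Hypothetical baseline task on the older model.
old_cost = cost_per_task(100_000, 50_000, INPUT_PRICE, OUTPUT_PRICE)

# Hypothetical newer model: the tokenizer inflates every count by 30%,
# but the model emits 40% less reasoning output for the same task.
TOKENIZER_INFLATION = 1.3
REASONING_SHRINK = 0.6
new_cost = cost_per_task(
    100_000 * TOKENIZER_INFLATION,
    50_000 * REASONING_SHRINK * TOKENIZER_INFLATION,
    INPUT_PRICE,
    OUTPUT_PRICE,
)

print(f"old: ${old_cost:.3f}  new: ${new_cost:.3f}")
```

Under these made-up inputs the per-task cost falls even though every token count is inflated, which is why a per-task measurement is needed before concluding anything from tokenizer changes alone.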

alach11 · 2026-04-17T21:15:34 1776460534

I ran an internal (oil and gas focused) benchmark yesterday and found Opus 4.7 was 50% cheaper than Opus 4.6, driven by significantly fewer output tokens for reasoning. It also scored 80% (vs. 60%).

stingraycharles · 2026-04-18T00:23:02 1776471782

That’s just adaptive reasoning, not related to the increased tokenizer costs.

simianwords · 2026-04-18T07:56:13 1776498973

Why would I as a user be concerned about one over the other?

stingraycharles · 2026-04-18T08:28:59 1776500939

Because it teaches you cause and effect in terms of costs and quality.

Unless you want to keep complaining about the model being nerfed.

bisonbear · 2026-04-17T18:25:50 1776450350

yep, ran a controlled experiment on 28 tasks comparing old opus 4.6 vs new opus 4.6 vs 4.7, and found that 4.7 is comparable in cost to old 4.6, and ~20% more expensive than new 4.6 (because new 4.6 is thinking less)

https://www.stet.sh/blog/opus-4-7-zod

cced · 2026-04-17T18:47:09 1776451629

So they nerfed 4.6 to make way for 4.7?

Progress. /s

bisonbear · 2026-04-17T18:47:46 1776451666

> they nerfed 4.6 to make way for 4.7?

> Progress. /s

pretty much, lmao. my theory is 4.6 started thinking less to save compute for 4.7 release. but who knows what's going on at anthropic

GorbachevyChase · 2026-04-18T02:33:40 1776479620

A fun conspiracy theory I have is that Mythos isn’t actually dangerous in any serious sense. They just can’t reliably serve a 10T model. So they have to make up a reason to limit customers.

kirubakaran · 2026-04-17T19:33:11 1776454391

"but who knows what's going on at anthropic"

People at Anthropic, of course

dang · 2026-04-17T21:43:16 1776462196

(Submitted title was "Claude Opus 4.7 costs 20–30% more per session". We've since changed it to a (more neutral) version of what the article's title says.)

jofzar · 2026-04-18T00:47:15 1776473235

I think it's time to have previous titles show as an edit * icon that can show the previous title.

This is not the first time the more neutral title (which imo is better) has left me confused about why everyone is saying something different in the comments.

dang · 2026-04-18T04:27:07 1776486427

That's probably too much ceremony for HN but petercooper made a really nice HN title edit tracker which is probably still running. Let me see if I can dig it up for you...

Edit: hmm - maybe not: https://news.ycombinator.com/item?id=21617016.

aray07 · 2026-04-17T17:16:11 1776446171

I'm running some experiments on this, but based on what I have seen in my own personal data, I don't think this is true:

"given that Opus 4.7 on Low thinking is strictly better than Opus 4.6 on Medium, etc., etc."

Opus 4.7 in general is more expensive for similar usage. Now we can argue that it provides better performance, all else being equal, but I haven’t been able to see that

namnnumbr · 2026-04-17T19:40:11 1776454811

Following up on "strictly better" via plot in release announcement:

https://www.anthropic.com/_next/image?url=https%3A%2F%2Fwww-...

unpwn · 2026-04-17T17:17:32 1776446252

Very unlikely that the article is wrong. The 4.7 intelligence bump is not that big, plus most of the token spend is in inputs/tool calls etc., much of which won't change even with this bump.

namnnumbr · 2026-04-17T19:43:48 1776455028

IMO, you're incorrect:

1. In my own use, since 1 Apr this month, very heavy coding:

> 472.8K Input Tokens +299.3M cached

> 2.2M Output Tokens

My workloads generate ~5x more output than input, and output tokens cost 5x more per token... output dominates my bill at roughly 25x the cost of input. (Even more so when you consider cache hits!) If Opus 4.7 was more efficient with reasoning (and thus output), I'd likely save considerable money (were I paying per-token).

2. Anthropic's benchmarks DO show strictly-better (granted they are Anthropic's benchmarks, so salt may be needed) https://www.anthropic.com/_next/image?url=https%3A%2F%2Fwww-...
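
As a rough check of the arithmetic in point 1, here is a minimal sketch using the token counts quoted above and the stated 5x output-to-input price ratio. Cached tokens are left out because the thread does not give a cache price, and the function name is only illustrative.

```python
# Token counts quoted in point 1 (uncached input and output only).
INPUT_TOKENS = 472_800
OUTPUT_TOKENS = 2_200_000

# From the comment: an output token costs 5x an input token.
OUTPUT_PRICE_MULTIPLIER = 5.0


def output_to_input_cost_ratio(input_tokens, output_tokens, multiplier):
    """How many times more is spent on output than on uncached input."""
    return (output_tokens * multiplier) / input_tokens


volume_ratio = OUTPUT_TOKENS / INPUT_TOKENS
cost_ratio = output_to_input_cost_ratio(
    INPUT_TOKENS, OUTPUT_TOKENS, OUTPUT_PRICE_MULTIPLIER
)

print(f"output/input volume: {volume_ratio:.2f}x")
print(f"output/input cost:   {cost_ratio:.1f}x")
```

On these counts the volume ratio comes out a little under 5x, so the cost ratio lands somewhat below the rounded 25x figure, which does not change the conclusion that output dominates uncached input.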

namnnumbr · 2026-03-16T00:45:48 1773621948

if you used an AI, I'd love to see the prompts you used to get such human grammar and spelling errors

0xbadcafebee · 2026-03-16T14:48:11 1773672491

  Write a response to this website: https://stopsloppypasta.ai/
  
  Make sure to avoid all common AI-isms and not make it look like it was written by AI. Include mistakes, don't use em-dashes, don't use common AI phrases, etc. Plan out what would normally look like AI first, and avoid those things. Also don't make it a narrative, make it one paragraph that is simple and to the point. Try to have a snarky attitude.

r-w · 2026-03-16T03:09:44 1773630584

Why bake it into the prompt when a regex will do?
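
A minimal sketch of that idea, assuming the goal is only to strip em dashes (one of the tells named in the prompt) from generated text. The function name and the choice to substitute a comma are illustrative assumptions, not anything from the thread.

```python
import re

# Match an em dash (U+2014) together with any whitespace around it.
EM_DASH_PATTERN = re.compile(r"\s*\u2014\s*")


def strip_em_dashes(text: str) -> str:
    """Replace each em dash, and the spaces around it, with a comma and a space."""
    return EM_DASH_PATTERN.sub(", ", text)


sample = "It works\u2014mostly \u2014 but check the output"
print(strip_em_dashes(sample))
```

En dashes in numeric ranges are deliberately left alone, since rewriting those would change the meaning.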

namnnumbr · 2026-03-15T23:27:04 1773617224

Oh, I 100% acknowledge the site itself was LLM generated. I'm not a web designer, so I needed a lot of help making a visually appealing site, even if that design language is at this point LLM trope.

However, the essay and the guidelines were all human-written!

thinkingemote · 2026-03-16T08:07:33 1773648453

by "human-written" do you mean you just used an LLM to help with the grammar and spelling and formatting and to think up some use cases, but it's entirely "my own words"?

Terretta · 2026-03-15T23:39:26 1773617966

Hits you in the first row of buttons with the classic gen-AI slop "Why It Matters".

So trace* through ninerealmlabs and ahgraber and sure enough:

  I used AI:
  - to help build this website.
  - to help generate examples of sloppypasta
    based on my original guidance
  - to proofread and review the human-written
    copy to provide a critical review
  - to improve my arguments and ensure clarity.

Kudos for being forthright.

---

* Turns out clicking "Open Source" bottom right gets there faster!

namnnumbr · 2026-03-15T23:45:47 1773618347

I talked myself in circles on that "why it matters" heading but ultimately couldn't come up with a better one. "The problem" has similar ai-slop feel, and "the rant" // "the rules" didn't really evoke the feeling I wanted.

Happy to take suggestions on this!

ahyangyi · 2026-03-16T11:44:52 1773661492

No, not just that heading, but also the obsession with comparison tables.

yawnxyz · 2026-03-16T16:28:25 1773678505

I believe you, but the AI-looking website makes me default to thinking that the text itself is AI generated

efilife · 2026-03-16T08:21:51 1773649311

It's not difficult to create a visually appealing website. You don't have to be a designer. Many of us here aren't designers and have beautiful sites. Have you tried doing it yourself?

theshrike79 · 2026-03-17T13:11:09 1773753069

You need to make it look more like Grugbrain dev: https://grugbrain.dev/

Authentic human brutalism =)

slopinthebag · 2026-03-16T08:10:39 1773648639

This entire post is very avant garde. AI slop about how it's rude to share AI slop posted on an AI slopsite. Very well done.

rrr_oh_man · 2026-03-15T23:38:59 1773617939

Credit to you for your candor!

I'm possibly too jaded / cynical already...

Cthulhu_ · 2026-03-16T09:25:40 1773653140

As an alternative to LLMs, you can just download ready made themes off the internet, or there's a bajillion site creators with premade themes.

ricANNArdo · 2026-03-16T12:50:40 1773665440

Or alternatively: https://motherfuckingwebsite.com/

namnnumbr · 2026-03-15T23:08:51 1773616131

I acknowledge that those likely to copypaste slop aren't likely to find this article themselves, but I built the page to be shared and to guide discussions around etiquette, like nohello.net or dontasktoask.com. IMO a common understanding of AI etiquette would provide social pressure to halt some of these behaviors.

I honestly don't mind someone else's AI as long as I can trust it/them. One problem I have with sloppypasta specifically is that it reads as raw LLM output and the user isn't transparent about how they worked with the AI or what they verified. "ChatGPT says" isn't enough; for me to avoid inheriting a verification burden, I'd also need to understand what they were prompting for, if they iterated with the AI, and if/what/how they validated.

(the other problem is that dumping a multi-paragraph response in the midst of a chat thread is just obnoxious, but that's true even if it's artisanal human-written text)

lovemenot · 2026-03-15T23:49:43 1773618583

Couple of expressions from pre-AI culture: "RTFM", "Google is your friend". These were well-used because they are directed, pithy, abrasive.

(n)amow(?): (not) All my own work ?

username223 · 2026-03-16T02:32:37 1773628357

Good point: RTFM and (wall of slop) are two ways of telling someone that responding to them is not worth your time that are both ruder and more time-consuming than simply saying nothing. Explaining the culture of RTFM, i.e. "if there was any way you could possibly have found the answer otherwise, you should never have asked the question" to non-tech friends usually results in disbelief.

But the slop-wall is even worse, as it wastes the questioner's time in figuring out that they're just getting slop. At least RTFM is efficient.

no-name-here · 2026-03-16T07:04:29 1773644669

Clickable links for URLs mentioned in parent comment:

https://nohello.net

https://dontasktoask.com

Aeolun · 2026-03-15T23:31:27 1773617487

Yes, I can replace the link to nohello in my automated responses now :)

madrox · 2026-03-16T00:57:33 1773622653

I think you will find you will get farther by offloading this unpleasantness to an AI and open sourcing it rather than teaching etiquette to the internet, a place not known for its decency.

YurgenJurgensen · 2026-03-16T02:31:36 1773628296

There’s a certain very satisfying force to turning something into a static website that you can point people at. The Internet equivalent of “don’t make me tap the sign”; especially in an era of AI-slop.

namnnumbr · 2026-03-15T22:47:21 1773614841

100% - was inspired by and quote "It's rude to show AI output to people" in this. Thanks for linking the discussions!