Hacker Newsnew | past | comments | ask | show | jobs | submit | Nav_Panel's commentslogin

Think about what OpenRouter primarily traffics in and what you can do with that raw material.

Nah I agree with you, as someone who's read a lot of Asimov. As far as MULTIVAC stories go, I always preferred "All The Troubles of the World" (https://schools.ednet.ns.ca/avrsb/070/rsbennett/HORTON/short...).


The point is that open weights turns puts inference on the open market, so if your model is actually good and providers want to serve it, it will drive costs down and inference speeds up. Like Cerebras running Qwen 3 235B Instruct at 1.4k tps for cheaper than Claude Haiku (let that tps number sink in for a second. For reference, Claude Opus runs ~30-40 tps, Claude Haiku at ~60. Several orders of magnitude difference). As a company developing models, it means you can't easily capture the inference margins even though I believe you get a small kickback from the providers.

So I understand why they wouldn't want to go open weight, but on the other hand, open weight wins you popularity/sentiment if the model is any good, researchers (both academic and other labs) working on your stuff, etc etc. Local-first usage is only part of the story here. My guess is Qwen 3.5 was successful enough that now they want to start reaping the profits. Unfortunately most of Qwen 3.5's success is because it's heavily (and successfully!) optimized for extremely long-context usage on heavily constrained VRAM (i.e. local) systems, as a result of its DeltaNet attention layers.


It is partly the medium's fault. A lot of the sins of CD/digital mastering wont fly on vinyl because there's physical constraints around what you can literally press into the record groove.


I've had a Framework 13 for several years now, so I'm excited to see this kind of thing start to happen. Praying the next one out is a GPU/tensor workload unit so I'm not stuck at home on my desktop when I want to mess around with local AI models...


Love it, they're teaching LLMs how to skim texts properly, which is exactly the right approach for handling long contexts.


wasn't this the attention sink concept to some degree? I mean it doesn't seem out of the realm of possibility that if the latency overhead isn't signifigant, that frontier models start adopting similar to DeepSeek OCR tech


I see a lot of debate about whether AI is "conscious" or has "consciousness", with people talking past each other without a firm grasp on their own stances, so I made a quick quiz to help you locate where you stand.


I prefer to write blog posts in markdown, especially if I'm on the go, and historically I've manually reformatted it in Substack's editor. Figured, why not automate this? So I built a simple utility. Everything happens in your browser with the `marked` library.

Most important feature for me is converting from ASCII to "fancy" quotes and apostrophes, which Substack inserts automatically in its editor. Some more advanced features are probably broken, because Substack's editor is a bit wonky, but the basic stuff all works. Give it a try!

And here's the source: https://github.com/simpolism/md-to-substack


Yeah. I noticed a lot of "It's not just X. It's Y." which is the biggest tell for me.


I think that is extremely common in adverts (most famously here, the M&S "It's not just bread. It's our stone-baked, hand-shaped artisanally-molested bread"), and narrative media like that distinctive kind of journalism written like a storybook, and which (I think) then bled into popular media like true-crime podcasts: "It wasn't just Tuesday. It was the last Tuesday he would ever see. <intro music>".

Also this kind of short sentence construction is used in the incredibly annoying and pervasive style of headlines for opinion pieces: X is Y. And it's Z. (where Z is often "not OK" or "OK").

I assume all this overuse is where LLMs picked it up and weighted it highly.


Is this itself an AI generated comment? The word "just" appears 1 time in the article.


The term “not just” doesn’t appear in the text.


I think it was a simplified example. The exact text is:

> What emerged wasn't just a unique identifier. It was a compressed database record


Copy paste? Use something like desktop commander and just let it edit the files for you directly. It'll even run commands to test it out. Or go further and use Cline/RooCode and if you're building a webapp it'll load your page in a small browser, screenshot the contents, and send that to the model. The copy-paste stuff is beginner mode.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: