More

jdiff · 2026-06-13T23:59:27 1781395167

That shouldn't be used to judge other models - it's never been true for Grok.

jdiff · 2026-06-13T23:42:38 1781394158

Smear frames are not something often applicable in this kind of animation. Smear frames are just about specific to frame-by-frame animation. No smear frames are demonstrated in this article.

jdiff · 2026-06-13T17:44:40 1781372680

You're thinking of smear frames. Squash and stretch are animation techniques that are perfectly coherent. Smear frames as well contribute to an overall coherent animation. They're a counterpoint to the general idea put forward in this article, but it's also rarely ever relevant to this type of animation.

jdiff · 2026-06-12T21:23:50 1781299430

I have a coworker who, when he needs to operate some software that is unfamiliar to him, snaps a photo of it and has Gemini AI read each label and description. If there is a checklist or form that needs to be filled, Gemini reads each question.

There's only one of him, not 40% of my coworkers, but these people can be employed and maintain employment.

jdiff · 2026-06-12T17:34:10 1781285650

It's been happening for some time now. I saw this pop up over 6 months ago. After searching I can find some dating back at least a year.

deadbabe · 2026-06-12T17:42:35 1781286155

Even more reason for post-mortem.

jdiff · 2026-06-12T15:31:00 1781278260

LLMs don't make a good A-frame, nor would I classify them as wood-like. People propose LLMs as solutions as if they're wooden when they're teetering contraptions of metal rods, aluminum extrusions, rubber bands, and duct tape. That can do the trick. It can't be relied on to fail reliably like a single solid material like wood.

jdiff · 2026-06-12T13:17:39 1781270259

From elsewhere in the thread, some hard numbers on the topic. https://formulae.brew.sh/analytics/homebrew-os-arch-ci/30d/

Intel homebrew is larger than Linuxbrew, yet I think it'd be shocking if they dropped support for Linuxbrew.

Old machines still work. They're still deeply useful. I'm still using daily an Intel Macbook with homebrew on it. When I no longer use it daily in some years more, it'll still make a perfect server.

jdiff · 2026-06-12T11:40:43 1781264443

Had a little luck with having it do an impression of the Star Trek computer, although at the cost of having it try to insert star-trek themed hallucinations like warp engine status.

jdiff · 2026-06-12T11:28:57 1781263737

We already have that in the form of separate reasoning/thinking and speaking streams. Even with that it's awfully hard to get LLMs to keep it consistently concise. As soon as that context window starts growing it falls right back into verbosity without constant nudges back.

Terr_ · 2026-06-12T16:41:33 1781282493

Right, I often bring up the film noir analogy for "reasoning" models, it's satisfying, like the revelation when a magic trick is explained, and many oddly disconnected questions about "why the scarf" or "where does the assistant go" all become sensible at once.

On a practical level, I believe more developers and adopters need these magic tricks spoiled, because otherwise they'll build a lot of important stuff on top of the idea that magic-is-real, leading to various forms of suffering in the long run.

That said, I'm no LLM / math academic, so if I'm totally wrong on the the trick, I'd like to know what needs revising.

jdiff · 2026-06-09T02:20:16 1780971616

OpenCode seemed perfectly workable as a programming assistant. As personal assistants, they all fall short. It's too difficult to really shape their output.

I was briefly impressed with OpenClaw a few times, but ultimately was turned off by not being able to get the models to stop being so damnably verbose. I thought I made progress for a while by having it tweak its soul, iterate, switch models, iterate, switch models, fuse the results, iterate... but ultimately it's all forgotten early in each session. And then one day it killed itself by rebuilding the container it was inside.

Hermes apparently has some plagiarism issues they're trying to cover up [0] and I was deeply unimpressed with their janky, flickery CLI that force-enables a bulky obnoxious header on every launch. Hermes did readily dive into its own source code and did readily confirm that there was no way to disable it. So that's neat. It constantly (wants to) run from upstream master which is unsettling.

Nanoclaw and nanobot seemed fine, but not notably different. There were some common bugs and glitches that caused some minor data loss while configuring nanobot. After that I just deciding to start hacking my own together.

What I really want in a harness is being able to actually control and rewrite the entire context window, like Zed's Text Threads before they obnoxiously and inexplicably removed what, to me, was their most powerful and distinguishing feature.

[0] https://github.com/NousResearch/hermes-agent/issues/10232

KennyBlanken · 2026-06-09T02:49:06 1780973346

The project in question that is asserting Hermes is copying their ideas:

https://github.com/EvoMap/evolver

The timeline here is pretty telling and it looks like Hermes basically points their coding agent at evolver and says "reimplement this yourself." A few days later Hermes magically sports a nearly identical feature.

https://evomap.ai/blog/hermes-agent-evolver-similarity-analy...

jonaustin · 2026-06-09T16:26:20 1781022380

The handling of that issue looks _incredibly_ shady.

Not touching this project with a ten foot pole..