Hacker Newsnew | past | comments | ask | show | jobs | submit | jdiff's commentslogin

That shouldn't be used to judge other models - it's never been true for Grok.

Smear frames are not something often applicable in this kind of animation. Smear frames are just about specific to frame-by-frame animation. No smear frames are demonstrated in this article.

You're thinking of smear frames. Squash and stretch are animation techniques that are perfectly coherent. Smear frames as well contribute to an overall coherent animation. They're a counterpoint to the general idea put forward in this article, but it's also rarely ever relevant to this type of animation.

I have a coworker who, when he needs to operate some software that is unfamiliar to him, snaps a photo of it and has Gemini AI read each label and description. If there is a checklist or form that needs to be filled, Gemini reads each question.

There's only one of him, not 40% of my coworkers, but these people can be employed and maintain employment.


It's been happening for some time now. I saw this pop up over 6 months ago. After searching I can find some dating back at least a year.

Even more reason for post-mortem.

LLMs don't make a good A-frame, nor would I classify them as wood-like. People propose LLMs as solutions as if they're wooden when they're teetering contraptions of metal rods, aluminum extrusions, rubber bands, and duct tape. That can do the trick. It can't be relied on to fail reliably like a single solid material like wood.

From elsewhere in the thread, some hard numbers on the topic. https://formulae.brew.sh/analytics/homebrew-os-arch-ci/30d/

Intel homebrew is larger than Linuxbrew, yet I think it'd be shocking if they dropped support for Linuxbrew.

Old machines still work. They're still deeply useful. I'm still using daily an Intel Macbook with homebrew on it. When I no longer use it daily in some years more, it'll still make a perfect server.


Had a little luck with having it do an impression of the Star Trek computer, although at the cost of having it try to insert star-trek themed hallucinations like warp engine status.

We already have that in the form of separate reasoning/thinking and speaking streams. Even with that it's awfully hard to get LLMs to keep it consistently concise. As soon as that context window starts growing it falls right back into verbosity without constant nudges back.

Right, I often bring up the film noir analogy for "reasoning" models, it's satisfying, like the revelation when a magic trick is explained, and many oddly disconnected questions about "why the scarf" or "where does the assistant go" all become sensible at once.

On a practical level, I believe more developers and adopters need these magic tricks spoiled, because otherwise they'll build a lot of important stuff on top of the idea that magic-is-real, leading to various forms of suffering in the long run.

That said, I'm no LLM / math academic, so if I'm totally wrong on the the trick, I'd like to know what needs revising.


OpenCode seemed perfectly workable as a programming assistant. As personal assistants, they all fall short. It's too difficult to really shape their output.

I was briefly impressed with OpenClaw a few times, but ultimately was turned off by not being able to get the models to stop being so damnably verbose. I thought I made progress for a while by having it tweak its soul, iterate, switch models, iterate, switch models, fuse the results, iterate... but ultimately it's all forgotten early in each session. And then one day it killed itself by rebuilding the container it was inside.

Hermes apparently has some plagiarism issues they're trying to cover up [0] and I was deeply unimpressed with their janky, flickery CLI that force-enables a bulky obnoxious header on every launch. Hermes did readily dive into its own source code and did readily confirm that there was no way to disable it. So that's neat. It constantly (wants to) run from upstream master which is unsettling.

Nanoclaw and nanobot seemed fine, but not notably different. There were some common bugs and glitches that caused some minor data loss while configuring nanobot. After that I just deciding to start hacking my own together.

What I really want in a harness is being able to actually control and rewrite the entire context window, like Zed's Text Threads before they obnoxiously and inexplicably removed what, to me, was their most powerful and distinguishing feature.

[0] https://github.com/NousResearch/hermes-agent/issues/10232


The project in question that is asserting Hermes is copying their ideas:

https://github.com/EvoMap/evolver

The timeline here is pretty telling and it looks like Hermes basically points their coding agent at evolver and says "reimplement this yourself." A few days later Hermes magically sports a nearly identical feature.

https://evomap.ai/blog/hermes-agent-evolver-similarity-analy...


The handling of that issue looks _incredibly_ shady.

Not touching this project with a ten foot pole..


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: