polyrand · 2026-03-04T21:14:08 1772658848

I find this a very exciting release. I was actually hoping we would somehow get macOS on mobile 'A' chips some day. And I think this is better than putting 'M' chips on an iPad.

My iPad with an 'M1' chip actually consumes more battery than much older iPads when both are locked and with the screen off. I ended up figuring it was probably because, in the 'M' chip, the lowest possible energy usage is way higher than in the 'A' chip. So even small background wake-ups used more energy.

I'm still hoping one day we have an iPad with macOS.

polyrand · 2026-02-07T16:47:08 1770482828

I do this, and it's a huge quality of life improvement. Not so much because of shadowing existing binaries, but for better command auto-complete. For example: I have a bunch of tmux utilities and all start with `,t` which is not a polluted command-name prefix compared to just `t`.

But I'm now facing the problem that LLM agents don't like this, and when I instruct them to run certain tools, they remove the leading comma. It's normally fixed with one extra sentence in the prompt, but still inconvenient.

polyrand · 2026-02-05T21:59:11 1770328751

> a period of inefficiency

I think this is something people ignore, and is significant. The only way to get good at coding with LLMs is actually trying to do it. Even if it's inefficient or slower at first. It's just another skill to develop [0].

And it's not really about using all the plugins and features available. In fact, many plugins and features are counter-productive. Just learn how to prompt and steer the LLM better.

[0]: https://ricardoanderegg.com/posts/getting-better-coding-llms...

polyrand · 2026-01-26T10:26:39 1769423199

I share the same feeling. I waited as much as possible to upgrade to iOS 26 / macOS Tahoe.

Two days ago, I finally upgraded. Liquid Glass is one of the worst things I've ever seen in terms of design. It reminds me of when I personalized old cheap android phones or Linux distros just "to look cool". Cool-looking: yes. Unusable: also yes. Tasteful design: almost absent.

Just the increase of the border-radius in all elements makes it hideous. Apps with a search bar on a scrollable list look like a CSS bug when the search bar is on top of the elements. Neither the search bar nor the element underneath are visible. Although this applies to most transparency effects on Liquid Glass. Neither the elements above nor below the "glass" are visible. And the extra value added is zero.

The thing is, I can still adapt to it, or tweak transparency and contrast. But I've seen elderly relatives struggle just because WhatsApp decided to add the "Meta AI" floating button. I can't imagine what these "inaccessible" UI changes can do.

freehorse · 2026-01-26T10:51:30 1769424690

It is the first time I am trying to skip a macos version. I really hope in macos27 they will fix things. I used to skip every second windows version, so back here we are.

inatreecrown2 · 2026-01-26T11:53:21 1769428401

same here. using mac since OS9, and Tahoe is the first time i skipped a version (downgrade after 2 months)

polyrand · 2026-01-24T21:08:42 1769288922

Don't forget that if you're using SQLite on something like EBS, multiple queries may not be efficient.

I'm saying this as a huge SQLite fan, but also beware of what kind of storage you're using in your instance.

andersmurphy · 2026-01-25T18:45:05 1769366705

Yeah, you really want NVMe drives directly attached to your machine/VPS. It can make an orders-of-magnitude difference.

polyrand · 2026-01-19T17:51:56 1768845116

I've been using z.ai models through their coding plan (incredible price/performance ratio), and since GLM-4.7 I'm even more confident with the results it gives me. I use it both with regular claude-code and opencode (more opencode lately, since claude-code is obviously designed to work much better with Anthropic models).

Also notice that this is the "-Flash" version. They were previously at 4.5-Flash (they skipped 4.6-Flash). This is supposed to be equivalent to Haiku. Even on their coding plan docs, they mention this model is supposed to be used for `ANTHROPIC_DEFAULT_HAIKU_MODEL`.

RickHull · 2026-01-19T18:41:32 1768848092

Same, I got 12 months of subscription for $28 total (promo offer), with 5x the usage limits of the $20/month Claude Pro plan. I have only used it with claude code so far.

theshrike79 · 2026-01-21T07:58:41 1768982321

This offer was so stupid cheap there was no point in NOT getting :D

stogot · 2026-01-19T20:38:13 1768855093

Do they still have that promo offer?

Mashimo · 2026-01-19T21:39:13 1768858753

Looks like they have something for 29 USD with 3x the claude code usage: https://z.ai/subscribe

victorbjorklund · 2026-01-19T21:53:16 1768859596

How has the performance been lately? I heard some people say that they change their limits likely making it almost not useable

chewz · 2026-01-19T23:52:17 1768866737

Never had any problems with Z.ai models.

However they are using more thinking internally and that makes them seem slow.

polyrand · 2025-12-24T15:59:26 1766591966

Not sure about the impact of these, I guess it depends on the context where this engine is used. But there already seem to be exploits for the engine:

https://x.com/itszn13/status/2003707921679679563

https://x.com/itszn13/status/2003808443761938602

polyrand · 2025-12-22T22:29:19 1766442559

A few comments mentioning distillation. If you use claude-code with the z.ai coding plan, I think it quickly becomes obvious they did train on other models. Even the "you're absolutely right" was there. But that's ok. The price/performance ratio is unmatched.

hashbig · 2025-12-23T04:54:20 1766465660

I had Gemini 3 Flash hit me this morning with "you're absolutely right" when I corrected it on a mistake it made. It's not conclusive of anything.

polyrand · 2025-12-23T06:13:00 1766470380

That's interesting, thanks for sharing!

It's a pattern I saw more often with claude code, at least in terms of how frequently it says it (much improved now). But it's true that just this pattern alone is not enough to infer the training methods.

theptip · 2025-12-23T06:26:58 1766471218

Or it’s conclusive of an even broader trend!

ljosifov · 2025-12-23T10:12:27 1766484747

I imagine - and sure hope so - everyone trains on everything else. Distillation - ofc if one has bigger/other models providing true posterior token probabilities in the (0,1) interval (a number between 0 and 1), rather than 1-hot-N targets that are '0 for 200K-sans-this-token, and 1 for the desired output token' - one should use the former instead of the latter. It's amazing how such a simple and straightforward idea should face so much resistance (paper rejected) and from the supposedly most open minded and devoted to knowing (academia) and on the wrong grounds ('will have no impact on industry'; in fact - it's had tremendous impact on industry; better rejection wd have been 'duh it is obvious'). We are not trying to torture the model and the gpu cluster to be learning from 0 - when knowledge is already available. :-)
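
To make the soft-vs-hard target point concrete, here is a toy sketch in plain Python. The 4-token vocabulary and all probabilities are made up for illustration, and this is only the cross-entropy term, not a full distillation recipe. It shows that a one-hot target only looks at the student's probability for the observed token, while a teacher distribution also penalizes how the student spreads mass over the other tokens.

```python
import math


def cross_entropy(target_probs, student_probs):
    """Cross-entropy H(target, student) over a shared vocabulary."""
    return -sum(
        t * math.log(s)
        for t, s in zip(target_probs, student_probs)
        if t > 0  # zero-weight tokens contribute nothing
    )


# Hard target: 1 for the observed token, 0 for every other token.
one_hot = [1.0, 0.0, 0.0, 0.0]

# Soft target: a (made-up) teacher distribution over the same vocabulary.
teacher = [0.70, 0.20, 0.08, 0.02]

# Two (made-up) students that agree on the observed token
# but rank the remaining tokens differently.
student_a = [0.60, 0.25, 0.10, 0.05]
student_b = [0.60, 0.05, 0.10, 0.25]

hard_a = cross_entropy(one_hot, student_a)
hard_b = cross_entropy(one_hot, student_b)
soft_a = cross_entropy(teacher, student_a)
soft_b = cross_entropy(teacher, student_b)

print(f"hard: a={hard_a:.4f} b={hard_b:.4f}")
print(f"soft: a={soft_a:.4f} b={soft_b:.4f}")
```

The hard loss cannot tell the two students apart; the soft loss prefers the one whose ranking of the other tokens is closer to the teacher's.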

Havoc · 2025-12-23T01:23:02 1766452982

>Even the "you're absolutely right" was there.

I don't think that's particularly conclusive for training on other models. Seems plausible to me that the internet data corpus simply converges on this hence multiple models doing this.

...or not...hard to tell either way.

polyrand · 2025-12-22T09:35:52 1766396152

I enjoyed the post. I was about to link the "Let Me Speak Freely" paper and "Say What You Mean" response from dottxt, but that's already been posted in the comments.

I'm a huge fan of structured outputs, but also recently started splitting both steps, and I think it has a bunch of upsides normally not discussed:

1. Separate concerns: schema validation errors don't invalidate the whole LLM response. If the only error is in generating schema-compliant tokens (something I've seen frequently), retries are much cheaper.

2. Having the original response as free text AND the structured output has value.

3. In line with point 1, it allows using a more expensive (reasoning) model for free-text generation, then a smaller model like gemini-2.5-flash to convert the outputs to structured text.
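
A rough sketch of the split, assuming each model is just a callable that takes a prompt and returns text. The `Model` type, `answer_then_structure`, and the fake models below are hypothetical names for illustration, not any provider's API; plug in whatever client you actually use.

```python
import json
from typing import Callable

# Hypothetical interface: prompt in, model text out.
Model = Callable[[str], str]


def answer_then_structure(
    question: str,
    reasoning_model: Model,
    formatting_model: Model,
    required_keys: set,
    max_format_retries: int = 3,
):
    # Step 1: one call to the expensive model, free text only.
    free_text = reasoning_model(question)

    # Step 2: the cheap model converts the text to JSON.
    # A schema failure only repeats this step, never step 1.
    prompt = (
        "Convert the following answer to a JSON object with keys "
        f"{sorted(required_keys)}:\n\n{free_text}"
    )
    for _ in range(max_format_retries):
        raw = formatting_model(prompt)
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue
        if isinstance(data, dict) and required_keys.issubset(data):
            # Keep both the free text and the structured output.
            return free_text, data
    raise ValueError("no schema-compliant output after retries")


# Fake models so the sketch runs without any network access.
calls = {"reasoning": 0, "formatting": 0}


def fake_reasoning(prompt: str) -> str:
    calls["reasoning"] += 1
    return "The answer is 42."


def fake_formatting(prompt: str) -> str:
    calls["formatting"] += 1
    # First attempt is deliberately not valid JSON.
    return "not json" if calls["formatting"] == 1 else '{"answer": 42}'


text, data = answer_then_structure(
    "What is the answer?", fake_reasoning, fake_formatting, {"answer"}
)
print(text, data, calls)
```

With the fakes above, the formatting step is retried once while the reasoning step runs only once, which is the cost argument from point 1.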

polyrand · 2025-12-11T11:33:07 1765452787

A frozen dictionary would be very welcome. You can already do something similar using MappingProxyType [0]

  from types import MappingProxyType
  
  d = {}
  
  d["a"] = 1
  d["b"] = 2
  
  print(d)
  
  frozen = MappingProxyType(d)
  
  print(frozen["a"])
  
  # Raises TypeError: 'mappingproxy' object does not support item assignment
  frozen["b"] = "new"

[0]: https://docs.python.org/3/library/types.html#types.MappingPr...

zahlman · 2025-12-11T13:27:04 1765459624

> You can already do something similar

Only if you deny access to the underlying real dict.

ali_m · 2025-12-11T13:53:47 1765461227

Yes, this only prevents the callee from mutating it. It can't provide a strong guarantee that the underlying mapping won't be changed upstream (and hence MappingProxyType can't be hashable).
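
A minimal sketch of both caveats, using only the standard library (variable names are just for illustration): the proxy is a read-only view, so changes made through the real dict still show up, and a proxy over a plain dict can't be hashed.

```python
from types import MappingProxyType

underlying = {"a": 1}
frozen = MappingProxyType(underlying)

# Writing through the proxy is rejected.
try:
    frozen["b"] = 2
    write_blocked = False
except TypeError:
    write_blocked = True

# But whoever still holds the real dict can change it,
# and the proxy reflects that change.
underlying["b"] = 2
seen_through_proxy = frozen["b"]

# A proxy over a plain dict cannot be used as a dict key or set member.
try:
    hash(frozen)
    hashable = True
except TypeError:
    hashable = False

print(write_blocked, seen_through_proxy, hashable)
```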