
Okay, I'll be honest: I was really hyped about this model, but then I went to r/LocalLLaMA and saw that the:

120B model is worse at coding than Qwen3 Coder, GLM 4.5 Air, and even Grok 3... (https://www.reddit.com/r/LocalLLaMA/comments/1mig58x/gptoss1...)



Qwen3 Coder is 4x its size! Grok 3 is over 22x its size!

What does the resource usage look like for GLM 4.5 Air? Is that benchmark in FP16? GPT-OSS-120B will be using between 1/4 and 1/2 the VRAM that GLM-4.5 Air does, right?
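Back-of-envelope for the weights alone (a rough sketch; the ~117B-param/MXFP4 figures for GPT-OSS-120B and ~106B for GLM-4.5-Air are my assumptions from the public model cards, and this ignores KV cache and activations):

  # Weight-only memory estimate in GB; parameter counts and
  # bit widths are assumptions, not exact figures.
  def weight_gb(params_billion, bits_per_param):
      return params_billion * bits_per_param / 8

  gpt_oss  = weight_gb(117, 4.25)  # MXFP4 is ~4.25 bits/param
  glm_fp16 = weight_gb(106, 16)
  glm_int8 = weight_gb(106, 8)

  print(f"gpt-oss-120b: ~{gpt_oss:.0f} GB")
  print(f"GLM-4.5-Air FP16: ~{glm_fp16:.0f} GB, ratio {gpt_oss / glm_fp16:.2f}")
  print(f"GLM-4.5-Air INT8: ~{glm_int8:.0f} GB, ratio {gpt_oss / glm_int8:.2f}")

That works out to roughly 62 GB vs. 212 GB (0.29x) at FP16 and 106 GB (0.59x) at 8-bit, so "between 1/4 and 1/2" is about the right ballpark.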

It seems like a good showing to me, even though Qwen3 Coder and GLM 4.5 Air might be preferable for some use cases.


It's only got around 5 billion active parameters; it'd be a miracle if it was competitive at coding with SOTA models that have significantly more.


On this bench it underperforms vs glm-4.5-air, which is an MoE with fewer total params but more active params.
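Rough numbers, since per-token decode compute scales with active rather than total params (using the ~2N FLOPs/token rule of thumb; the parameter figures below are my assumptions from the public model cards):

  # Per-token FLOPs scale with *active* parameters (~2 * N_active).
  # Parameter counts are assumptions from the public model cards.
  models = {
      "gpt-oss-120b": {"total_b": 117, "active_b": 5.1},
      "glm-4.5-air":  {"total_b": 106, "active_b": 12.0},
  }
  for name, m in models.items():
      flops = 2 * m["active_b"] * 1e9
      print(f"{name}: {m['active_b']}B active / {m['total_b']}B total, "
            f"~{flops:.1e} FLOPs/token")

So GLM-4.5-Air spends roughly 2.4x the compute per generated token despite having fewer total params.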


That's SVGBench, which is a useful benchmark but isn't much of a test of general coding.


Hm, alright. I'll actually play around with this model myself instead of forming quick opinions.

Thanks.



