More

rvz · 2026-03-23T11:16:54 1774264614

Ever since Microsoft's acquisition of GitHub 8 years ago, GitHub has completely enshittified and has become so unreliable, that even self-hosting a Git repository or self-hosted actions yourself would have a far better uptime than GitHub.

This sounded crazy in 2020 when I said that in [0]. Now it doesn't in 2026 and many have realized how unreliable GitHub has become.

If there was a prediction market on the next time GitHub would have at least one major outage per week, you would be making a lot of money since it appears that AI chatbots such as Tay.ai, Zoe and Copilot are somewhat in charge of wrecking the platform.

Any other platform wouldn't tolerate such outages.

[0] https://news.ycombinator.com/item?id=22867803

rvz · 2026-03-23T10:04:47 1774260287

Also having to wait for ChatGPT for a "thinking" response to search for information that is slower than a Google search loses them lots of money.

I believe that it can still work and I won't claim about being unsurprised about this failure. But this is a great opportunity to execute this problem really well if OpenAI and others are not interested in getting good at this.

Perplexity also attempted this, got sued by Amazon and it appears semi-abandoned.

The only problem is that it must be quicker or just as quick as a Google search, and also compatible with the existing checkout flows.

TeMPOraL · 2026-03-23T10:23:57 1774261437

> Perplexity also attempted this, got sued by Amazon and it appears semi-abandoned.

Any details on that? I feel the answer is more likely there than in "friction".

Hardly any purchase of consequence is so sensitive to friction that the difference between Google Search and an LLM response matters (especially that in reality, we're talking 20+ manual searches per one LLM response). I.e. I'm not going to use LLMs advise on some random 0-100$ purchase anyway, and losing #$ on a ##$ purchase due to suboptimal choice is not that big of a deal - but I absolutely am going to consult it (and have it compile tables and verify sources) on a $500+ purchase and for those I can afford spending few more minutes on research (or rather few hours less, compared of doing it the usual way).

rvz · 2026-03-23T07:58:49 1774252729

Probably vibe-coded their infrastructure.

Many such cases.

rvz · 2026-03-22T20:11:11 1774210271

The security issues in OpenClaw is not even the main issue, the hype will die if there is no monetary incentive. Like I said before:

If you are spending more money on tokens than the agents are making you money (or not), then it is unfortunately all for nought.

The question is, who is making money on using Openclaw other than hosting?

nickthegreek · 2026-03-22T20:35:44 1774211744

$10/month minimax using m2.7 and openai-codex oauth $20/month will allow you to mess around with this stuff for negligible cost.

rvz · 2026-03-22T16:55:42 1774198542

This was obvious to those who value their time over the job given to them and all the office politics, performative meetings and the blame-game that comes with it.

rvz · 2026-03-22T16:06:08 1774195568

This firing is going to "backfire" in ultra-wide 4K.

ffsm8 · 2026-03-22T16:11:26 1774195886

i doubt it, honestly. atlassian is too deeply ingrained in big corpo with jira and confluence.

this controversy will not have enough steam behind it to affect hteir bottom line whatsoever

leereeves · 2026-03-22T16:27:18 1774196838

It might not affect their bottom line or even how customers feel about them, but I think it will affect current and future employees.

rwmj · 2026-03-22T16:27:38 1774196858

Their garbage software hasn't hurt them, it's unlikely that one developer being fired will make any difference.

nunez · 2026-03-22T17:50:40 1774201840

It happened in 2023.

rvz · 2026-03-22T15:11:20 1774192280

The "tech jobs" you are looking for are actually potemkin ghost jobs that are never going to be filled and are only there to give no signal to market traders and analysts whether if the company is hiring or not.

nirvanist · 2026-03-22T15:12:35 1774192355

I think it s a crime, playing with people lives

jfil · 2026-03-22T22:20:22 1774218022

Here in Ontario, as of January 2026 it is a crime. There is a nice set of laws around job postings that kicked into effect.

nirvanist · 2026-03-22T23:33:14 1774222394

nice, thank you for the information, Quebec we should be inspired by that.

rvz · 2026-03-22T14:21:19 1774189279

This would really work well for teams. Are there any limits into how many people can collaborate on Revise?

artursapek · 2026-03-22T14:23:18 1774189398

No enforced limits right now, but HN might find the performance bounds of my backend today. I am planning to add team/org accounts soon!

rvz · 2026-03-22T12:16:09 1774181769

The technical write up is great, but Mac users should not get too excited just yet on running 300B+ parameter models locally as the TPS isn't that good.

>...at 4.4+ tokens/second

That is even when it is using 4-bit quantization and it is still at that speed.

> The entire 209GB model streams from SSD through a custom Metal compute pipeline.

This is my main problem.

If I were to run this on a Mac SSD, 24/7 for heavy usage such as Openclaw, that is going to significantly reduce the lifetime of the SSD.

Can't imagine using this in the long term right now, but improvements will follow. Still a great write up anyways.

Roxxik · 2026-03-22T12:25:17 1774182317

Does an SSD meaningfully degrade by read only workloads?

JSR_FDED · 2026-03-22T12:29:59 1774182599

Nope, reads don’t cause wear

zozbot234 · 2026-03-22T13:43:07 1774186987

No appreciable wear of course, but read disturb (requiring occasional rewrites) becomes more of an issue as NAND fabrication advances.

etiam · 2026-03-22T12:34:59 1774182899

> If I were to run this on a Mac SSD, 24/7 for heavy usage such as Openclaw, that is going to significantly reduce the lifetime of the SSD.

How sure are you about that? I've never looked closer at how a large LLM with mixture of experts architecture switches between expert modules, but staying on roughly the same topic for the use (as it often would when editing the same codebase), I wouldn't be surprised to see the switches of composition are fairly rare, fairly small, and to the extent it happens it's repeated reads from the flash disk rather than writes it tends to cause.

frotaur · 2026-03-22T12:50:57 1774183857

Afaik the experts are not usually very interpretable, and generally would be surprised if at least one does not change every token. I don't know what happens in practice, but I know at least during training, nothing is done to minimize the number of expert switches between tokens.

etiam · 2026-03-22T21:14:18 1774214058

I'd have thought at least a tiny explicit penalty term for switching, to discourage messing around with the composition without any expected gains from it.

If one is to use these on hardware that can't keep everything loaded I guess someone should examine how it works out in practice. Interpretability may be be a too much to ask, but I can't spontaneously see any reason why the experts can't at least be pushed to incorporate what's needed to remain the good choice for a longer segment.

zozbot234 · 2026-03-22T21:27:04 1774214824

The switching is done by layer, not just per token. Every layer is loading completely different parameters, you don't really benefit from continuity. You're generally better off shifting this work to the CPU, since CPU RAM is more abundant than the GPU's VRAM hence it matters less that so much of it is "wasted" on inactive expert layers. Disk storage is even more relatively abundant, so offloading experts to disk if you can't keep them in RAM (as OP does) is the next step.

Wowfunhappy · 2026-03-22T13:07:23 1774184843

Eh. I mean, 4 tokens a second works fine if you're patient. Go do something else while you wait.

I feel like whenever I'm trying to find information on which local models will work on my hardware, I have to overestimate because people don't know how to wait for things.

Also, reading data doesn't cause SSD wear.

hrmtst93837 · 2026-03-22T12:38:50 1774183130

If you want decent throughput and do not care about burning SSD write cycles on a box that was never meant to act like a tiny inference server, a used server with actual RAM is still the cheaper and less silly option. I woudn't expect Apple's warranty team to be much help.

K0balt · 2026-03-22T13:02:53 1774184573

Is it doing a bunch of ssd writes?

mkw · 2026-03-22T16:00:16 1774195216

stream from the SSD, perform the calculation, discard, repeat

rvz · 2026-03-22T11:44:35 1774179875

From "code" to "no-code" to "vibe coding" and back to "code".

What you are seeing here is that many are attempting to take shortcuts to building production-grade maintainable software with AI and now realizing that they have built their software on terrible architecture only to throw it away, rewriting it with now no-one truly understanding the code or can explain it.

We have a term for that already and it is called "comprehension debt". [0]

With the rise of over-reliance of agents, you will see "engineers" unable to explain technical decisions and will admit to having zero knowledge of what the agent has done.

This is exactly happening to engineers at AWS with Kiro causing outages [1] and now requiring engineers to manually review AI changes [2] (which slows them down even with AI).

[0] https://addyosmani.com/blog/comprehension-debt/

[1] https://www.theguardian.com/technology/2026/feb/20/amazon-cl...

[2] https://www.ft.com/content/7cab4ec7-4712-4137-b602-119a44f77...

suzzer99 · 2026-03-22T17:34:05 1774200845

> With the rise of over-reliance of agents, you will see "engineers" unable to explain technical decisions and will admit to having zero knowledge of what the agent has done.

I've had to work on multiple legacy systems like this where the original devs are long gone, there's no documentation, and everyone at the company admits it's complete mess. They send you off with a sympathetic, "Good luck, just do the best you can!"

I call it "throwing dye in the water." It's the opposite of fun programming.

On the other hand, it often takes creativity and general cleverness to get the app to do what you want with minimally-invasive code changes. So it should be the hardest for AI.

Insanity · 2026-03-22T17:42:41 1774201361

While I agree with everything you said, Amazon’s problems aren’t just Kiro messing up. It’s a brain drain due to layoffs, and then people quitting because of the continuous layoff culture.

While publicly they might say this is AI driven, I think that’s mostly BS.

Anyway, that doesn’t take away from your point, just adds additional context to the outages.

zer00eyz · 2026-03-22T17:44:41 1774201481

> We have a term for that already and it is called "comprehension debt".

This isn't any different than the "person who wrote it already doesn't work here any more".

> now requiring engineers to manually review AI changes [2] (which slows them down even with AI).

What does this say about the "code review" process if people cant understand the things they didn't write?

Maybe we have had the wrong hiring criteria. The "leet code", brain teaser (FAANG style) write some code interview might not have been the best filter for the sorts of people you need working in your org today.

Reading code, tooling up (debuggers, profilers), durable testing (Simulation, not unit) are the skill changes that NO ONE is talking about, and we have not been honing or hiring for.

No one is talking about requirements, problem scoping, how you rationalize and think about building things.

No one is talking about how your choice of dev environment is going to impact all of the above processes.

I see a lot of hype, and a lot of hate, but not a lot of the pragmatic middle.

legulere · 2026-03-22T22:27:34 1774218454

> This isn't any different than the "person who wrote it already doesn't work here any more".

It is very different. With empathy you can often deduct why people wrote code the way they did. With LLMs there often is no reason.

xienze · 2026-03-22T17:51:42 1774201902

> This isn't any different than the "person who wrote it already doesn't work here any more".

Yeah but that takes years to play out. Now developers are cranking out thousands of lines of “he doesn’t work here anymore” code every day.

zer00eyz · 2026-03-22T18:06:59 1774202819

> Yeah but that takes years to play out.

https://www.invene.com/blog/limiting-developer-turnover has some data, that aligns with my own experience putting the average at 2 years.

I have been doing this a long time: my longest running piece of code was 20 years. My current is 10. Most of my code is long dead and replaced because businesses evolve, close, move on. A lot of my code was NEVER ment to be permanent. It solved a problem in a moment, it accomplished a task, fit for purpose and disposable (and riddled with cursing, manual loops and goofy exceptions just to get the job done).

Meanwhile I have seen a LOT of god awful code written by humans. Business running on things that are SO BAD that I still have shell shock that they ever worked.

AI is just a tool. It's going from hammers to nail guns. The people involved are still the ones who are ultimately accountable.