More

theredbeard · 2026-03-27T07:01:08 1774594868

We haven’t been inching closer to users writing a half-decent ticket in decades though.

fhub · 2026-03-27T10:12:22 1774606342

Solutions like https://bugherd.com/ might make the issue context capture part more accurate.

aembleton · 2026-03-27T08:05:46 1774598746

Maybe the agent can ask the user clarifying questions. Even better if it could do it at the point of submission.

theredbeard · 2026-03-27T06:58:11 1774594691

This is because for some reason all agentic systems think that slapping cron on it is enough, but that completely ignores decades of knowledge about prospective memory. Take a look at https://theredbeard.io/blog/the-missing-memory-type/ for a write-up on exactly that.

theredbeard · 2026-03-26T16:45:44 1774543544

What in the ChatGPT is this?

theredbeard · 2026-03-25T19:14:06 1774466046

I’m getting pretty decent at spotting LLM text. This doesn’t contain the obvious tells at least.

theredbeard · 2026-03-11T18:31:39 1773253899

It’s a self fulfilling prophecy. They’re extremely expensive so they must be good so they must be worth it. And because at that level measurement is extremely subjective it’s mainly about the vibes.

Like everything it’s just marketing.

theredbeard · 2026-03-11T18:28:45 1773253725

I’m sorry but no attempt was made here. It contains all the red flags in the first few paragraphs.

theredbeard · 2026-03-11T18:27:06 1773253626

A vibe? It’s completely obvious AI slop with no attempt to make it legible. They didn’t even prompt out the emdashes. For such a cool finding this is extremely disappointing.

theredbeard · 2026-02-24T13:28:57 1771939737

It's a fair question. I've had problems with Gemini 3 due to rate limiting, and I've been working on this for a while now. I'm planning Gemini 3 for a follow up.

theredbeard · 2026-02-16T18:19:16 1771265956

It’s not groundbreaking in a technological sense. The codebase is actually a bit of a monstrosity. But it removed guardrails that were artificially put on these LLMs which suddenly gave it an entire new dimension and the timing was right.

theredbeard · 2026-02-13T10:13:41 1770977621

I built this because I was curious what Claude sends to the API, how subagents get work delegated and what contexts look like. Interesting to see how small part of the context the user interaction really is typically.

don_searchcraft · 2026-02-20T15:46:09 1771602369

Pretty interesting to see how widely the token use differs across the models for the same task.