As a user of an unsigned Firefox fork, Turnstile has ruined a moderate portion o...

tempest_ · 2026-02-27T23:01:29 1772233289

As bad as cloudflare is there is a reason people use it.

If you try and run a site that has content that LLMs want or expensive calls that require a lot of compute and can exhaust resources if they are over used the attack is relentless. It can be a full time job trying to stop people who are dedicated to scrapping the shit out of your site.

Even CF doesnt even really stop it any more. The agent run browsers seem to bypass it with relative ease.

noplacelikehome · 2026-02-28T10:49:53 1772275793

Granted, but there are open source alternatives that don’t have the same obsession with meaningless digital signatures. Turnstile is just a terrible product.

neoromantique · 2026-02-27T23:14:19 1772234059

Vast majority of websites today can and should be static, which makes even the aggressive llm scrapping non-issue.

PaulDavisThe1st · 2026-02-27T23:18:21 1772234301

One of the things that a lot of LLM scrapers are fetching are git repositories. They could just use git clone to fetch everything at once. But instead, they fetch them commit by commit. That's about as static as you can get, and it is absolutely NOT a non-issue.

LoganDark · 2026-02-27T23:29:26 1772234966

No... Basically all git servers have to generate the file contents, diffs etc. on-demand because they don't store static pages for every single possible combination of view parameters. Git repositories also typically don't store full copies of all versions of a file that have ever existed either; they're incremental. You could pre-render everything statically, but that could take up gigabytes or more for any repo of non-trivial size.

KolmogorovComp · 2026-02-27T23:40:58 1772235658

> Git repositories also typically don't store full copies of all versions of a file that have ever existed either; they're incremental

This is wrong. Git does store full copies.

meatmanek · 2026-02-28T00:21:37 1772238097

git stores files as objects, which are stored as full copies, unless those objects are stored in packfiles and are deltified, in which case they're stored as deltas. https://codewords.recurse.com/issues/three/unpacking-git-pac...

PaulDavisThe1st · 2026-02-28T15:39:25 1772293165

... which, in the context that is being discussed, is unusual.

KolmogorovComp · 2026-02-28T10:37:52 1772275072

Thank you for the insights.

neoromantique · 2026-02-27T23:38:58 1772235538

that's a pretty niche issue, but fairly easy to solve.

Prebuild statically the most common commits (last XX) and heavily rate limit deeper ones

PaulDavisThe1st · 2026-02-28T15:40:29 1772293229

1. that doesn't appear to match the fetching patterns of the scrapers at all

2. 1M independent IPs hitting random commits from across a 25 year history is not, in fact, "easy to solve". It is addressable, but not easy ...

3. why should I have to do anything at all to deal with these scrapers? why is the onus not on them to do the right thing?

flexagoon · 2026-02-27T23:09:08 1772233748

I see people saying that a lot, but I use Zen which is a fork of Firefox and I don't think I've ever had an issue with Turnstile, at least not noticeably more than I had on mobile Chrome.

pchew · 2026-02-27T23:18:26 1772234306

Zen has been signed for close to a year.

tick_tock_tick · 2026-02-27T23:21:14 1772234474

Isn't it the opposite? They allow you to still use it when it would almost certainly be better for cloudflare and the website behind then to just block you.

sebzim4500 · 2026-02-27T23:46:48 1772236008

How does Cloudflare know you are using the fork? Can you not just set the user agent to match firefox's (or even chrome's for that matter)

noplacelikehome · 2026-02-28T10:54:43 1772276083

Quite likely fingerprinting detection, which is remaining firmly enabled.

sebzim4500 · 2026-02-28T11:49:48 1772279388

How does that work technically? Presumably a fork of firefox is almost indistinguishable from firefox from Cloudflare's perspective?