It's true, the post lays out the details clearly, but a hands-on example can often make the concepts more tangible. Seeing it in action helps solidify understanding.
The post lays out the steps clearly, but implementing them often reveals unexpected challenges. It's usually more complicated in practice than it appears on paper.
This. I literally am asking for a step-by-step guide outlining every step (including an existing corpus that can be used on a consumer-grade laptop to train the model in under a week).
If the implementation details are clear, replicating the setup can be worthwhile. Sometimes seeing it in action helps to better understand the nuances.
Also because normal usage has predictable usage patterns, which allows them to optimise and predict costs. Flat rate pricing only makes sense in that regime.
Sure, until their "smart filters" start considering GCP-hosted websites as pre-verified and small self-hosted websites as malicious. You know, like they have been doing with email?
Chrome is big enough that a website owner can't afford a false positive on their malware list, just like they can't afford to have all their email end up in spam for all Gmail users.
Due to their near-monopoly, Google also has no incentive to avoid adding false positives to their blocklist - provided they don't accidentally block high-profile targets. And if a CxO is screaming over your shoulder that your website has been blocked, arguments about "false positives" aren't very compelling: they'll just demand you move off the "shitty basement provider" and switch to "proper hosting, like Google Cloud"...
Foundational:
- Pretraining
- Mid/post-training (SFT)
- RLHF or alignment post-training (RL)
And sometimes...
- Some more customer-specific fine-tuning.
Note that any supervised fine-tuning following the Pretraining stage is just swapping the dataset and maybe tweaking some of the optimiser settings. Presumably they're talking about this kind of pre-RL fine-tuning instead of post-RL fine-tuning, and not about swapping out the Pretraining stage entirely.
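To make the "same loop, different data" point concrete, here's a minimal framework-free sketch. The model, datasets, learning rates, and epoch counts are all toy illustrations (nothing here reflects a real LLM pipeline); the point is just that the pretraining call and the SFT call reuse the identical training function, differing only in dataset and optimiser settings:

```python
def train(weights, dataset, lr, epochs):
    """One generic supervised loop: the same code serves both stages."""
    for _ in range(epochs):
        for x, y in dataset:
            pred = weights["w"] * x + weights["b"]
            err = pred - y
            # Plain SGD on squared error; only data and lr change per stage.
            weights["w"] -= lr * 2 * err * x
            weights["b"] -= lr * 2 * err
    return weights

model = {"w": 0.0, "b": 0.0}

# "Pretraining": large generic dataset, higher learning rate (toy numbers).
pretrain_corpus = [(x, 2 * x + 1) for x in range(-5, 6)]
model = train(model, pretrain_corpus, lr=0.01, epochs=200)

# "SFT": the very same loop, smaller task-specific dataset, lower learning rate.
sft_pairs = [(x, 2 * x + 1.5) for x in range(0, 3)]
model = train(model, sft_pairs, lr=0.005, epochs=100)
```

In a real setup the "tweaked optimiser settings" would typically include a lower learning rate, a shorter schedule, and perhaps a different warmup, but the training code itself is unchanged.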