Guys, I get it that Anthropic also scrapes the internet to train the model. But I feel like scraping open web and distilling from a frontier model is different?
Distillation also directly inherits a frontier model’s alignment and behavior, without paying the underlying R&D or safety costs. That may be a different incentive problem than web scraping.
This feels similar (even if not identical) to a pharmaceutical company reverse-engineering a drug developed through years of costly R&D. It surely can lower prices and expand access to more people, but it’s not obvious that this is a long-term win-win situation. I don't know.