
> It's not like Meta can remove these books from the training set without retraining from scratch (or at least the last checkpoint before they were used).

They probably can:

https://github.com/zjunlp/EasyEdit
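
For what it's worth, here's a rough sketch of what editing with that repo looks like, going off its README (class names, hparam paths, and return values are from memory and may have changed; the example fact is made up):

    from easyeditor import BaseEditor, ROMEHyperParams

    # rough sketch per the EasyEdit README; exact class names, hparam
    # paths, and return signature may differ between versions
    hparams = ROMEHyperParams.from_hparams('./hparams/ROME/llama-7b')
    editor = BaseEditor.from_hparams(hparams)

    # rewrites one stored association in place, no retraining from scratch;
    # note this edits individual "facts" -- it does not excise documents
    metrics, edited_model, _ = editor.edit(
        prompts=['Who wrote The Firm?'],
        ground_truth=['John Grisham'],
        target_new=['an unknown author'],
        subject=['The Firm'],
    )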

> I wonder if this is going to cause issues down the road.

There are some popular Stable Diffusion models, run by small businesses, that I am certain have CSAM in them because they have a particular 4chan model in their merging lineage.
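
(For context on "merging lineage": these community checkpoints are mostly made by weighted-averaging the raw weights of other checkpoints, so whatever a parent model learned gets blended into every descendant. A rough sketch of the common "weighted sum" merge, with made-up file names:)

    import torch

    # rough sketch of the "weighted sum" checkpoint merge popularized by
    # community UIs; assumes both files are plain state dicts
    def merge(path_a, path_b, alpha=0.5):
        a = torch.load(path_a, map_location="cpu")
        b = torch.load(path_b, map_location="cpu")
        return {k: (1 - alpha) * a[k] + alpha * b[k]
                for k in a.keys() & b.keys()}

    # every parent's weights get blended in, so the lineage is inescapable
    child = merge("base_model.ckpt", "sketchy_finetune.ckpt", alpha=0.3)
    torch.save(child, "merged.ckpt")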

... And yet, it hasn't blown up so far? I have no explanation, but running "illegal" weights seems more sustainable than I would have expected.



I’ve been wondering when the landmark moral panic against Civit.AI and the coomer crowd will start. People have no idea just how much porn this stuff is producing. One of the top textual inversions right now is an… age slider… (https://civitai.com/models/65214/age-slider) ewww. It’s also extremely well rated and reviewed on there. I’m terrified of the impending backlash, because depending on what happens, the party going on in AI could end.


People have been saying this about underage hand-drawn hentai forever, but it's still around.

Not that I am disagreeing with you. What I find particularly disturbing are the paid services for this.

Also, I have seen two separate OnlyFans pimps ask for help in a text-generation chatroom. Something about automating "private" texting from their "girls."


It’s trivial to use these methods to produce real-looking images, or even content in the likeness of real people…


Yeah. I made fine-tuned models of my daughter and niece, and I definitely have to put “sexy, naked,” and the like in the negative prompt when using them.
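
(For anyone unfamiliar: a negative prompt is just a second text conditioning that the sampler steers away from at each denoising step. A minimal diffusers sketch, using the stock SD 1.5 model id:)

    import torch
    from diffusers import StableDiffusionPipeline

    # minimal sketch: negative_prompt is a second conditioning the
    # sampler is steered *away* from during denoising
    pipe = StableDiffusionPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
    ).to("cuda")

    image = pipe(
        prompt="portrait photo, 50mm, natural light",
        negative_prompt="sexy, naked, nsfw, blurry, deformed",
    ).images[0]
    image.save("out.png")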

I don’t think society is going to have a hissy fit until some app comes along that makes it super easy for people to train good models of specific people locally and then generate whatever they want. That day’s coming really soon, though.


There are tons of web services for this. They are just obscure and distributed enough to avoid public ire.

The pieces for local LoRA training are all there, but honestly the tyranny of CUDA is the biggest blocker for the average person.
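
(The LoRA trick itself is tiny, for what it's worth. A self-contained PyTorch sketch of the low-rank adapter idea, not any particular trainer's code:)

    import torch
    import torch.nn as nn

    # sketch of a LoRA adapter: freeze the base weight W and learn a
    # low-rank update, y = Wx + (alpha/r) * B(Ax)
    class LoRALinear(nn.Module):
        def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
            super().__init__()
            self.base = base
            self.base.weight.requires_grad_(False)  # base stays frozen
            self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
            self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: starts as a no-op
            self.scale = alpha / r

        def forward(self, x):
            return self.base(x) + self.scale * ((x @ self.A.T) @ self.B.T)

Only the tiny A and B matrices get trained and shipped, which is why LoRA files are megabytes instead of the multi-gigabyte checkpoints mentioned downthread.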


Sure, but it's still not super user-friendly. You upload photos, get a 2 GB checkpoint file, and then have to run it with obscure, sometimes hard-to-install programs.

I know there was a phone app that did a limited version of this, where they generated profile images for you, and they made bank. I'm a little surprised nobody has tried going whole hog, if the app stores would even allow it.


That is not at all the same thing as removing the books.


> They probably can:

No, actually they probably can’t. There is no verifiable way to remove the data from the model short of removing every instance of that information from the training data and retraining. The project you linked only describes a selective fine-tuning approach.



Until you get models with completely disentangled feature spaces, such that you know the influence of a piece of data has been completely removed (at the limit, this is something like an embedding DB), there is absolutely no way you can claim you’ve removed the data from the model.

At most, these efforts will amount to data laundering: it will become impossible to prove that a piece of data was used to train the model, which is not the same as conclusive proof that it was removed.
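
(To make the "embedding DB" limit concrete: in a retrieval store, a document's entire influence is a single entry, so deletion is trivially provable. A toy sketch:)

    import numpy as np

    # toy sketch of the "embedding DB" limit: each document's influence
    # is exactly one entry, so removal is verifiable by inspection
    store = {}  # doc_id -> (vector, text)

    def add(doc_id, vector, text):
        store[doc_id] = (np.asarray(vector, dtype=np.float32), text)

    def forget(doc_id):
        store.pop(doc_id, None)  # after this, nothing of the doc remains

    def query(vector, k=5):
        v = np.asarray(vector, dtype=np.float32)
        ranked = sorted(store.items(), key=lambda kv: -float(v @ kv[1][0]))
        return [(doc_id, text) for doc_id, (_, text) in ranked[:k]]

Nothing like that holds for a dense transformer, where every training example nudges shared weights.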


Which means we are probably at least 5-10 years away from verifiable removal that a court of law will recognize.


This assumes it's possible. I naively assume it isn't possible without harming the model beyond just the content of the book.


They can probably prevent LLaMA from spitting out verbatim quotes from the books well enough to make proof difficult.

... But yeah, fundamentally the only way to throw out the books is to throw out the weights.
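
(The test a plaintiff would presumably run looks something like this: slide an n-gram window from the book over the model's output and count exact matches. Toy sketch; the tokenization and window size are arbitrary:)

    # toy memorization probe: count long word n-grams from the book that
    # reappear verbatim in model output (n chosen arbitrarily here)
    def verbatim_hits(book: str, output: str, n: int = 50) -> int:
        bt, ot = book.split(), output.split()
        grams = {tuple(bt[i:i + n]) for i in range(len(bt) - n + 1)}
        return sum(tuple(ot[i:i + n]) in grams
                   for i in range(len(ot) - n + 1))

Fine-tuning against exactly this kind of probe is what makes proof difficult without making the underlying copy go away.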


That is quite the spicy claim.



