cross-posted from: https://lemmy.ca/post/19946388
An anticapitalist tech blog. Embrace the technology that liberates us. Smash that which does not.
I think you missed the part where it was strongly suggested that you “not” use copyrighted text.
The point is not to get rid of the original text. It’s to “poison” the training data.
If the AI trainers have the original text then “poisoning” the live site’s content isn’t going to do anything at all.
You can’t touch the original text. It’s already been archived.
If they scrape the updated comments again and ingest copyrighted text, you are poisoning the data.
That’s my point. They won’t.
And even if they did, it’s unclear that copyright has anything to say about AI training anyway.