Skip to main content

You are not logged in. Your edit will be placed in a queue until it is peer reviewed.

We welcome edits that make the post easier to understand and more valuable for readers. Because community members review edits, please try to make the post substantially better than how you found it, for example, by fixing grammar or adding additional resources and hyperlinks.

Required fields*

6
  • 50
    my guess is there is internal disagreement about how to handle this. Hence the mixed messaging. I have ZERO inside insight, that's just my opinion. 🤷 I am a very very VERY strong advocate of sticking to the guarantees we made with the CC license from inception, though. I think CC needs to revise / update their license to cover the AI use case, personally. Commented Jun 14, 2023 at 0:15
  • 13
    @JeffAtwood First someone has to argue successfully that using copyrighted work to train AI to generate content doesn't fall under Fair Use. I expect that to be quite expensive because all the companies using GenAI are going to be on the other side with deep pockets. Commented Jun 14, 2023 at 3:11
  • Training a large language model is a derivative work, therefore requires attribution of everyone who contributed to the training data? Commented Jun 14, 2023 at 6:50
  • 5
    @gerrit Is it illegal for a firm to train an AI model on a CC BY-SA 4.0 corpus and make a commercial use of it without distributing the model under CC BY-SA?. Note that before LLMs, we had the same question for word embeddings, but few people raised the issue back then Distributing machine learning models (e.g., word embeddings) based on non-sharable datasets and I don't recall seeing any court cases about it. Commented Jun 14, 2023 at 7:11
  • 1
    Considering that thanks to "some meddling ex employees" the cat was already out of the bag there was no need to try to keep this secret anymore. Even better, by conceding this right now they are giving the userbase the "false" idea that they "won" something, and this could make them more well disposed and yielding in whatever discussion about ending the strike is going on right now. Commented Jun 14, 2023 at 8:36
  • 2
    @JeffAtwood "I think CC needs to revise / update their license to cover the AI use case, personally." But then this newly revised CC licenses maybe wouldn't be compatible with the older CC versions anymore with difficulties for editing content. And how should that be solved practically? I cannot literally acknowledge every single place I learned something from when typing something. Should I write endless lists of footnotes with possible references for why I might say what I say? Commented Jun 14, 2023 at 8:45