Copy-left open-source license for AI code use

  • Posted 9 hours ago by program_whiz
  • 3 points
I'm thinking that we need a new open source license that copies an existing license, such as GNU AGPL license (or any flavor really), but has language specific to AI training:

``` This code may be used by AI models freely, but any model trained with this code, or using this code as part of inference, in whole or in part, with or without modification, must be made public under a "Copyleft AI License". All trained model weights, as well as model training and inference source code, and a copy of this source code, must be published in an open-source format with a copy of this license to this website (github.com). ```

The basic idea is that people can freely use the code to train an AI/LLM model, but in doing so will be required to publish the weights and model-related code as well to the same place where they took the code. It also covers the case that they may not be training on the code, but they are scraping it as source material for whatever they are doing in inference.

This hopefully gives legal recourse to any code owners.

It also forces AI companies to think twice and/or develop better filtering, allowing people to better "opt out" of being used for AI.

It also forces companies to publicly publish the best model training and weights, if they use open-source code to train.

Finally, it gives some legal standing to smaller sites (such as gitea or gitlab, or private sites) that are being relentlessly scraped.

0 comments