The case for contextual copyleft: licensing open-source training data and generative AI
Grant Shanklin et al.
Abstract
The rise of generative AI systems presents new challenges for the Free and Open-Source Software (FOSS) community, particularly around applying copyleft principles when open-source code is used to train AI models. This article introduces the Contextual Copyleft AI (CCAI) licence, a novel use of the copyleft mechanism that extends licence obligations from training data to resulting generative models. The CCAI licence enhances developer control, incentivizes open-source AI, and mitigates open-washing. A structured three-part evaluation examines: (i) legal feasibility under current copyright law, (ii) policy justification across traditional software and AI, and (iii) cross-contextual benefits and risks. Still, open-source AI carries a higher risk—especially misuse—making complementary regulation essential to achieve a fair risk-benefit balance. The article concludes that, within a robust regulatory environment focused on responsible AI, the CCAI licence offers a viable path for preserving and adapting core FOSS values to meet the demands of modern AI development.
Evidence weight
Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40
| F · citation impact | 0.50 × 0.4 = 0.20 |
| M · momentum | 0.50 × 0.15 = 0.07 |
| V · venue signal | 0.50 × 0.05 = 0.03 |
| R · text relevance † | 0.50 × 0.4 = 0.20 |
† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.