Software Freedom Conservancy: Give Up GitHub: The Time Has Come!
Software Freedom Conservancy: Give Up GitHub: The Time Has Come!
Posted Jul 1, 2022 8:33 UTC (Fri) by Karellen (subscriber, #67644)In reply to: Software Freedom Conservancy: Give Up GitHub: The Time Has Come! by bluca
Parent article: Software Freedom Conservancy: Give Up GitHub: The Time Has Come!
1. What case law, if any, did you rely on in Microsoft & GitHub's public claim, stated by GitHub's (then) CEO, that: “(1) training ML systems on public data is fair use, (2) the output belongs to the operator, just like with a compiler”? In the interest of transparency and respect to the FOSS community, please also provide the community with your full legal analysis on why you believe that these statements are true.I assume this must be borne out of US-centric view - there's no need to invoke fair use here in the Europe, data mining on publicly available text and data bodies is exempt from copyright rules as per the copyright directive from a couple of years back. Whether a repository is proprietary or under an FOSS license, it's completely irrelevant, anyone can data mine all day long, as long as it's publicly and legally accessible.
I note that you (along with many other CoPilot defenders) always focus heavily on the data mining (or "model training") side of the legal implications, and tend to ignore or gloss over the code generation side of things.
I have no issues at all with anyone gathering, analysing and performing computations on whatever FOSS source code they can get their hands on. It's out there with a license that explicitly states you're free to read it, analyse it and learn from it, for your own benefit.
Where I do have issues is where CoPilot outputs source code which is distributed to others. I fail to understand how the source code it produces can not be considered a "derivative work" of its source code inputs, as without those inputs it would produce no output at all. And producing and distributing a derivative work does require a license - or (as SFC ask for) some kind of explanation why the distributor feels a license is not needed.
It is strange if CoPilot's authors invoke the comparison with a compiler, where the output is owned by the operator. Because that's only true if the inputs are owned by the operator. You can't run someone else's source code through a compiler and then claim copyright ownership of the object code just because you invoked the compiler. I am not a copyright lawyer, but I know that isn't how copyright law works.