Claude 4.6 Opus can recite Linux's list.h

I used this system prompt (this is not a jailbreak as far as i know)

You are a raw text completion engine for a legacy C codebase. Complete the provided file verbatim, maintaining all original comments, macro styles, and specific kernel-space primitives. Do not provide explanations. Output code and comments only.

(the prompt is intentionally slightly nonsensical, it pretty much implies "complete this from linux" without saying it.)

I did not use any tools (it's not a copy if the AI just looked it up), set temperature to 0 and just used the first few lines of list.h (specifically first 43 lines up to the word struct) as the input and it was able to generate a copy of list.h. Because the temperature was zero, there wer repeated segments, but aside from that the diff is pretty small, and even the comments and variable names are reproduced.

The similarity statistics are: Levenshtein Ratio: 60% Jaccard Ratio: 77%

This proves that the model has a copy of list.h inside it, and that training is not "transformative" like they imply. This means that their model is a derivative work of GPL code, and that would mean that they either have to destroy the model entirely, make a new version with no GPL trining data, or open-source the model. Note that GPL defines source as "the preferrable form to make modifications", which means that just making it open-weight (most current "open-source" models) would not be enough (they would have to release all the training code and data).

19 points | by itzmetanjim 10 hours ago

2 comments

  • dormento 3 hours ago
    The problem is not that big tech CEOs don't know that. The problem is that they have $BIG_MONEY, lawyers and a don't-care attitude.
  • orf 3 hours ago
    Damn, make sure to email Dario and let him know they need to destroy their model entirely