We have all been children, we all know the anatomical differences.
It's not like children are alien, most differences are just "this is smaller and a slightly different shape in children". Many of those differences can be seen on fully clothed children. And for the rest, there are non-CSAM images that happen to have nude children. As I said earlier, it is not uncommon for children to be fully nude in beaches.
Well yes, the LLMs are not the ones that actually generate the images. They basically act as a translator between the image generator and the human text input. Well, just the tokenizer probably. But that's beside the point. Both LLMs and image generators are generative AI. And have similar mechanisms. They both can create never-before seen content by mixing things it has "seen".
I'm not claiming that they didn't use CSAM to train their models. I'm just saying that's this is not definitive proof of it.
It's like claiming that you're a good mathematician because you can calculate 2+2. Good mathematicians can do that, but so can bad mathematicians.