No, I don't think they need to learn that at all before speculating.
Yeah, diffusion models denoise to get something a human can look at and understand.
Yeah, if you could cover the whole 1.3 MP image with one gigantic block holding a single RGB value and denoise that, it would take less compute.
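To put rough numbers on that, here's a back-of-envelope sketch. It's mine, not anything from a real model: I'm assuming 1280x1024 as the "1.3 MP" image, and that compute scales with the number of values being denoised per step. `values_per_step` is just a made-up helper name.

```python
# Back-of-envelope only: counts how many channel values get denoised per step
# if every block_size x block_size block shares a single RGB value.
# ASSUMPTION: 1280x1024 stands in for "1.3 MP"; cost ~ number of values.
WIDTH, HEIGHT, CHANNELS = 1280, 1024, 3


def values_per_step(block_size: int) -> int:
    """Channel values to denoise when each block holds one RGB value."""
    blocks_w = -(-WIDTH // block_size)   # ceiling division
    blocks_h = -(-HEIGHT // block_size)  # ceiling division
    return blocks_w * blocks_h * CHANNELS


per_pixel = values_per_step(1)                    # every pixel on its own
one_block = values_per_step(max(WIDTH, HEIGHT))   # one block covers everything

print(f"per pixel: {per_pixel} values")
print(f"one giant block: {one_block} values")
print(f"ratio: {per_pixel // one_block}x")
```

So under those assumptions the single block is only 3 values against roughly 3.9 million, which is the "less compute" part. It says nothing about whether the result is useful to look at.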
Sticking my neck out...