Disclaimer: I'm not an ML engineer, but I've been trying to make sense of this myself. Basically, I think the answer is that the code developed up through v11 was not a waste, and it isn't unnecessary in retrospect either.
The v12 NN is not a formless blank slate that is trained from zero on the video data. Rather, it has a starting-point form defined by a large number of non-zero interconnection weights and, crucially, a very large number of zero-valued weights. Together these effectively define special-purpose, localized processing sub-networks that interpret the data and make decisions. You can roughly compare it to regions of the brain: the visual and auditory cortices corresponding to perception centers, and the cerebral cortex and hippocampus for memory, spatial comprehension and so on.
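To make the "zero-valued weights define sub-networks" idea concrete, here is a toy sketch in plain Python. Everything in it is made up for illustration (the block sizes, the `block_mask` and `masked_matvec` helpers, the input values); it says nothing about how Tesla's network is actually laid out. It just shows that when the weights between two groups of units are zero, each group only processes its own inputs.

```python
# Toy illustration only: sizes and layout are hypothetical, not Tesla's network.

def block_mask(block_sizes):
    """Return an n x n 0/1 mask that only allows connections inside each block."""
    n = sum(block_sizes)
    mask = [[0] * n for _ in range(n)]
    start = 0
    for size in block_sizes:
        for i in range(start, start + size):
            for j in range(start, start + size):
                mask[i][j] = 1
        start += size
    return mask

def masked_matvec(weights, mask, x):
    """Compute y = (weights * mask) @ x using plain lists."""
    return [
        sum(w * m * xj for w, m, xj in zip(w_row, m_row, x))
        for w_row, m_row in zip(weights, mask)
    ]

block_sizes = [2, 3]                      # two "sub-networks" of 2 and 3 units
mask = block_mask(block_sizes)
weights = [[1.0] * 5 for _ in range(5)]   # dense weights, all ones
x = [1.0, 2.0, 10.0, 20.0, 30.0]

y = masked_matvec(weights, mask, x)
print(y)  # [3.0, 3.0, 60.0, 60.0, 60.0]
```

The first two outputs only see the first two inputs, and the last three only see the last three, even though the underlying weight matrix is fully populated. The zeros in the mask are what carve out the separate sub-networks.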
Without the predecessor work developed by code and incremental ML techniques throughout the prior versions, Tesla wouldn't have the foundation networks to train. You can't just start from zero with everything connected to everything else and let it train on a giant formless network, because that has too many parameters to handle even with enormous compute resources, and it wouldn't converge to anything useful on its own. The exception is a problem that, unlike driving, can be represented by a small number of well-defined rules, like the often-cited AlphaZero, which was trained from scratch to become the world's best Go player.
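A bit of back-of-the-envelope arithmetic shows why "everything connected to everything" blows up. The numbers below are hypothetical round figures I picked for the example, not real layer sizes from any Tesla network, and `dense_params` / `block_params` are just helper names for this sketch.

```python
# Hypothetical sizes chosen for round numbers; not taken from any real network.

def dense_params(n_in, n_out):
    """Weights in a layer where every input connects to every output."""
    return n_in * n_out

def block_params(blocks):
    """Weights when connections exist only inside separate (n_in, n_out) blocks."""
    return sum(n_in * n_out for n_in, n_out in blocks)

n_units = 1_000_000                       # one million units in and out
dense = dense_params(n_units, n_units)    # 10**12 weights

blocks = [(1_000, 1_000)] * 1_000         # same units split into 1,000 local blocks
structured = block_params(blocks)         # 10**9 weights

ratio = dense // structured
print(dense, structured, ratio)  # 1000000000000 1000000000 1000
```

With the same number of units, the structured version has a thousand times fewer weights to train in this example. That is the kind of saving a pre-existing structure buys you before training even starts.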
Of course, this doesn't mean that if Tesla started over again, knowing what they know now, they would take exactly the same path to get to a trainable and capable network starting point. And it doesn't mean that they won't, sooner or later, do it over again to develop an even more capable and efficient trainable network. But for now, I believe the NN "output network" inherited from v11 is the essential basis of the trainable v12 network structure.
Below are some earlier posts I made on the same topic if you're interested, but I tried to restate and summarize the ideas in this post.
HW4 has three camera inputs that are not used on the cars. On S&X, yes; on 3&Y HW4 has no unused inputs. At least that is what I thought green said.
It's been 24 hours since the last wave... what are the odds of another drop tonight? No idea. May as well shake a magic 8 ball. We are also past 80% install from last night's drop.
What video are you watching? None of what you describe is shown. Yeah even as dissatisfied as I am with FSD and the years of breathless nonsense from the Chief Twit, this UPL video looks…fine (to me).