diplomat33
Average guy who loves autonomous vehicles
Armchair thoughts:
A modular NN can be one monolithic NN once completed (layers all directly stacked). The difference between those two cases is that intermediate layers have fixed meaning in the modular case. So training back props from that boundary layer to the previous as GIGO applies.
Success of a modular approach requires the designers can correctly determine what data is important to pass on and what can be discarded. (Making it difficult on themselves)
Without the quantized intermediates, training occurs over the whole NN and validation of intermediates becomes difficult. Internal structure itself may be more uniform due to removal of the category filters. They may be able to get some clues using a fMRI type heat map of the NN while stimulating it. What images propagate through? Training suite would need to monitor this as it's beyond human scale. (Making it compute heavy)
Modular: NN recognizes the X types of road signs, recognizes lane lines in our Y test cases, follows rational paths in the Z senarios.
E2E: NN followed the road safely and legally in the Z scenarios, but what data it acted on is less clear.
Here is an interesting video where Drago (Waymo) and Alex (Wayve) discuss the differences between modular and E2E:
Interestingly, Drago says the trend in ML is towards larger models. He thinks too many modules and E2E are both extreme on opposite sides of the spectrum. He argues for a middle ground. He sees many advantages in a modular approach so he advocates for a modular approach but with a small number of large models. Alex argues for E2E, saying it is simpler and more efficient since you just need data and you can automate training of one large model that does everything.
I wonder if E2E will ultimately prevail, it is just a matter of how to get there. The modular approach will get to E2E in a more incremental way, starting with many modules and gradually merging modules into fewer and larger modules until they eventually are just left with one big module (E2E). The other E2E approach from say Wayve is to build and train one large model from scratch.
Last edited: