Tesla designed FSD Beta neural network architectures including a "Spatial Recurrent Neural Network Video Module" that conditionally updated internal knowledge based on visibility, positioning and timing. Presumably some of that architecture fed into the decisions for Occupancy network and now end-to-end, so there should be the capability of tracking occluded objects over time, but indeed unclear how long it can remember and how much is necessary.What is v12's scenario recall capacity? 100's of Milliseconds? What's really needed for safe driving?
Are there examples of 12.x doing things based on a previous memory of something no longer visible? These examples might provide some estimations of how long end-to-end can remember, and for this particular failure to allow oncoming stop sign traffic to go first could be 12.2.1 control network not paying enough attention to those memory signals.