DeepMind's paper on MuZero is out in Nature. Here's their blog post:
MuZero: Mastering Go, chess, shogi and Atari without rules
For many years, researchers have sought methods that can both learn a model that explains their environment, and can then use that model to plan the best course of action.
So does this portend a future without traffic laws? If we move to a world where the notion of rules of the road disappears, and all that's left is goal oriented behavior, then this paper shows that we already have some idea how to deal with that.
If we have autonomous transportation software that can figure out the local conventions and preferences from scratch, then it can happily pilot pedestrians, bicycles, cars, trucks, delivery vans, tuk-tuks, etc. Why would rules be needed?
MuZero: Mastering Go, chess, shogi and Atari without rules
For many years, researchers have sought methods that can both learn a model that explains their environment, and can then use that model to plan the best course of action.
So does this portend a future without traffic laws? If we move to a world where the notion of rules of the road disappears, and all that's left is goal oriented behavior, then this paper shows that we already have some idea how to deal with that.
If we have autonomous transportation software that can figure out the local conventions and preferences from scratch, then it can happily pilot pedestrians, bicycles, cars, trucks, delivery vans, tuk-tuks, etc. Why would rules be needed?