Lucid Dream Test
Imagine the following scenario. You enter a room and you are asked to wear a VR headset that has a camera and supports a passthrough mode (which can display a real-time feed of your surroundings). Once you put on the headset, you find yourself in what appears to be the exact same room.
For the next five minutes, you are asked to walk around, interact with objects, and have conversations with people who enter the room.
Finally, while still wearing the headset, you are asked the question: do you believe that you are viewing the real world via your headset’s passthrough mode, or is everything you're experiencing generated by an AI model?
As the quality of world simulators further improves, we’ll need to increasingly focus on evaluations that include the ability to interact with the simulated environment. This is a harder task than simply generating physically plausible videos, where the models can “cheat” by avoiding generating difficult futures.