For decades, the benchmark for machine intelligence was the chessboard. But as AI models move into the real world to act as ...