The toughest benchmark that challenges AI models to solve mini video games with no written instructions just got a whole lot tougher. By design, the games are easy for humans to figure out after a few minutes of experimentation but incredibly difficult for computers to solve, as these terrible stats show. See how you do.