Physion
a benchmark evaluating intuitive physics in vision models via object contact prediction across realistic 3D scenarios. Humans consistently outperform state-of-the-art models, emphasizing the need for object-centric, physics-informed representations