Is This Edit Correct? A Multi-Dimensional Benchmark for Reasoning-Aware Image Editing
Paper • 2606.05172 • Published • 1
None defined yet.
Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration?