GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation

At a glance
- VLM-based robot planners struggle with long, complex tasks because natural-language plans can be ambiguous, especially when specifying both actions and locations.
- GroundedPlanBench evaluates whether models can