AsgardBench: A benchmark for visually grounded interactive planning

At a glance
- To successfully complete tasks, embodied AI agents must ground and update their plans based on visual feedback.
- AsgardBench isolates whether agents can use visual
