Embodied AI 101
AgentFloor: A Benchmark for Long-Horizon Agent Planning
Description
A 30-task benchmark for evaluating long-horizon planning capabilities across 16 different AI models.