podcastdetails.com

Embodied AI 101

AgentFloor: A Benchmark for Long-Horizon Agent Planning

Description

A 30-task benchmark for evaluating long-horizon planning capabilities across 16 different AI models.
