Showing episodes and shows of Zhang Yunlong

Shows

Daily Paper Cast
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
🤗 Upvotes: 30 | cs.CL
Authors: Shanghaoran Quan, Jiaxi Yang, Bowen Yu, Bo Zheng, Dayiheng Liu, An Yang, Xuancheng Ren, Bofei Gao, Yibo Miao, Yunlong Feng, Zekun Wang, Jian Yang, Zeyu Cui, Yang Fan, Yichang Zhang, Binyuan Hui, Junyang Lin
Title: CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Arxiv: http://arxiv.org/abs/2501.01257v1
Abstract: With the increasing code reasoning capabilities of existing large language models (LLMs) and breakthroughs in reasoning models like OpenAI o1 and o3, there is a growing need to develop more cha...
2025-01-04, 23 min

Papers Read on AI
Video Understanding with Large Language Models: A Survey
With the burgeoning growth of online video platforms and the escalating volume of video content, the demand for proficient video understanding tools has intensified markedly. Given the remarkable capabilities of Large Language Models (LLMs) in language and multimodal tasks, this survey provides a detailed overview of the recent advancements in video understanding harnessing the power of LLMs (Vid-LLMs). The emergent capabilities of Vid-LLMs are surprisingly advanced, particularly their ability for open-ended spatial-temporal reasoning combined with commonsense knowledge, suggesting a promising path for future video understanding. We examine the unique characteristics and capabilities of Vid-LLMs, categorizing the approaches into four main...
2024-01-10, 1h 32