News

大模型推理能力的进化路径:可验证奖励、离策略学习和测试时强化学习

Speaker:Dr. Yafu LI, Shanghai Artificial Intelligence Laboratory

Time:Oct 27, 2025, 14:00-16:30

Location:Room 112, Lecture Hall 3

Post