56:19 Prof. Lifu Huang: Goodhart’s Revenge: Reward Hacking in RL-Tuned LLMs, and How We Fight Back AI Agent Frontier 46 views - 1 week ago
1:00:08 Prof. Daniel Fried: Inducing and Using Abstractions of Agent Actions AI Agent Frontier 114 views - 2 weeks ago
1:00:12 Professor Maarten Sap: Enabling Human-centric and Culturally Aware Safety of AI Agents AI Agent Frontier 78 views - 3 weeks ago
1:02:08 Prof. Yu Su: Computer Use: Modern Moravec’s Paradox AI Agent Frontier 134 views - 4 weeks ago
57:00 Prof. Melanie Mitchell: Investigating Abstract Reasoning in Humans and Machines AI Agent Frontier 140 views - 4 weeks ago
1:00:36 Prof. Diyi Yang: Automation or Augmentation? Optimizing Human-AI Collaboration AI Agent Frontier 280 views - 3 months ago
47:54 Prof. Muhao Chen: Reasoning Guardrails for the Agentic Web AI Agent Frontier 131 views - 3 months ago
1:01:50 Prof. Huan Sun: Advancing the Capability and Safety of Computer-Use Agents Together AI Agent Frontier 124 views - 3 months ago
54:33 Prof. Lin Yang: Winning Gold at IMO 2025 with a Model-Agnostic Self-Verification Pipeline AI Agent Frontier 261 views - 4 months ago
58:20 Prof. Eric Xin Wang: Building AI Agents that Reason and Act Like Humans AI Agent Frontier 421 views - 4 months ago
47:05 Prof. Natasha Jaques: Multi-agent Reinforcement Learning (MARL) for LLMs AI Agent Frontier 1K views - 4 months ago
1:03:20 Prof. Peter Stone: Human-in-the-Loop Machine Learning for Robot Navigation and Manipulation AI Agent Frontier 382 views - 4 months ago
1:02:20 Prof. Manling Li: RAGEN: Training Agents by Reinforcing Reasoning AI Agent Frontier 560 views - 5 months ago
54:10 Prof. Furong Huang: Towards AI Security – An Interplay of Stress-Testing and Alignment AI Agent Frontier 158 views - 6 months ago
59:11 Prof. Tengyu Ma: STP: Self-play LLM Theorem Provers with Iterative Conjecturing and Proving AI Agent Frontier 262 views - 7 months ago
45:17 Contextual Bandits with Constraints Revisited: A Modular Approach with Improved Rates AI Agent Frontier 155 views - 1 year ago
1:01:47 The Curious Price of Distributional Robustness in Reinforcement Learning: AI Agent Frontier 610 views - 2 years ago
44:16 Solving Stabilize-Avoid Optimal Control via Epigraph Form and Deep Reinforcement Learning AI Agent Frontier 177 views - 2 years ago
1:08:56 Explainable reinforcement learning approaches for safe and interpretable autonomous driving AI Agent Frontier 893 views - 2 years ago
54:19 Towards robust, efficient, and safe reinforcement learning AI Agent Frontier 747 views - 2 years ago
52:06 Safe Reinforcement Learning in the Presence of Non-stationarity: Theory and Algorithms AI Agent Frontier 691 views - 2 years ago
53:01 Safe and Reliable Robot Reinforcement Learning in Dynamic Environments AI Agent Frontier 908 views - 2 years ago
34:40 Chi Jin-Talk Title: When Is Partially Observable Reinforcement Learning Not Scary? AI Agent Frontier 343 views - 3 years ago
48:38 Ding Zhao-Talk Title: Trustworthy Reinforcement Learning. AI Agent Frontier 456 views - 3 years ago
33:08 Ilias Kazantzidis-Talk Title: Human-in-the-loop Safe Reinforcement Learning. AI Agent Frontier 289 views - 3 years ago
29:33 Martim Brandao-Talk Title: Are robots safe for everyone? AI Agent Frontier 214 views - 3 years ago
29:45 Michael Everett-Talk Title: Certifiable Learning Machines. AI Agent Frontier 665 views - 3 years ago
50:54 Sergey Levine-Talk Title: Safety in Reinforcement Learning by Leveraging Offline Data. AI Agent Frontier 1.7K views - 3 years ago
32:58 Simon Shaolei Du-Talk Title: When are Offline Two-Player Zero-Sum Markov Games Solvable? AI Agent Frontier 227 views - 3 years ago
27:59 Zhehua Zhou-Talk Title: Safe Reinforcement Learning with Model Order Reduction Techniques. AI Agent Frontier 288 views - 3 years ago
32:04 Yali Du-Talk Title: Decision Structure in Decentralized Multi-Agent Learning. AI Agent Frontier 356 views - 3 years ago