9:00 GDPO Explained: NVIDIA Fixes GRPO for LLM Reinforcement Learning AI Papers Academy 3.3K views - 3 months ago
11:23 Why Reinforcement Learning Unlocks Reasoning in LLMs (Aha Moments Explained) AI Papers Academy 2.5K views - 3 months ago
13:40 DINOv3 Paper Explained: The Computer Vision Foundation Model AI Papers Academy 18.1K views - 7 months ago
8:30 Reinforcement Pre-Training (RPT) By Microsoft Explained AI Papers Academy 2.5K views - 9 months ago
8:07 Darwin Gödel Machine Explained: Self-Improving AI Agents AI Papers Academy 4.5K views - 10 months ago
9:36 Continuous Thought Machines (CTMs) - The Era of AI Beyond Transformers? AI Papers Academy 11.4K views - 10 months ago
8:35 Perception Language Models (PLMs) by Meta – A Fully Open SOTA VLM AI Papers Academy 7.9K views - 11 months ago
14:38 GRPO Reinforcement Learning Explained (DeepSeekMath Paper) AI Papers Academy 5.4K views - 1 year ago
8:21 Cheating LLMs & How (Not) To Stop Them | OpenAI Paper Explained AI Papers Academy 2.5K views - 1 year ago
8:04 START by Alibaba: Teaching LLMs to Debug Their Thinking with Python AI Papers Academy 2.5K views - 1 year ago
8:31 SWE-RL by Meta — Reinforcement Learning for Software Engineering LLMs AI Papers Academy 3K views - 1 year ago
9:29 Large Language Diffusion Models - The Era Of Diffusion LLMs? AI Papers Academy 23.9K views - 1 year ago
8:49 s1: Simple Test-Time Scaling - Can 1k Samples Rival o1-Preview? AI Papers Academy 6K views - 1 year ago
9:01 DeepSeek Janus-Pro: DeepSeek's Revolution in Multimodal AI? AI Papers Academy 9.2K views - 1 year ago
9:09 DeepSeek-R1 Paper Explained - A New RL LLMs Era in AI? AI Papers Academy 84.6K views - 1 year ago
10:23 rStar-Math by Microsoft: Can SLMs Beat OpenAI o1 in Math? AI Papers Academy 5.3K views - 1 year ago
10:23 Large Concept Models (LCMs) by Meta: The Era of AI After LLMs? AI Papers Academy 39K views - 1 year ago
10:07 Byte Latent Transformer (BLT) by Meta AI - A Tokenizer-free LLM AI Papers Academy 13.3K views - 1 year ago
9:41 Coconut by Meta AI - LLM Reasoning With Chain of Continuous Thought AI Papers Academy 9.9K views - 1 year ago
8:57 Hymba by NVIDIA: A Hybrid Mamba-Transformer SOTA Small LM AI Papers Academy 3.9K views - 1 year ago
7:51 Generative Reward Models: Merging the Power of RLHF and RLAIF for Smarter AI AI Papers Academy 2.2K views - 1 year ago
4:51 Writing in the Margins: Better LLM Inference Pattern for Long Context Retrieval AI Papers Academy 991 views - 1 year ago
4:33 Sapiens by Meta AI: Foundation for Human Vision Models AI Papers Academy 4.5K views - 1 year ago
7:37 Mixture of Nested Experts by Google: Efficient Alternative To MoE? AI Papers Academy 1.1K views - 1 year ago
4:41 Introduction to Mixture-of-Experts | Original MoE Paper Explained AI Papers Academy 12.8K views - 1 year ago
3:54 Mixture-of-Agents (MoA) Enhances Large Language Model Capabilities AI Papers Academy 3.4K views - 1 year ago
4:52 Arithmetic Transformers with Abacus Positional Embeddings | AI Paper Explained AI Papers Academy 948 views - 1 year ago
7:26 CLLMs: Consistency Large Language Models | AI Paper Explained AI Papers Academy 1.4K views - 1 year ago
7:30 ReFT: Representation Finetuning for Language Models | AI Paper Explained AI Papers Academy 3.9K views - 2 years ago
9:21 Stealing Part of a Production Language Model | AI Paper Explained AI Papers Academy 2.1K views - 2 years ago
6:10 The Era of 1-bit LLMs by Microsoft | AI Paper Explained AI Papers Academy 96.5K views - 2 years ago
11:35 V-JEPA by Meta AI - A Human-Like Computer Vision Video-based Model AI Papers Academy 11.4K views - 2 years ago
6:50 Self-Rewarding Language Models by Meta AI - Path to Open-Source AGI? AI Papers Academy 4.2K views - 2 years ago
11:59 Fast Inference of Mixture-of-Experts Language Models with Offloading AI Papers Academy 2K views - 2 years ago
5:23 TinyGPT-V: Small but Mighty Multimodal Large Language Model AI Papers Academy 2K views - 2 years ago
6:28 LLM in a flash: Efficient Large Language Model Inference with Limited Memory AI Papers Academy 4.8K views - 2 years ago
6:21 Orca 2 by Microsoft: Teaching Small Language Models How to Reason AI Papers Academy 2.4K views - 2 years ago
5:50 LCM-LoRA: From Diffusion Models to Fast SDXL with Latent Consistency Models AI Papers Academy 4K views - 2 years ago
5:27 CODEFUSION by Microsoft: A Pre-trained Diffusion Model for Code Generation AI Papers Academy 1.4K views - 2 years ago
9:31 Table-GPT by Microsoft: Empower LLMs To Understand Tables AI Papers Academy 9K views - 2 years ago
9:20 Vision Transformers Need Registers - Fixing a Bug in DINOv2? AI Papers Academy 4.3K views - 2 years ago
6:01 Emu by Meta AI: Enhancing Image Generation Models Using Photogenic Needles in a Haystack AI Papers Academy 923 views - 2 years ago
6:28 Large Language Models As Optimizers - OPRO by Google DeepMind AI Papers Academy 3.9K views - 2 years ago
6:46 FACET by Meta AI - Fairness in Computer Vision Evaluation Benchmark AI Papers Academy 587 views - 2 years ago
8:26 WizardMath from Microsoft - Best Open Source Math LLM with Reinforced Evol-Instruct AI Papers Academy 4.1K views - 2 years ago
7:36 Shepherd by Meta AI - A Critic for Large Language Models AI Papers Academy 792 views - 2 years ago
7:31 Soft Mixture of Experts - An Efficient Sparse Transformer AI Papers Academy 5.7K views - 2 years ago
6:43 Universal and Transferable LLM Attacks - A New Threat to AI Safety AI Papers Academy 3.5K views - 2 years ago