AIRLab

@UCU_55s_mNE1V8jd57mst9cw - 3.5K subscribers

This is AIRLab, the AI & Robotics Lab at Kyung Hee University.

[ICML 2026 (Oral)] Stabilizing the Q-Gradient Field for Policy Smoothness in Actor-Critic Methods

151 views - 1 week ago

Material properties, yahoo~ Let me guess whether you're rubber or liquid. UniPhy: Learning a Unified Constitutive Model for Inverse Physics Simulation

88 views - 2 weeks ago

Not enough robot learning data? Just use human videos. ImMimic: Cross-Domain Imitation from Human Videos via Mapping, Interpolation

83 views - 3 weeks ago

Robots Learn by Dreaming: DreamGen—Unlocking Generalization in Robot Learning through Video World...

98 views - 1 month ago

Long-term extrapolation? No problem! MoGaF: Space-Time Forecasting of Dynamic Scenes with Motion-...

88 views - 1 month ago

[CVPR 2026] Keep it SymPL: Symbolic Projective Layout for Allocentric Spatial Reasoning in VLMs

103 views - 1 month ago

Stable Comfort~ FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity

104 views - 1 month ago

God still has 10 demos.. Tether: Autonomous Functional Play with Trajectory Warping

103 views - 2 months ago

Neural ODEs do not stop even in the extrapolation interval BOY~↗ ParticleGS: Gaussian Particle Dy...

131 views - 2 months ago

What we see is 2D, what the model needs to see is 4D. SeeU: Seeing the Unseen World via 4D Dynami...

92 views - 2 months ago

[ICLR 2026] SiNGER: A Clearer Voice Distills Vision Transformers Further

176 views - 3 months ago

How to build a large model from a small model. Learning to Grow Pretrained Models for Efficient T...

205 views - 3 months ago

I roughly got it. Your nature. Toward Material-Agnostic System Identification from Videos

86 views - 3 months ago

Since when did you think the camera is the center of the world? TAPIP3D: Tracking Any Point in Pe...

703 views - 3 months ago

A robot team without robots. Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware

161 views - 3 months ago

Today's topic is ViT Distillation. However, accompanied by Singular Nullspace. SiNGER: Clearer Vo...

176 views - 4 months ago

The Moment When Internet Photos Become Robot Data Robot Learning from Any Images

199 views - 4 months ago

I like the future predictions that videos make🤙🏻👍🏻 Cosmos Policy: Fine-Tuning Video Models for Vi...

195 views - 4 months ago

One page won't do. Space is a 'flow' 🎥 Thinking in Space: How MLLMs See, Remember, and Recall Spaces

162 views - 4 months ago

What does AI understand about the world? The Platonic Representation Hypothesis

249 views - 5 months ago

Uh… who are you? Me? Why do we move exactly the same? 🤨 PhysTwin

124 views - 5 months ago

Wow, lots of fluid reconstruction today. You'll be thinking about it before bed~ Learning an Implicit Physics Model for Image-based Fluid Simulation

156 views - 5 months ago

Your demos are not needed, human 🤖 GraspVLA: a Grasping Foundation Model on Billion-scale Synthetic Action Data

237 views - 5 months ago

No more ResNets that just add up✋➕ Deep Delta Learning

208 views - 5 months ago

The camera-LiDAR alignment is remarkably tight ^^ General, Target-less, and Automatic LiDAR-Camera Calibration Toolbox

131 views - 5 months ago

Can't predict the future? Then use a video generation model. VideoVLA: Video Generators Can Be Ge...

184 views - 5 months ago

The robot version of "seeing once beats hearing a hundred times" 👀⚖️⤴️⤵️🔃↩️↪️🔄 One Demo is Worth a Thousand Trajectories

175 views - 6 months ago

Back and stronger than ever! RoboTwin 2.0: A Robotic Data Generator and Benchmark with Domain Randomization

141 views - 6 months ago

Look this way~ look that way~ 3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS

145 views - 6 months ago

VLM is like this... What's in the Image? A Deep-Dive into the Vision of Vision Language Models

840 views - 6 months ago

Follow along with me, like this~ TRACE: Learning 3D Gaussian Physical Dynamics from Multi-view Videos

138 views - 6 months ago

Q: "Is this pixel right here?" VLM: "No.. it's farther back..!!" DepthLM: Metric Depth From Vision Language Models

213 views - 7 months ago

🏳️Send token 1, don't send 2, send 3🏴 MoR-ViT: Efficient Vision Transformer with Mixture-of-Recur...

147 views - 7 months ago

🤖Beep. The end of the human era has come..🤖 RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning

172 views - 7 months ago

This level of alignment with no targets? HiGS-Calib: Hierarchical 3DGS based Targetless LiDAR-Camera Calibration

147 views - 7 months ago

Secluded training in the simulator..? X-SIM: Cross-Embodiment Learning via Real-to-Sim-to-Real

214 views - 7 months ago

OpenVLA, you can get even better! Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

747 views - 7 months ago

Using multiple depth cameras and running into interference? I'll fix that for you. DRIM: Depth Restoration With Interference Mitigation

136 views - 8 months ago

The picture? Understood it perfectly! (Did not understand) Is a Picture Worth a Thousand Words? Spatial Reasoning in VLM

149 views - 8 months ago

Try your camera and LiDAR served with Gaussian splatting. Robust LiDAR-Camera Calibration With 2D Gaussian Splatting

301 views - 8 months ago

I see it, I see it~👀 Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning of Vision Language Models

121 views - 8 months ago

The Giant DINO is Coming... DINOv3: Self-Supervised Learning for Vision at Unprecedented Scale

755 views - 8 months ago

"I can't see it" is just an excuse! DIFIX3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

198 views - 8 months ago

Diffusion? "Slow..." CARP: Visuomotor Policy Learning via Coarse-to-Fine AutoRegressive Prediction

150 views - 8 months ago

What? Robot data can be copied?! 🖨️🖨️🖨️ Constraint-Preserving Data Generation for Visuomotor Policy Learning

167 views - 9 months ago

Controlling a robot with text? CLIP-RT: Learning Robotic Policies from Natural Language Supervision

249 views - 9 months ago

I have a Robot~🦾 I have a twin~🪞 Uh! RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins

122 views - 9 months ago

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas (ICML 2025)

201 views - 10 months ago

Now, watch closely. This is what an arrow is↗️↙️↘️⬇️↙️. Just do exactly this💫 Robotic Visual Instruction (CVPR 2025)

125 views - 10 months ago

Beyond plausible: accurate down to the laws of physics. PhysFlow: Multi-modal Foundation and Video Diffusion for 4D Physical Simulation

167 views - 10 months ago

Euler!🤜 Lagrangian!🤛 Cross!!!🤝 ELPINN: Eulerian Lagrangian Physics-Informed Neural Network

141 views - 10 months ago

You're that good? Let's see if you can handle this too 🔎️🧐️ GENMANIP: LLM-driven Simulation for Instruction-Following Manipulation

94 views - 10 months ago

VLM: just imagined telling left from right🤣🤣 Perspective-Aware Reasoning in VLM via Mental Imagery Simulation

189 views - 10 months ago

Ready, arm?💪 Of course, leg.🦵 Visual Whole-Body Control for Legged Loco-Manipulation (CoRL 2024)

161 views - 10 months ago

Robot manipulation done in a snap with a VLM 🦾 OmniManip: General Robotic Manipulation via Object Primitives as Constraints

254 views - 11 months ago

You know what to do but not how to do it? Watch closely, I'll show you👀 UAD: Unsupervised Affordance Distillation (ICRA 2025)

142 views - 11 months ago

The LLM has dealt with the AI that ignores the laws of physics, so rest easy👍 PhyT2V: LLM-Guided Physics-Based Video Generation (CVPR 2025)

163 views - 11 months ago

Ever thought about normalization? 🤔🤔 Transformers without Normalization (CVPR 2025)

241 views - 11 months ago

Enter the champion that does everything with one model💪 (ba-dum-tss). VGGT: Visual Geometry Grounded Transformer (CVPR 2025)

976 views - 11 months ago

Now even robot joints get rendered, whoa 🩻🩻 Differentiable Robot Rendering (CoRL 2024 Oral)

462 views - 1 year ago