操作与视觉语言模型
OpenVLA系列工作

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
arXiv 论文链接具身操作VLA foundation model
π系列工作
TikTok GR系列工作

UNLEASHING LARGE-SCALE VIDEO GENERATIVE PRE-TRAINING FOR VISUAL ROBOT MANIPULATION
arXiv 论文链接字节跳动提出的基于大规模视频预训练模型

GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation
arXiv 论文链接字节跳动提出的基于大规模视频预训练模型
仿真平台和基准

RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation
arXiv 论文链接最佳仿真平台升级版。

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
arXiv 论文链接仿真平台集成。
人形机器人运动/模仿学习

ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills
arXiv 论文链接人形机器人对高敏捷人类行为的模仿学习。