Computer Vision

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
A general 3D pre-training approach establishing a pathway to 3D foundational models.
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Building open-ended agents with internet-scale knowledge in Minecraft.
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge