Hi, there!

I am a first year Ph.D. student majoring in Computer Science jointly at Shanghai Artificial Intelligence Laboratory and University of Science and Technology of China advised by Prof. Xiaogang Wang and Prof. Wanli Ouyang. I obtained my B.S. degree in Artificial Intelligence Honor Class at Shanghai Jiao Tong University supervised by Prof. Cewu Lu. My research interests include embodied AI, computer vision and robot learning. Feel free to follow me on and for latest research announcements and updates!

In my daily life, I have great passions (but amateur) in football, music, literature, philosophy, traditional Chinese painting and modern Chinese poems!

News

  • Oct. 2023: PonderV2 and UniPAD has been announced! PonderV2 is a universal pre-training paradigm for 3D vision, paving the way for 3D foundation model. It achieves SOTA on 11 indoor and outdoor benchmarks. Check out our paper and code!
  • Jul. 2023: RH20T has been announced! RH20T is a large-scale open-source robotic dataset for learning diverse skills in one-shot, comprising over 110,000 contact-rich robot manipulation sequences across diverse skills, contexts, robots, and camera viewpoints, all collected in the real world. Please check out our website for latest updates!
  • Jun. 2023: AlphaTraker has been published at Frontiers in Behavioral Neuroscience. It is a multi-animal tracking and behavioral analysis tool for behavioral research. Also please check out our open-source code on !
  • Nov. 2022: MineDojo has won 🎉 Outstanding Paper Award 🎉 at NeurIPS announcement!
  • Nov. 2022: AlphaPose paper is accepted by TPAMI! AlphaPose is an accurate multi-person pose estimator, which has received more than 6.5K stars on Github. Check out the paper for more details and feel free to star on !
  • Oct. 2022: X-NeRF is accepted by WACV 2023! Checkout our code on !
  • Jun. 2022: MineDojo has been announced! MineDojo is a new framework for building generally capable agents with internet-scale knowledge in Minecraft. Paper, code, and databases are all open access. Check it out today!
Interests
  • Embodied AI
  • Computer Vision
  • Robot Learning
Education
  • B.S. in Artificial Intelligence Honor Class, 2019 - 2023

    Shanghai Jiao Tong University

  • Ph.D. in Computer Science, 2023 - Present

    Shanghai AI Lab & USTC

Publications

Visit my Google Scholar page for a comprehensive listing!

*
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
arXiv preprint.
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
arXiv preprint.
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
RH20T: A Robotic Dataset for Learning Diverse Skills in One-Shot
RSS 2023 Workshop on Learning for Task and Motion Planning.
RH20T: A Robotic Dataset for Learning Diverse Skills in One-Shot
AlphaTracker: a multi-animal tracking and behavioral analysis tool
Frontiers in Behavioral Neuroscience, 2023.
AlphaTracker: a multi-animal tracking and behavioral analysis tool
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI).
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time
X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360 Insufficient RGB-D Views
IEEE Winter Conference on Applications of Computer Vision (WACV), 2023.
X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360 Insufficient RGB-D Views
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
💫✨ Outstanding Paper Award ✨💫. Neural Information Processing Systems (NeurIPS) Dataset & Benchmark, 2022.
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Unsupervised Multi-Task Learning for 3D Subtomogram Image Alignment, Clustering and Segmentation
IEEE International Conference on Image Processing (ICIP), 2022.
Unsupervised Multi-Task Learning for 3D Subtomogram Image Alignment, Clustering and Segmentation

Experience

 
 
 
 
 
Shanghai AI Lab & USTC
Ph.D in Computer Science
Sep 2023 – Present Shanghai, China
 
 
 
 
 
Shanghai AI Lab
Research Intern
Nov 2022 – Present Shanghai, China
  • Conducting AI research on 3D vision, foundation model and Embodied AI.
  • Joint Ph.D. with USTC.
 
 
 
 
 
MVIG, SJTU
B.S. in Artificial Intelligence
Sep 2019 – Jun 2023 Shanghai, China
 
 
 
 
 
Jim Team, NVIDIA AI Lab and Caltech
Remote Research Intern
Feb 2022 – Feb 2023 Shanghai, China
 
 
 
 
 
Xu Lab, CMU
Remote Research Intern
Apr 2021 – Feb 2022 Shanghai, China

Poems

Some of my modern Chinese poems

石头
孤独:永恒
凌晨随笔
emo