publications

publications by categories in reversed chronological order.

2024

  1. gvgen.png
    arXiv
    GVGEN:Text-to-3D Generation with Volumetric Representation
    Xianglong He*, Junyi Chen*, Sida PengDi Huang, Yangguang Li, Xiaoshui Huang, Chun Yuan, Wanli Ouyang, and 1 more author
    arXiv preprint, 2024
    (* indicates equal contribution)
  2. agent3d.png
    arXiv
    Agent3D-Zero: An Agent for Zero-shot 3D Understanding
    Sha Zhang, Di Huang, Jiajun Deng, Shixiang Tang, Wanli OuyangTong He, and Yanyong Zhang
    arXiv preprint, 2024
  3. FiT.png
    arXiv
    FiT: Flexible Vision Transformer for Diffusion Model
    Zeyu Lu*, Zidong Wang*, Di Huang, Chengyue Wu, Xihui LiuWanli Ouyang, and Lei Bai
    arXiv preprint, 2024
    (* indicates equal contribution)
  4. pointcloudmatters.png
    arXiv
    Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
    Haoyi Zhu, Yating Wang, Di Huang, Weicai Ye, Wanli Ouyang, and Tong He
    arXiv preprint, 2024
  5. unipad.png
    CVPR
    UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
    Honghui Yang, Sha Zhang, Di Huang, Xiaoyang Wu, Haoyi Zhu, Tong He, Shixiang Tang, Hengshuang Zhao, and 3 more authors
    CVPR, 2024
  6. motiongpt.png
    AAAI
    MotionGPT: Finetuned LLMs are General-Purpose Motion Generators
    Yaqi Zhang, Di Huang, Bin Liu, Shixiang Tang, Yan Lu, Lu Chen, Lei Bai, Qi Chu, and 2 more authors
    AAAI, 2024

2023

  1. ponderv2.png
    arXiv
    PonderV2: Pave the Way for 3D Foundataion Model with A Universal Pre-training Paradigm
    Haoyi Zhu*, Honghui Yang*, Xiaoyang Wu*, Di Huang*, Sha Zhang, Xianglong He, Tong He, Hengshuang Zhao, and 3 more authors
    arXiv preprint, 2023
    (* indicates equal contribution)
  2. sentry crop.png
    NeurIPS
    Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images
    Zeyu Lu*, Di Huang*Lei Bai*Xihui Liu, Jingjing Qu, and Wanli Ouyang
    NeurIPS dataset and benchmark track, 2023
    (* indicates equal contribution)
  3. ponder.jpeg
    ICCV
    Ponder: Point cloud pre-training via neural rendering
    Di HuangSida PengTong He, Honghui Yang, Xiaowei Zhou, and Wanli Ouyang
    ICCV, 2023

2022

  1. OnePose++.png
    NeurIPS
    Onepose++: Keypoint-free one-shot object pose estimation without CAD models
    Xingyi He, Jiaming Sun, Yuang Wang, Di Huang, Hujun Bao, and Xiaowei Zhou
    NeurIPS, 2022
  2. hhor crop.png
    SiggraphAsia
    Reconstructing hand-held objects from monocular video
    Di Huang, Xiaopeng Ji, Xingyi He, Jiaming Sun, Tong He, Qing Shuai, Wanli Ouyang, and Xiaowei Zhou
    Siggraph Asia, 2022

2021

  1. easymocap.png
    Github
    EasyMoCap - Make human motion capture easier.
    Qing Shuai, Qi Fang, Junting DongSida PengDi Huang, Hujun Bao, and Xiaowei Zhou
    2021