Skip to content

UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding

Notifications You must be signed in to change notification settings

fudan-zvg/UniUGG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding

arXiv Website
Yueming Xu1*, Jiahui Zhang1*, Ze Huang1*, Yurui Chen1, Yanpeng Zhou2, Zhenyu Chen2, Yu-Jie Yuan2, Pengxiang Xia2, Guowei Huang2, Xinyue Cai2, Zhongang Qi2, Xingyue Quan2, Jianye Hao2, Hang Xu2, Li Zhang1†
1Fudan University  2Huawei Noah’s Ark Lab 

Overview of our UniUGG, the first unified framework for spatial understanding and generation. (A) UniUGG supports spatial-level VQA and generates geometrically consistent 3D scenes. (B) Given a reference image, it can creatively generate 3D variations and describe them accurately. (C) UniUGG outperforms baselines in both spatial understanding and generation, with our specially tuned vision encoder excelling in downstream tasks.

🎞️ Demo Video

📚 Bibtex

If you find this project or dataset helpful, please consider citing our paper:

@article{xu2025uniugg,
    title={UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding},
    author={Xu, Yueming and Zhang, Jiahui and Huang, Ze and Chen, Yurui and Zhou, Yanpeng and Chen Zhenyu and Yuan, Yujie and Xia, Pengxiang and Huang, Guowei and Cai, Xinyue and Qi, Zhongang and Quan, Xingyue and Hao, Jianye and Xu, Hang and Zhang, Li},
    year={2025},
    journal={arXiv preprint arXiv:2508.11952},
}

About

UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •