# EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence
[](https://horizonrobotics.github.io/robot_lab/embodied_gen/index.html)
[](#)
[](https://www.youtube.com/watch?v=SnHhzHeb_aI)
[](https://huggingface.co/spaces/HorizonRobotics/EmbodiedGen-Image-to-3D)
[](https://huggingface.co/spaces/HorizonRobotics/EmbodiedGen-Text-to-3D)
[](https://huggingface.co/spaces/HorizonRobotics/EmbodiedGen-Texture-Gen)
**EmbodiedGen** generates interactive 3D worlds with real-world scale and physical realism at low cost.
---
## โจ Table of Contents of EmbodiedGen
- [๐ผ๏ธ Image-to-3D](#image-to-3d)
- [๐ Text-to-3D](#text-to-3d)
- [๐จ Texture Generation](#texture-generation)
- [๐ 3D Scene Generation](#3d-scene-generation)
- [โ๏ธ Articulated Object Generation](#articulated-object-generation)
- [๐๏ธ Layout Generation](#layout-generation)
## ๐ Quick Start
```sh
git clone https://github.com/HorizonRobotics/EmbodiedGen
cd EmbodiedGen
conda create -n embodiedgen python=3.10.13 -y
conda activate embodiedgen
pip install -r requirements.txt --use-deprecated=legacy-resolver
pip install -e .
```
---
## ๐ข Setup GPT Agent
Update the API key in file: `embodied_gen/utils/gpt_config.yaml`.
You can choose between two backends for the GPT agent:
- **`gpt-4o`** (Recommended) โ Use this if you have access to **Azure OpenAI**.
- **`qwen2.5-vl`** โ An open alternative with free usage via [OpenRouter](https://openrouter.ai/settings/keys) (50 free requests per day)
---
๐ผ๏ธ Image-to-3D
[](https://huggingface.co/spaces/HorizonRobotics/EmbodiedGen-Image-to-3D) Generate physically plausible 3D asset from input image.
### Local Service
Run the image-to-3D generation service locally. The first run will download required models.
```sh
# Run in foreground
python apps/image_to_3d.py
# Or run in the background
CUDA_VISIBLE_DEVICES=0 nohup python apps/image_to_3d.py > /dev/null 2>&1 &
```
### Local API
Generate a 3D model from an image using the command-line API.
```sh
python3 embodied_gen/scripts/imageto3d.py \
--image_path apps/assets/example_image/sample_04.jpg apps/assets/example_image/sample_19.jpg \
--output_root outputs/imageto3d/
# See result(.urdf/mesh.obj/mesh.glb/gs.ply) in ${output_root}/sample_xx/result
```
---
๐ Text-to-3D
[](https://huggingface.co/spaces/HorizonRobotics/EmbodiedGen-Text-to-3D) Create 3D assets from text descriptions for a wide range of geometry and styles.
### Local Service
Run the text-to-3D generation service locally.
```sh
python apps/text_to_3d.py
```
### Local API
```sh
bash embodied_gen/scripts/textto3d.sh \
--prompts "small bronze figurine of a lion" "ๅธฆๆจ่ดจๅบๅบง๏ผๅ
ทๆ็ป็บฌ็บฟ็ๅฐ็ไปช" "ๆฉ่ฒ็ตๅจๆ้ป๏ผๆ็ฃจๆ็ป่" \
--output_root outputs/textto3d/
```
---
๐จ Texture Generation
[](https://huggingface.co/spaces/HorizonRobotics/EmbodiedGen-Texture-Gen) Generate visually rich textures for 3D mesh.
### Local Service
Run the texture generation service locally.
```sh
python apps/texture_edit.py
```
### Local API
Generate textures for a 3D mesh using a text prompt.
```sh
bash embodied_gen/scripts/texture_gen.sh \
--mesh_path "apps/assets/example_texture/meshes/robot_text.obj" \
--prompt "ไธพ็็ๅญ็็บข่ฒๅๅฎ้ฃๆ ผๆบๅจไบบ๏ผ็ๅญไธๅ็โHelloโ" \
--output_root "outputs/texture_gen/" \
--uuid "robot_text"
```
---
๐ 3D Scene Generation
๐ง *Coming Soon*
---
โ๏ธ Articulated Object Generation
๐ง *Coming Soon*
---
๐๏ธ Layout Generation
๐ง *Coming Soon*
---
## ๐ Citation
If you use EmbodiedGen in your research or projects, please cite:
```bibtex
Coming Soon
```
---
## ๐ Acknowledgement
EmbodiedGen builds upon the following amazing projects and models:
- ๐ [Trellis](https://github.com/microsoft/TRELLIS)
- ๐ [Hunyuan-Delight](https://huggingface.co/tencent/Hunyuan3D-2/tree/main/hunyuan3d-delight-v2-0)
- ๐ [Segment Anything Model](https://github.com/facebookresearch/segment-anything)
- ๐ [Rembg: a tool to remove images background](https://github.com/danielgatis/rembg)
- ๐ [RMBG-1.4: BRIA Background Removal](https://huggingface.co/briaai/RMBG-1.4)
- ๐ [stable-diffusion-x4-upscaler](https://huggingface.co/stabilityai/stable-diffusion-x4-upscaler)
- ๐ [Real-ESRGAN](https://github.com/xinntao/Real-ESRGAN)
- ๐ [Kolors](https://github.com/Kwai-Kolors/Kolors)
- ๐ [ChatGLM3](https://github.com/THUDM/ChatGLM3)
- ๐ [Aesthetic Score Model](http://captions.christoph-schuhmann.de/aesthetic_viz_laion_sac+logos+ava1-l14-linearMSE-en-2.37B.html)
- ๐ [Pano2Room](https://github.com/TrickyGo/Pano2Room)
- ๐ [Diffusion360](https://github.com/ArcherFMY/SD-T2I-360PanoImage)
- ๐ [kaolin](https://github.com/NVIDIAGameWorks/kaolin)
- ๐ [diffusers](https://github.com/huggingface/diffusers)
- ๐ GPT: QWEN2.5VL, GPT4o
---
## โ๏ธ License
This project is licensed under the [Apache License 2.0](LICENSE). See the `LICENSE` file for details.