EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

EmbodiedGen is a generative engine for creating diverse, interactive 3D worlds composed of high-quality 3D assets (mesh & 3DGS) with plausible physics. It leverages generative AI to address the challenges of generalization in embodied-intelligence research. It is composed of six key modules: Image-to-3D, Text-to-3D, Texture Generation, Articulated Object Generation, Scene Generation and Layout Generation.

✨ Table of Contents of EmbodiedGen

🖼️ Image-to-3D
📝 Text-to-3D
🎨 Texture Generation
🌍 3D Scene Generation
⚙️ Articulated Object Generation
🏞️ Layout(Interactive 3D Worlds) Generation

🚀 Quick Start

✅ Setup Environment

git clone https://github.com/HorizonRobotics/EmbodiedGen.git
cd EmbodiedGen
git checkout v0.1.0
git submodule update --init --recursive --progress
conda create -n embodiedgen python=3.10.13 -y
conda activate embodiedgen
bash install.sh

✅ Setup GPT Agent

Update the API key in file: embodied_gen/utils/gpt_config.yaml.

You can choose between two backends for the GPT agent:

gpt-4o (Recommended) – Use this if you have access to Azure OpenAI.
qwen2.5-vl – An alternative with free usage via OpenRouter (50 free requests per day). Apply for a free OpenRouter key and set api_key in embodied_gen/utils/gpt_config.yaml.

🖼️ Image-to-3D

Generate a physically plausible 3D asset (URDF) from a single input image, offering high-quality support for digital twin systems.

☁️ Service

Run the image-to-3D generation service locally. Models are downloaded automatically on the first run, so please be patient.

# Run in foreground
python apps/image_to_3d.py
# Or run in the background
CUDA_VISIBLE_DEVICES=0 nohup python apps/image_to_3d.py > /dev/null 2>&1 &

⚡ API

Generate physically plausible 3D assets from image input via the command-line API.

python3 embodied_gen/scripts/imageto3d.py \
    --image_path apps/assets/example_image/sample_04.jpg apps/assets/example_image/sample_19.jpg \
    --output_root outputs/imageto3d

# See result(.urdf/mesh.obj/mesh.glb/gs.ply) in ${output_root}/sample_xx/result
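To inspect the generated files programmatically, something like the following sketch may help. It assumes the layout given in the comment above (`<output_root>/<sample>/result`) and the file types listed there. The helper name `collect_results` is hypothetical and is not part of EmbodiedGen.

```python
from pathlib import Path

# File types named in the README comment; adjust if your outputs differ.
RESULT_SUFFIXES = {".urdf", ".obj", ".glb", ".ply"}


def collect_results(output_root: str) -> dict[str, list[Path]]:
    """Map each sample directory name to the result files found under it.

    Hypothetical helper: assumes the layout <output_root>/<sample>/result/...
    """
    root = Path(output_root)
    if not root.is_dir():
        # Nothing has been generated yet (or the path is wrong).
        return {}
    found: dict[str, list[Path]] = {}
    for result_dir in sorted(root.glob("*/result")):
        # Search recursively, since files may sit in subfolders of result/.
        files = sorted(
            p for p in result_dir.rglob("*")
            if p.is_file() and p.suffix.lower() in RESULT_SUFFIXES
        )
        found[result_dir.parent.name] = files
    return found


if __name__ == "__main__":
    for sample, files in collect_results("outputs/imageto3d").items():
        print(sample, [f.name for f in files])
```

An empty list for a sample suggests that its generation did not finish or that the files were written elsewhere.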

📝 Text-to-3D

Create 3D assets from text descriptions for a wide range of geometry and styles.

☁️ Service

Deploy the text-to-3D generation service locally.

Text-to-image generation is based on the Kolors model and supports both Chinese and English prompts. Models are downloaded automatically on the first run (see download_kolors_weights), so please be patient.

python apps/text_to_3d.py

⚡ API

Text-to-image generation is based on the Kolors model.

bash embodied_gen/scripts/textto3d.sh \
    --prompts "small bronze figurine of a lion" "A globe with wooden base and latitude and longitude lines" "橙色电动手钻，有磨损细节" \
    --output_root outputs/textto3d
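When prompts come from a list or a file, the same call can be assembled from Python. The sketch below only uses the script path and the two flags shown above (`--prompts`, `--output_root`). The function names `build_textto3d_cmd` and `run_textto3d` are hypothetical wrappers, not part of EmbodiedGen, and the command is assumed to run from the repository root.

```python
import shlex
import subprocess


def build_textto3d_cmd(prompts: list[str], output_root: str) -> list[str]:
    """Assemble the argument list for the textto3d.sh call shown above."""
    if not prompts:
        raise ValueError("at least one prompt is required")
    return [
        "bash", "embodied_gen/scripts/textto3d.sh",
        "--prompts", *prompts,
        "--output_root", output_root,
    ]


def run_textto3d(prompts: list[str], output_root: str) -> None:
    """Run the script from the repository root; raises if it exits non-zero."""
    subprocess.run(build_textto3d_cmd(prompts, output_root), check=True)


if __name__ == "__main__":
    cmd = build_textto3d_cmd(
        ["small bronze figurine of a lion", "橙色电动手钻，有磨损细节"],
        "outputs/textto3d",
    )
    # Print the shell-quoted command for review instead of running it.
    print(shlex.join(cmd))
```

Passing the arguments as a list avoids shell-quoting problems with prompts that contain spaces or non-ASCII characters.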

🎨 Texture Generation

Generate visually rich textures for 3D meshes.

☁️ Service

Run the texture generation service locally. Models are downloaded automatically on the first run (see download_kolors_weights and geo_cond_mv).

python apps/texture_edit.py

⚡ API

bash embodied_gen/scripts/texture_gen.sh \
    --mesh_path "apps/assets/example_texture/meshes/robot_text.obj" \
    --prompt "举着牌子的写实风格机器人，大眼睛，牌子上写着“Hello”的文字" \
    --output_root "outputs/texture_gen/" \
    --uuid "robot_text"

🌍 3D Scene Generation

🚧 Coming Soon

⚙️ Articulated Object Generation

🚧 Coming Soon

🏞️ Layout(Interactive 3D Worlds) Generation

💬 Generate Layout from task description

🚧 Coming Soon

🖼️ Generate Layout from image

🚧 Coming Soon

📚 Citation

If you use EmbodiedGen in your research or projects, please cite:

@misc{wang2025embodiedgengenerative3dworld,
      title={EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence},
      author={Xinjie Wang and Liu Liu and Yu Cao and Ruiqi Wu and Wenkang Qin and Dehui Wang and Wei Sui and Zhizhong Su},
      year={2025},
      eprint={2506.10600},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2506.10600},
}

🙌 Acknowledgement

⚖️ License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.
