Skywork / Matrix-3D

huggingface.co
Total runs: 0
24-hour runs: 0
7-day runs: 0
30-day runs: 0
Model's Last Updated: September 02 2025
image-to-3d

Introduction of Matrix-3D

Model Details of Matrix-3D

Matrix3D: Omnidirectional Explorable
3D World Generation

logo
🌟 Introduction

Matrix-3D utilizes panoramic representation for wide-coverage omnidirectional explorable 3D world generation that combines conditional video generation and panoramic 3D reconstruction.

  • Large-Scale Scene Generation : Compared to existing scene generation approaches, Matrix-3D supports the generation of broader, more expansive scenes that allow for complete 360-degree free exploration.
  • High Controllability : Matrix-3D supports both text and image inputs, with customizable trajectories and infinite extensibility.
  • Strong Generalization Capability : Built upon self-developed 3D data and video model priors, Matrix-3D enables the generation of diverse and high-quality 3D scenes.
  • Speed-Quality Balance : Two types of panoramic 3D reconstruction methods are proposed to achieve rapid and detailed 3D reconstruction respectively.
📦 Installation

Clone the repo and create the environment:

# Clone the repository 
git clone --recursive https://github.com/SkyworkAI/Matrix-3D.git
cd Matrix-3D

# Create a new conda environment
conda create -n matrix3d python=3.10
conda activate matrix3d

# Install torch and torchvision (with GPU support, we use CUDA 12.4 Version)
pip3 install torch==2.7.1 torchvision==0.22.1

#Run installation script
chmod +x install.sh
./install.sh
💫 Pretrained Models
Model Name Description Download
Text2PanoImage text2panoimage_lora.safetensors Link
PanoVideoGen-480p pano_video_gen_480p.ckpt Link
PanoVideoGen-720p pano_video_gen_720p.bin Link
PanoLRM-480p pano_lrm_480p.pt Link
🎮 Usage
  • 🔧 Checkpoint Download
python scripts/download_checkpoints.py
  • 🔥 One-command 3D World Generation

Now you can generate a 3D world by just running a single command:

./generate.sh

Or you can choose to generate a 3D world step by step.

  • 🖼️ Step 1: Text/Image to Panorama Image

You can either generate a panorama image from text prompt:

python code/panoramic_image_generation.py \
    --mode=t2p \
    --prompt="a vibrant, industrial-style scene, featuring a large warehouse with exposed brick walls, metal beams, and scattered barrels and crates, set against a backdrop of modern skyscrapers and lush greenery" \
    --output_path="./output/example1"

Or from an input image:

python code/panoramic_image_generation.py \
    --mode=i2p \
    --input_image_path="./data/image1.jpg" \
    --output_path="./output/example1"

The generated panorama image will be saved in the output/example1 folder.

  • 📹 Step 2: Generate Panoramic Video
VISIBLE_GPU_NUM=1
torchrun --nproc_per_node ${VISIBLE_GPU_NUM} code/panoramic_image_to_video.py \
  --inout_dir="./output/example1"  \
  --resolution=720

You can switch the resolution option in [480,720] to perform video generation under 960 × 480 resolution or 1440 × 720 resolution. The generated panoramic tour video will be saved in output/example1/pano_video.mp4 . It will take about an hour to generate a 720p video on an A800 GPU. You can accelerate this process with multi-gpu inference by setting VISIBLE_GPU_NUM.

  • 🏡 Step 3: Extract 3D Scene

Here we provide two options, one is high-quality optimization-based 3D scene reconstruction and another is efficient feed-forward 3D scene reconstruction.

To perform optimization-based reconstruction, run

 python code/panoramic_video_to_3DScene.py \
    --inout_dir="./output/example1" \
    --resolution=720

Modify the resolution option as the value used in panoramic video generation. The extracted 3D scene in .ply format will be saved in output/example1/generated_3dgs_opt.ply .

To perform feed-forward reconstruction, run

python code/panoramic_video_480p_to_3DScene_lrm.py \
--video_path="./data/case1/sample_video.mp4" \
--pose_path='./data/case1/sample_cam.json' \
--out_path='./output/example2'

The extracted 3D scene in .ply format and rendered perspective videos will be saved output/example2 . If you want to reconstruct 3D scene with another panorama videos and conditioned camera pose, just replace the video_path and pose_path accordingly.

🎬 Create Your Own

We provide three movement modes: Straight Travel , S-curve Travel , and Forward on the Right , which can be configured in --movement_mode in code/panoramic_image_to_video.py .

You can also provide your own camera trajectory in .json format and use it for video generation.

VISIBLE_GPU_NUM=1
torchrun --nproc_per_node ${VISIBLE_GPU_NUM} code/panoramic_image_to_video.py \
  --inout_dir="./output/example1"  \
  --resolution=720
  --json_path YOUR_TRAJECTORY_FILE.json

All camera matrices used in our project are world to camera matrices in opencv format. Please refer to the sample file ./data/test_cameras/test_cam_front.json , and use code/generate_example_camera.py to generate your own camera trajectory.

📚 Citation

If you find this project useful, please consider citing it as follows:

@article{yang2025matrix3d,
  title     = {Matrix-3D: Omnidirectional Explorable 3D World Generation},
  author    = {Zhongqi Yang and Wenhang Ge and Yuqi Li and Jiaqi Chen and Haoyuan Li and Mengyin An and Fei Kang and Hua Xue and Baixin Xu and Yuyang Yin and Eric Li and Yang Liu and Yikai Wang and Hao-Xiang Guo and Yahui Zhou},
  journal   = {arXiv preprint arXiv:2508.08086},
  year      = {2025}
}

🤝 Acknowledgements

This project is built on top of the follows, please consider citing them if you find them useful:

📧 Contact

If you have any questions, please feel free post an issue.

Runs of Skywork Matrix-3D on huggingface.co

0
Total runs
0
24-hour runs
0
3-day runs
0
7-day runs
0
30-day runs

More Information About Matrix-3D huggingface.co Model

More Matrix-3D license Visit here:

https://choosealicense.com/licenses/mit

Matrix-3D huggingface.co

Matrix-3D huggingface.co is an AI model on huggingface.co that provides Matrix-3D's model effect (), which can be used instantly with this Skywork Matrix-3D model. huggingface.co supports a free trial of the Matrix-3D model, and also provides paid use of the Matrix-3D. Support call Matrix-3D model through api, including Node.js, Python, http.

Matrix-3D huggingface.co Url

https://huggingface.co/Skywork/Matrix-3D

Skywork Matrix-3D online free

Matrix-3D huggingface.co is an online trial and call api platform, which integrates Matrix-3D's modeling effects, including api services, and provides a free online trial of Matrix-3D, you can try Matrix-3D online for free by clicking the link below.

Skywork Matrix-3D online free url in huggingface.co:

https://huggingface.co/Skywork/Matrix-3D

Matrix-3D install

Matrix-3D is an open source model from GitHub that offers a free installation service, and any user can find Matrix-3D on GitHub to install. At the same time, huggingface.co provides the effect of Matrix-3D install, users can directly use Matrix-3D installed effect in huggingface.co for debugging and trial. It also supports api for free installation.

Matrix-3D install url in huggingface.co:

https://huggingface.co/Skywork/Matrix-3D

Url of Matrix-3D

Matrix-3D huggingface.co Url

Provider of Matrix-3D huggingface.co

Skywork
ORGANIZATIONS

Other API from Skywork

huggingface.co

Total runs: 86
Run Growth: 54
Growth Rate: 62.79%
Updated:April 08 2025
huggingface.co

Total runs: 59
Run Growth: 1
Growth Rate: 1.69%
Updated:January 28 2026
huggingface.co

Total runs: 45
Run Growth: -16
Growth Rate: -35.56%
Updated:June 13 2025
huggingface.co

Total runs: 35
Run Growth: -5
Growth Rate: -14.29%
Updated:March 04 2025
huggingface.co

Total runs: 21
Run Growth: -43
Growth Rate: -204.76%
Updated:June 26 2025
huggingface.co

Total runs: 7
Run Growth: -2
Growth Rate: -25.00%
Updated:February 07 2026
huggingface.co

Total runs: 0
Run Growth: 0
Growth Rate: 0.00%
Updated:December 03 2025