PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos

Hanxiao Jiang1,2, Hao-Yu Hsu2, Kaifeng Zhang1, Hsin-Ni Yu2, Shenlong Wang2, Yunzhu Li1

1Columbia University, 2University of Illinois Urbana-Champaign

Overview

This repository contains the official implementation of the PhysTwin framework.


Update

This repository will be actively maintained by the authors, with continuous updates introducing new features to inspire further research.

  • [25.7.22] Remote Control Feature & Bug Fix: Fixed a deprojection error in the data processing pipeline. Added support for remote control—previously, the interactive playground only responded to physical keyboard input; it now accepts virtual keyboard signals from remote devices as well.

  • [25.4.15] GPU Memory Optimization: Thanks to user feedback and testing, we've further optimized the code to reduce GPU memory usage in the interactive playground—now requiring only about 2GB in total. Previously, LBS initialization consumed a significant amount of GPU memory; it's now offloaded to the CPU and only needs to run once at startup. Everything runs smoothly as a result.

  • [25.4.8] Optimization Speed: Regarding the questions on optimization speed: thanks to NVIDIA Warp, our differentiable Spring-Mass simulator enables first-order optimization in approximately 5 minutes—and even faster with visualizations disabled—significantly outperforming prior work that typically requires hours. The zero-order, sampling-based optimization (CMA-ES) takes around 12 minutes, depending on the number of epochs. These statistics are based on the stuffed animal experiments without self-collision enabled.

  • [25.4.4] Material Visualization: Added an experimental feature to visualize the materials approximated from the underlying spring-mass model. (See below for detailed instructions)

  • [25.4.3] Multiple Objects Demos: Added experimental features for handling collisions among multiple constructed PhysTwins. (See below for detailed instructions)

  • [25.4.3] LBS GPU Memory Fix: Clear intermediate variables to significantly reduce GPU memory usage in the interactive playground. The sloth case now requires only about 4GB in total. (Pull the latest code to apply the fix.)

  • [25.4.1] Force Visualization: Visualize the forces applied to objects after optimization, aiding in force analysis from videos. (See below for detailed instructions)
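
The Spring-Mass simulation underlying these optimizations can be illustrated with a minimal sketch. This is a simplified NumPy stand-in for the repo's differentiable Warp simulator, assuming a single spring and semi-implicit Euler integration; all values are illustrative.

```python
import numpy as np

def spring_forces(x, edges, rest_len, k):
    """Hooke forces on each particle from the given spring edges."""
    f = np.zeros_like(x)
    for (i, j), L0, ki in zip(edges, rest_len, k):
        d = x[j] - x[i]
        dist = np.linalg.norm(d)
        # Force magnitude proportional to extension beyond rest length
        fij = ki * (dist - L0) * (d / dist)
        f[i] += fij
        f[j] -= fij
    return f

def step(x, v, edges, rest_len, k, mass, dt, damping=0.99):
    """Semi-implicit Euler: update velocities from forces, then positions."""
    a = spring_forces(x, edges, rest_len, k) / mass
    v = damping * (v + dt * a)
    x = x + dt * v
    return x, v

# Two particles joined by one spring, stretched past its rest length of 1
x = np.array([[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
v = np.zeros_like(x)
for _ in range(100):
    x, v = step(x, v, edges=[(0, 1)], rest_len=[1.0], k=[10.0], mass=1.0, dt=0.01)
gap = np.linalg.norm(x[1] - x[0])
```

With damping, the stretched spring relaxes toward its rest length; it is parameters like the stiffness and damping here that the optimizers fit against the observed videos.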

Long-Term Plans

  • Batch Inferencing Support: Integrate batch inferencing into the underlying SpringMass code, enabling faster rollouts and efficient data generation.

Setup

🐧Linux Setup

# Here we use cuda-12.1
export PATH={YOUR_DIR}/cuda/cuda-12.1/bin:$PATH
export LD_LIBRARY_PATH={YOUR_DIR}/cuda/cuda-12.1/lib64:$LD_LIBRARY_PATH
# Create conda environment
conda create -y -n phystwin python=3.10
conda activate phystwin

# Install the packages
# If you only want to explore the interactive playground, you can skip installing Trellis, Grounding-SAM-2, RealSense, and SDXL.
bash ./env_install/env_install.sh

# Download the necessary pretrained models for data processing
bash ./env_install/download_pretrained_models.sh

🪟Windows Setup

Thanks to @GuangyanCai's contributions, we now also have a Windows setup in the windows_setup branch.

🐳Docker Setup

Thanks to @epiception's contributions, we now have Docker support as well.

export DOCKER_USERNAME="your_alias" # optional; defaults to $(whoami)
chmod +x ./docker_scripts/build.sh
./docker_scripts/build.sh

# The script accepts architecture version from https://developer.nvidia.com/cuda-gpus as an additional argument
./docker_scripts/build.sh 8.9+PTX # For NVIDIA RTX 40 series GPUs

🐧Linux Setup (RTX 5090 + CUDA 12.8 + Python 3.10 Specific)

# Here we use CUDA 12.8
export PATH={YOUR_DIR}/cuda/bin:$PATH
export LD_LIBRARY_PATH={YOUR_DIR}/cuda/lib64:$LD_LIBRARY_PATH
export CUDA_HOME={YOUR_DIR}/cuda

# Create conda environment
conda create -y -n phystwin python=3.10
conda activate phystwin

# Open gaussian_splatting/submodules/diff-gaussian-rasterization/cuda_rasterizer/rasterizer_impl.h and add an include directive for cstdint
# Forcefully create a symbolic link from the system libstdc++.so.6 into the conda environment, e.g. `ln -sf /usr/lib/x86_64-linux-gnu/libstdc++.so.6 {CONDA_PATH}/envs/phystwin/bin/../lib/libstdc++.so.6`

# Install the packages (if you only want to explore the interactive playground, you can skip installing TRELLIS, Grounded-SAM-2, Grounding-DINO, RealSense, and SDXL)
bash ./env_install/5090_env_install.sh

# Download the necessary pretrained models for data processing
bash ./env_install/download_pretrained_models.sh

Download the PhysTwin Data

Download the original data, processed data, and results into the project's root folder. (The following sections will explain how to process the raw observations and obtain the training results.)

  • data: the original data for each case and the processed data for a quick run. The available case_name values are listed under the different_types folder.
  • experiments_optimization: results of the first-stage zero-order optimization.
  • experiments: results of the second-stage first-order optimization.
  • gaussian_output: results of the static Gaussian appearance training.
  • (optional) additional_data: data for extra clothing demos not included in the original paper.

Play with the Interactive Playground

Use the previously constructed PhysTwin to explore the interactive playground. Users can interact with the pre-built PhysTwin using the keyboard. The next section provides a detailed guide on how to construct the PhysTwin from the original data.


Run the interactive playground with our different cases. (The first launch takes some time; expect about 37 FPS on an RTX 4090 for the sloth case.)

python interactive_playground.py \
(--inv_ctrl) \
--n_ctrl_parts [1 or 2] \
--case_name [case_name]

# Examples of usage:
python interactive_playground.py --n_ctrl_parts 2 --case_name double_stretch_sloth
python interactive_playground.py --inv_ctrl --n_ctrl_parts 2 --case_name double_lift_cloth_3

or in Docker

./docker_scripts/run.sh /path/to/data \
                        /path/to/experiments \
                        /path/to/experiments_optimization \
                        /path/to/gaussian_output \
# inside container
conda activate phystwin_env
python interactive_playground.py --inv_ctrl --n_ctrl_parts 2 --case_name double_lift_cloth_3

Options:

  • --inv_ctrl: invert the control direction
  • --n_ctrl_parts: number of control parts (single: 1, double: 2)
  • --case_name: name of the PhysTwin case
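
These flags could be parsed with a minimal argparse setup like the sketch below; this is illustrative, not the repo's actual argument handling.

```python
import argparse

# Hypothetical CLI mirroring the playground's documented flags
parser = argparse.ArgumentParser(description="PhysTwin interactive playground")
parser.add_argument("--case_name", required=True,
                    help="name of the PhysTwin case")
parser.add_argument("--n_ctrl_parts", type=int, default=1, choices=[1, 2],
                    help="number of control parts (single: 1, double: 2)")
parser.add_argument("--inv_ctrl", action="store_true",
                    help="invert the control direction")

# Example invocation equivalent to the double_stretch_sloth command above
args = parser.parse_args(["--case_name", "double_stretch_sloth",
                          "--n_ctrl_parts", "2"])
```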

Train the PhysTwin with the data

Use the processed data to train the PhysTwin. The steps below produce the experiments_optimization, experiments, and gaussian_output results described above (the scripts can be adjusted to train on only a subset of cases). After this step, you have a PhysTwin ready for the interactive playground.

# Zero-order Optimization
python script_optimize.py

# First-order Optimization
python script_train.py

# Inference with the constructed models
python script_inference.py

# Train the Gaussian with the first-frame data
bash gs_run.sh
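
The zero-order stage is sampling-based (CMA-ES in the repo). A generic cross-entropy-style sketch of this kind of optimization, with a hypothetical quadratic loss standing in for the simulation objective:

```python
import numpy as np

# "Unknown" optimal physical parameters the sampler should recover
target = np.array([3.0, 0.5])

def loss(params):
    # Hypothetical stand-in for the simulation-vs-video loss
    return float(np.sum((params - target) ** 2))

rng = np.random.default_rng(0)
mean, sigma = np.zeros(2), 1.0
for epoch in range(100):
    samples = mean + sigma * rng.standard_normal((32, 2))  # sample candidates
    scores = np.array([loss(s) for s in samples])
    elite = samples[np.argsort(scores)[:8]]  # keep the 8 best candidates
    mean = elite.mean(axis=0)                # move the distribution toward them
    sigma *= 0.95                            # shrink the search radius
```

Because the loss is only evaluated, never differentiated, this kind of search works even when gradients are unavailable; the first-order stage then refines the result using the differentiable simulator.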

Evaluate the performance of the constructed PhysTwin

To evaluate the constructed PhysTwin, we need to render the images from the original viewpoint (using logic similar to the interactive playground).

# Use LBS to render the dynamic videos (The final videos in ./gaussian_output_dynamic folder)
bash gs_run_simulate.sh
python export_render_eval_data.py
# Get the quantitative results
bash evaluate.sh

# Get the qualitative results
bash gs_run_simulate_white.sh
python visualize_render_results.py
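
The LBS (linear blend skinning) step used for rendering can be sketched as follows; this is a generic example with made-up weights and transforms, not the repo's implementation:

```python
import numpy as np

def lbs(vertices, weights, transforms):
    """Deform vertices as v' = sum_i w_i * (R_i @ v + t_i)."""
    out = np.zeros_like(vertices)
    for i, (R, t) in enumerate(transforms):
        out += weights[:, i:i + 1] * (vertices @ R.T + t)
    return out

verts = np.array([[0.0, 0.0, 0.0],
                  [1.0, 0.0, 0.0],
                  [0.5, 0.0, 0.0]])
# Per-vertex weights for two control nodes; each row sums to 1
w = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.5, 0.5]])
I = np.eye(3)
# Node 0 stays put; node 1 translates by +1 in z
deformed = lbs(verts, w, [(I, np.zeros(3)), (I, np.array([0.0, 0.0, 1.0]))])
```

The middle vertex, weighted half-and-half, moves half the distance, which is how sparse control-node motion is propagated to the dense Gaussians during rendering.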

Data Processing from Raw Videos

The original data for each case includes only color, depth, calibrate.pkl, and metadata.json. All other data, including the projections, tracking, and shape priors, are produced by the processing steps below. (Note: there is a conflict in the diff-gaussian-rasterization library between Gaussian Splatting and Trellis. For data processing you don't need to install Gaussian Splatting; ignore the last section in env_install.sh.)

# Process the data
python script_process_data.py

# Further get the data for first-frame Gaussian
python export_gaussian_data.py

# Get human mask data for visualization and rendering evaluation
python export_video_human_mask.py
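
Deprojection of the depth maps (the step fixed in the 25.7.22 update) follows the standard pinhole model. A sketch with hypothetical intrinsics; the real fx, fy, cx, cy come from the per-case calibration files:

```python
import numpy as np

def deproject(depth, fx, fy, cx, cy):
    """Lift a depth map to camera-space 3D points via the pinhole model."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel coordinates
    z = depth
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1)  # (H, W, 3)

depth = np.full((4, 4), 2.0)  # tiny depth map: a flat plane 2 m away
pts = deproject(depth, fx=100.0, fy=100.0, cx=2.0, cy=2.0)
```

The pixel at the principal point (cx, cy) deprojects straight down the optical axis; everything else fans out in proportion to its depth.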

Control Force Visualization

Visualize the force applied by the hand to the object as inferred from our PhysTwin model, based solely on video data.

python visualize_force.py \
--n_ctrl_parts [1 or 2] \
--case_name [case_name]

# Examples of usage:
python visualize_force.py --case_name single_push_rope_1 --n_ctrl_parts 1 
python visualize_force.py --case_name single_clift_cloth_1 --n_ctrl_parts 1    
python visualize_force.py --case_name double_stretch_sloth 

The visualization video is saved under the experiments folder.
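
The idea behind the force readout can be sketched in a spring-mass setting: if the hand pins a control point, the applied force must balance the net spring force at that point. A hypothetical one-spring example:

```python
import numpy as np

def net_spring_force(x, pt, neighbors, rest_len, k):
    """Sum of Hooke forces acting on particle `pt` from its springs."""
    f = np.zeros(3)
    for j, L0, ki in zip(neighbors, rest_len, k):
        d = x[j] - x[pt]
        dist = np.linalg.norm(d)
        f += ki * (dist - L0) * d / dist
    return f

# One spring stretched to length 2 with rest length 1 and stiffness 10
x = np.array([[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]])
f_spring = net_spring_force(x, pt=0, neighbors=[1], rest_len=[1.0], k=[10.0])
# To hold the point static, the hand must apply the opposite force
hand_force = -f_spring
```

Since the spring parameters were fitted from video, this yields a force estimate from visual observation alone.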

Material Visualization

Experimental feature to visualize the approximated material from the constructed PhysTwin.

python visualize_material.py \
--case_name [case_name]

# Examples of usage:
python visualize_material.py --case_name double_lift_cloth_1
python visualize_material.py --case_name single_push_rope
python visualize_material.py --case_name double_stretch_sloth
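
One simple way to turn a spring-mass model into a per-vertex material map, sketched here with made-up stiffness values, is to average the stiffness of the springs incident to each vertex and map it to a color:

```python
import numpy as np

def vertex_stiffness(n_verts, edges, k):
    """Average stiffness of the springs incident to each vertex."""
    acc = np.zeros(n_verts)
    cnt = np.zeros(n_verts)
    for (i, j), ki in zip(edges, k):
        acc[i] += ki
        acc[j] += ki
        cnt[i] += 1
        cnt[j] += 1
    return acc / np.maximum(cnt, 1)

edges = [(0, 1), (1, 2)]
k = [10.0, 30.0]  # illustrative spring stiffnesses
s = vertex_stiffness(3, edges, k)
# Normalize to [0, 1] and map soft -> blue, stiff -> red
t = (s - s.min()) / (s.max() - s.min())
colors = np.stack([t, np.zeros_like(t), 1 - t], axis=1)
```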

Multiple Objects Demos

Try the experimental features for handling collisions among the multiple PhysTwins we construct.

# These features live in the 'claw_machine' branch
git pull
git checkout claw_machine

# Play with the examples
python interactive_playground.py --n_ctrl_parts 1 --case_name single_push_rope_1 --n_dup 4
python interactive_playground.py --n_ctrl_parts 2 --case_name double_stretch_sloth --n_dup 2

Follow-up and Potential Collaborations

If you are interested in collaborating or extending this work for your research, feel free to contact us at [email protected].

Citation

If you find this repo useful for your research, please consider citing the paper

@article{jiang2025phystwin,
    title={PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos},
    author={Jiang, Hanxiao and Hsu, Hao-Yu and Zhang, Kaifeng and Yu, Hsin-Ni and Wang, Shenlong and Li, Yunzhu},
    journal={ICCV},
    year={2025}
}

About

[ICCV 2025] PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos
