🧃 SAFT:
Shape and Appearance of Fabrics from Template
via Differentiable Physical Simulations from Monocular Video
This repository contains the source code for the paper "SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video".
The reconstruction of three-dimensional dynamic scenes is a well-established yet challenging task within the domain of computer vision.
In this paper, we propose a novel approach that combines the domains of 3D geometry reconstruction and appearance estimation for physically based rendering and present a system that is able to perform both tasks for fabrics, utilizing only a single monocular RGB video sequence as input.
In order to obtain realistic and high-quality deformations and renderings, a physical simulation of the cloth geometry and differentiable rendering are employed.
We further introduce two novel regularization terms for the 3D reconstruction task that improve the plausibility of the reconstruction by addressing the depth ambiguity problem in monocular video.
In comparison with the most recent methods in the field, we have reduced the error in the 3D reconstruction by a factor of
To install the necessary dependencies, you can use the provided requirements.sh.
Note that this is a shell file because not all packages can be installed at once.
Moreover, the file is written for CUDA 12.4.
Please adjust the PyTorch installation according to your CUDA version, or install CUDA 12.4.
```
python3 -m venv saft
source saft/bin/activate
bash requirements.sh
```
This should work for Linux systems.
On Windows you may have to install PyTorch3D manually.
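After installation, you can quickly verify that the installed PyTorch build matches your CUDA setup; this is a generic check, not specific to this repository:
```python
# Generic sanity check: report the PyTorch version, the CUDA version it
# was built against, and whether a GPU is visible.
import torch

print(torch.__version__, torch.version.cuda, torch.cuda.is_available())
```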
You can clone nvdiffrecmc using the other script:
```
bash nvdiffrecmc.sh
```
This will download the repository and automatically apply a necessary patch to the mipmapping procedure.
To run the code, you need to provide RGB and mask images of a video sequence together with a polygon mesh of the initial cloth geometry in the first video frame. The mesh has to contain uv-coordinates, and its polygon edges encode the vertex connections that are used during the physical simulation. Although we automatically triangulate the mesh for rendering, the resulting triangles may be of poor quality; we therefore recommend providing quad meshes or triangle meshes directly.
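As a quick sanity check on a template mesh, the sketch below inspects an OBJ file for uv-coordinates and face degrees; the path is a placeholder, not a file shipped with this repository:
```python
# Hypothetical sanity check for a template mesh in OBJ format: counts the
# uv-coordinates ("vt" lines) and reports the face degrees (4 = quads,
# 3 = triangles). The file path is illustrative only.
def inspect_template(path="data/template.obj"):
    n_uv = 0
    face_degrees = set()
    with open(path) as f:
        for line in f:
            if line.startswith("vt "):
                n_uv += 1
            elif line.startswith("f "):
                face_degrees.add(len(line.split()) - 1)
    assert n_uv > 0, "mesh contains no uv-coordinates"
    print(f"{n_uv} uv-coordinates, face degrees: {sorted(face_degrees)}")

inspect_template()
```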
We evaluate the method on the ϕ-SfT dataset.
You can get the already prepared data from here.
Extract all files into the main directory; you will then be able to run the code.
The scene configuration is provided by JSON files in ./data/scenes/.
You can run all real-world scenes at once using the command
```
python ./code/python/main.py R1 R2 R3 R4 R5 R6 R7 R8 R9
```
Please note that the reconstruction quality of some scenes may vary significantly due to non-deterministic GPU computations.
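If you prefer to run the scenes one after another, for example to inspect intermediate results between runs, a simple loop works as well; this assumes main.py accepts one or more scene names as arguments, as the command above shows:
```python
# Run the nine real-world scenes sequentially instead of in a single call.
import subprocess

for scene in [f"R{i}" for i in range(1, 10)]:
    subprocess.run(["python", "./code/python/main.py", scene], check=True)
```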
The results of the paper can be loaded by changing the override_data parameter (line 74 of the JSON files) to true.
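Instead of editing each file by hand, you can flip the flag for all scenes with a short script; this assumes override_data is a top-level key in each JSON file, so adjust the key path if it is nested differently:
```python
# Hypothetical helper: set override_data to true in every scene file so the
# results of the paper are loaded. Assumes a top-level "override_data" key.
import json
import pathlib

for scene_file in pathlib.Path("data/scenes").glob("*.json"):
    config = json.loads(scene_file.read_text())
    config["override_data"] = True
    scene_file.write_text(json.dumps(config, indent=4))
```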
The results are saved at multiple epochs in the results directory.
You can evaluate the reconstruction metrics by running
```
python ./code/python/metrics.py
```
It is configured to load the saved data of the last epoch for each scene R1 to R9.
Please change the loaded scene and other options in metrics.py according to your needs.
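For reference, one common measure of 3D reconstruction error is the symmetric Chamfer distance between point clouds; the sketch below only illustrates such a measure and is not the implementation used in metrics.py:
```python
# Minimal symmetric Chamfer distance between two point clouds (N x 3 and
# M x 3 numpy arrays), a common 3D reconstruction error measure.
import numpy as np
from scipy.spatial import cKDTree

def chamfer_distance(a, b):
    d_ab = cKDTree(b).query(a)[0]  # nearest-neighbor distances a -> b
    d_ba = cKDTree(a).query(b)[0]  # nearest-neighbor distances b -> a
    return d_ab.mean() + d_ba.mean()
```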
```
@inproceedings{stotko2025saft,
    title     = {{SAFT: Shape and Appearance of Fabrics from Template via Differentiable Physical Simulations from Monocular Video}},
    author    = {Stotko, David and Klein, Reinhard},
    booktitle = {{International Conference on Computer Vision (ICCV)}},
    year      = {2025},
}
```
This software is provided under the MIT license. See LICENSE for more information.
This work has been funded by the DFG project KL 1142/11-2 (DFG Research Unit FOR 2535 Anticipating Human Behavior), by the Federal Ministry of Education and Research of Germany and the state of North Rhine-Westphalia as part of the Lamarr-Institute for Machine Learning and Artificial Intelligence and the InVirtuo 4.0 project, and additionally by the Federal Ministry of Education and Research under grant no. 01IS22094A WEST-AI.
