Instructions to use CompVis/cleandift with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusion Single File
How to use CompVis/cleandift with Diffusion Single File:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
| pipeline_tag: image-feature-extraction | |
| license: mit | |
| library_name: diffusion-single-file | |
| # CleanDIFT Model Card | |
| Diffusion models learn powerful world representations that have proven valuable for tasks like semantic correspondence detection, | |
| depth estimation, semantic segmentation, and classification. | |
| However, diffusion models require noisy input images, which destroys information and introduces the noise level as a hyperparameter that needs to be tuned for each task. | |
| We introduce CleanDIFT, a novel method to extract noise-free, timestep-independent features by enabling diffusion models to work directly with clean input images. | |
| The approach is efficient, training on a single GPU in just 30 minutes. We publish these models alongside our paper ["CleanDIFT: Diffusion Features without Noise"](https://compvis.github.io/cleandift/). | |
| We provide checkpoints for Stable Diffusion 1.5 and Stable Diffusion 2.1. | |
| ## Usage | |
| For detailed examples on how to extract features with CleanDIFT and how to use them for downstream tasks, please refer to the notebooks provided [here](https://github.com/CompVis/CleanDIFT/tree/main/notebooks). | |
| Our checkpoints are fully compatible with the `diffusers` library. | |
| If you already have a pipeline using SD 1.5 or SD 2.1 from `diffusers`, you can simply replace the U-Net state dict: | |
| ```python | |
| from diffusers import UNet2DConditionModel | |
| from huggingface_hub import hf_hub_download | |
| unet = UNet2DConditionModel.from_pretrained("stabilityai/stable-diffusion-2-1", subfolder="unet") | |
| ckpt_pth = hf_hub_download(repo_id="CompVis/cleandift", filename="cleandift_sd21_unet.safetensors") | |
| state_dict = load_file(ckpt_pth) | |
| unet.load_state_dict(state_dict, strict=True) | |
| ``` | |
| ## Citation | |
| ```bibtex | |
| @misc{stracke2024cleandiftdiffusionfeaturesnoise, | |
| title={CleanDIFT: Diffusion Features without Noise}, | |
| author={Nick Stracke and Stefan Andreas Baumann and Kolja Bauer and Frank Fundel and Björn Ommer}, | |
| year={2024}, | |
| eprint={2412.03439}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CV}, | |
| url={https://arxiv.org/abs/2412.03439}, | |
| } | |
| ``` |