Text-to-Video

rCM: Score-Regularized Continuous-Time Consistency Model

Causal-rCM: Teacher-Forcing meets Self-Forcing in Autoregressive Diffusion Distillation for Streaming Video Generation and Interactive World Models

Paper | Website | Code

This repo holds converted Wan official checkpoints in (Causal-)rCM/TurboDiffusion style.

Specifically, (Causal-)rCM equivalently replaces the Conv3d layer in the original Wan with a Linear layer for patch embedding, facilitating further optimization. The layer weight is directly reshaped without value change, e.g., from shape [5120, 16, 1, 2, 2] (Conv3d) to shape [5120, 64] (Linear).

Citation

@article{zheng2025rcm,
  title={Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency},
  author={Zheng, Kaiwen and Wang, Yuji and Ma, Qianli and Chen, Huayu and Zhang, Jintao and Balaji, Yogesh and Chen, Jianfei and Liu, Ming-Yu and Zhu, Jun and Zhang, Qinsheng},
  journal={arXiv preprint arXiv:2510.08431},
  year={2025}
}
@article{zheng2026causal,
  title={Causal-rCM: Teacher-Forcing meets Self-Forcing in Autoregressive Diffusion Distillation for Streaming Video Generation and Interactive World Models},
  author={Zheng, Kaiwen and He, Guande and Zhao, Min and Zhu, Hongzhou and Zhang, Jintao and Chen, Huayu and Chen, Jianfei and Lin, Chen-Hsuan and Liu, Ming-Yu and Zhu, Jun and Ma, Qianli},
  journal={TODO},
  year={2026}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for worstcoder/Wan

Finetuned
(54)
this model

Paper for worstcoder/Wan