FastHunyuan / README.md
PY007's picture
Update README.md
ca7b356 verified
|
raw
history blame
2.04 kB
metadata
pipeline_tag: text-to-video
license: other
license_name: tencent-hunyuan-community
license_link: LICENSE

FastHunyuan Model Card

Model Details

FastHunyuan is an accelerated HunyuanVideo model. It can sample high quality videos with 6 diffusion steps. That brings around 8X speed up compared to the original HunyuanVideo with 50 steps.

Usage

  • Clone Fastvideo repository and follow the inference instructions in the README.
  • Alternatively, you can inference FastHunyuan using the official Hunyuan Video repository by setting the shift to 17, steps to 6, resolution to 720X1280X125, and cfg bigger than 6. We find that a large CFG scale generally leads to faster videos.

Training details

FastHunyuan is consistency distillated on the MixKit dataset with the following hyperparamters:

  • Batch size: 16
  • Resulotion: 720x1280
  • Num of frames: 125
  • Train steps: 320
  • GPUs: 32
  • LR: 1e-6
  • Loss: huber

Evaluation

We provide some qualitative comparison between FastHunyuan 6 step inference v.s. the original Hunyuan with 6 step inference:

FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step
FastHunyuan 6 step Hunyuan 6 step