DFloat11
/

Wan2.2-T2V-A14B-DF11

@@ -113,13 +113,14 @@ We apply Huffman coding to the exponent bits of BFloat16 model weights, which ar
 4. To run without CPU offloading (40GB VRAM required):
     ```bash
-    python t2v.py
     ```
-   To run with CPU offloading (22.5GB VRAM required):
-   ```bash
-   python t2v.py --cpu_offload
-   ```
 ### 📄 Learn More

 4. To run without CPU offloading (40GB VRAM required):
     ```bash
+    PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True python t2v.py
     ```
+    To run with CPU offloading (22.5GB VRAM required):
+    ```bash
+    PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True python t2v.py --cpu_offload
+    ```
+    > Setting `PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True` is strongly recommended to prevent out-of-memory errors caused by GPU memory fragmentation.
 ### 📄 Learn More