Update README.md
Browse files
README.md
CHANGED
@@ -1,349 +1,352 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
language:
|
4 |
-
- en
|
5 |
-
- zh
|
6 |
-
pipeline_tag: text-to-video
|
7 |
-
library_name: diffusers
|
8 |
-
tags:
|
9 |
-
- video
|
10 |
-
- video-generation
|
11 |
-
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
-
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
-
|
22 |
-
|
23 |
-
|
24 |
-
|
25 |
-
|
26 |
-
|
27 |
-
- [
|
28 |
-
- [
|
29 |
-
- [
|
30 |
-
- [
|
31 |
-
|
32 |
-
|
33 |
-
|
34 |
-
|
35 |
-
|
36 |
-
|
37 |
-
|
|
38 |
-
|
39 |
-
|
40 |
-
|
41 |
-
|
42 |
-
|
43 |
-
|
44 |
-
|
45 |
-
|
46 |
-
|
47 |
-
|
48 |
-
|
49 |
-
|
50 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
51 |
-
|
52 |
-
|
53 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
54 |
-
|
55 |
-
<td>
|
56 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
57 |
-
|
58 |
-
|
59 |
-
|
60 |
-
|
61 |
-
|
62 |
-
|
63 |
-
|
64 |
-
|
65 |
-
|
66 |
-
<td>
|
67 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
68 |
-
</td>
|
69 |
-
|
70 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
71 |
-
|
72 |
-
|
73 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
74 |
-
</td>
|
75 |
-
|
76 |
-
|
77 |
-
|
78 |
-
|
79 |
-
|
80 |
-
|
81 |
-
|
82 |
-
|
83 |
-
|
84 |
-
|
85 |
-
|
86 |
-
<td>
|
87 |
-
|
88 |
-
</td>
|
89 |
-
<td>
|
90 |
-
|
91 |
-
</td>
|
92 |
-
|
93 |
-
|
94 |
-
|
95 |
-
|
96 |
-
<td>
|
97 |
-
<
|
98 |
-
</td>
|
99 |
-
|
100 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
101 |
-
|
102 |
-
|
103 |
-
|
104 |
-
|
105 |
-
|
106 |
-
|
107 |
-
|
108 |
-
|
109 |
-
|
110 |
-
|
111 |
-
|
112 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
113 |
-
|
114 |
-
|
115 |
-
|
116 |
-
|
117 |
-
<
|
118 |
-
|
119 |
-
|
120 |
-
|
121 |
-
|
122 |
-
<td>
|
123 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
124 |
-
</td>
|
125 |
-
|
126 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
127 |
-
|
128 |
-
|
129 |
-
|
130 |
-
|
131 |
-
|
132 |
-
<td>
|
133 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
134 |
-
</td>
|
135 |
-
|
136 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
137 |
-
|
138 |
-
|
139 |
-
|
140 |
-
|
141 |
-
|
142 |
-
|
143 |
-
|
144 |
-
|
145 |
-
|
146 |
-
|
147 |
-
|
148 |
-
<td>
|
149 |
-
Pan
|
150 |
-
</td>
|
151 |
-
|
152 |
-
Pan
|
153 |
-
|
154 |
-
|
155 |
-
|
156 |
-
</td>
|
157 |
-
|
158 |
-
|
159 |
-
|
160 |
-
|
161 |
-
<td>
|
162 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
163 |
-
</td>
|
164 |
-
|
165 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
166 |
-
|
167 |
-
|
168 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
169 |
-
</td>
|
170 |
-
|
171 |
-
|
172 |
-
|
173 |
-
|
174 |
-
<td>
|
175 |
-
Pan
|
176 |
-
</td>
|
177 |
-
|
178 |
-
Pan Up + Pan
|
179 |
-
|
180 |
-
<td>
|
181 |
-
|
182 |
-
</td>
|
183 |
-
|
184 |
-
|
185 |
-
|
186 |
-
|
187 |
-
<td>
|
188 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
189 |
-
</td>
|
190 |
-
|
191 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
192 |
-
|
193 |
-
|
194 |
-
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/
|
195 |
-
</td>
|
196 |
-
|
197 |
-
|
198 |
-
|
199 |
-
|
200 |
-
|
201 |
-
|
202 |
-
|
203 |
-
|
204 |
-
|
205 |
-
|
206 |
-
|
207 |
-
|
208 |
-
|
209 |
-
|
210 |
-
|
211 |
-
|
212 |
-
|
213 |
-
|
214 |
-
|
215 |
-
|
216 |
-
|
217 |
-
|
218 |
-
|
219 |
-
|
220 |
-
#
|
221 |
-
docker
|
222 |
-
|
223 |
-
#
|
224 |
-
|
225 |
-
|
226 |
-
#
|
227 |
-
|
228 |
-
|
229 |
-
#
|
230 |
-
|
231 |
-
|
232 |
-
|
233 |
-
|
234 |
-
|
235 |
-
|
236 |
-
#
|
237 |
-
|
238 |
-
#
|
239 |
-
# https://
|
240 |
-
|
241 |
-
#
|
242 |
-
# https://
|
243 |
-
|
244 |
-
|
245 |
-
|
246 |
-
|
247 |
-
|
248 |
-
|
249 |
-
|
250 |
-
|
251 |
-
|
252 |
-
|
253 |
-
-
|
254 |
-
-
|
255 |
-
-
|
256 |
-
|
257 |
-
|
258 |
-
-
|
259 |
-
|
260 |
-
|
261 |
-
-
|
262 |
-
-
|
263 |
-
-
|
264 |
-
|
265 |
-
|
266 |
-
|
267 |
-
|
268 |
-
|
269 |
-
|
270 |
-
|
271 |
-
|
272 |
-
|
273 |
-
|
274 |
-
|
275 |
-
|
276 |
-
|
277 |
-
|
278 |
-
│
|
279 |
-
│
|
280 |
-
|
281 |
-
|
282 |
-
|
283 |
-
```
|
284 |
-
|
285 |
-
|
286 |
-
|
287 |
-
|
288 |
-
|
289 |
-
│
|
290 |
-
├── 📂
|
291 |
-
│
|
292 |
-
|
293 |
-
|
294 |
-
|
295 |
-
|
296 |
-
|
297 |
-
|
298 |
-
|
299 |
-
|
300 |
-
|
301 |
-
|
302 |
-
-
|
303 |
-
|
304 |
-
|
305 |
-
|
306 |
-
|
307 |
-
|
308 |
-
|
309 |
-
|
310 |
-
####
|
311 |
-
|
312 |
-
|
313 |
-
|
314 |
-
|
315 |
-
|
316 |
-
-
|
317 |
-
- 使用examples/cogvideox_fun/
|
318 |
-
-
|
319 |
-
|
320 |
-
|
321 |
-
-
|
322 |
-
-
|
323 |
-
|
324 |
-
|
325 |
-
-
|
326 |
-
-
|
327 |
-
|
328 |
-
-
|
329 |
-
|
330 |
-
|
331 |
-
|
332 |
-
|
333 |
-
|
334 |
-
|
335 |
-
-
|
336 |
-
|
337 |
-
|
338 |
-
|
339 |
-
-
|
340 |
-
|
341 |
-
|
342 |
-
-
|
343 |
-
-
|
344 |
-
-
|
345 |
-
-
|
346 |
-
-
|
347 |
-
|
348 |
-
|
349 |
-
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
- zh
|
6 |
+
pipeline_tag: text-to-video
|
7 |
+
library_name: diffusers
|
8 |
+
tags:
|
9 |
+
- video
|
10 |
+
- video-generation
|
11 |
+
- wan2.2
|
12 |
+
base_model:
|
13 |
+
- Wan-AI/Wan2.2-I2V-A14B
|
14 |
+
---
|
15 |
+
|
16 |
+
# Wan-Fun
|
17 |
+
|
18 |
+
😊 Welcome!
|
19 |
+
|
20 |
+
[](https://huggingface.co/spaces/alibaba-pai/Wan2.1-Fun-1.3B-InP)
|
21 |
+
|
22 |
+
[](https://github.com/aigc-apps/VideoX-Fun)
|
23 |
+
|
24 |
+
[English](./README_en.md) | [简体中文](./README.md)
|
25 |
+
|
26 |
+
# 目录
|
27 |
+
- [目录](#目录)
|
28 |
+
- [模型地址](#模型地址)
|
29 |
+
- [视频作品](#视频作品)
|
30 |
+
- [快速启动](#快速启动)
|
31 |
+
- [如何使用](#如何使用)
|
32 |
+
- [参考文献](#参考文献)
|
33 |
+
- [许可证](#许可证)
|
34 |
+
|
35 |
+
# 模型地址
|
36 |
+
|
37 |
+
| 名称 | 存储空间 | Hugging Face | Model Scope | 描述 |
|
38 |
+
|--|--|--|--|--|
|
39 |
+
| Wan2.2-Fun-A14B-InP | 64.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/Wan2.2-Fun-A14B-InP) | [😄Link](https://modelscope.cn/models/PAI/Wan2.2-Fun-A14B-InP) | Wan2.2-Fun-14B文图生视频权重,以多分辨率训练,支持首尾图预测。 |
|
40 |
+
| Wan2.2-Fun-A14B-Control | 64.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/Wan2.2-Fun-A14B-Control) | [😄Link](https://modelscope.cn/models/PAI/Wan2.2-Fun-A14B-Control)| Wan2.2-Fun-14B视频控制权重,支持不同的控制条件,如Canny、Depth、Pose、MLSD等,同时支持使用轨迹控制。支持多分辨率(512,768,1024)的视频预测,支持多分辨率(512,768,1024)的视频预测,以81帧、每秒16帧进行训练,支持多语言预测 |
|
41 |
+
| Wan2.2-Fun-A14B-Control-Camera | 64.0 GB | [🤗Link](https://huggingface.co/alibaba-pai/Wan2.2-Fun-A14B-Control-Camera) | [😄Link](https://modelscope.cn/models/PAI/Wan2.2-Fun-A14B-Control-Camera)| Wan2.2-Fun-14B相机镜头控制权重。支持多分辨率(512,768,1024)的视频预测,支持多分辨率(512,768,1024)的视频预测,以81帧、每秒16帧进行训练,支持多语言预测 |
|
42 |
+
|
43 |
+
# 视频作品
|
44 |
+
|
45 |
+
### Wan2.2-Fun-A14B-InP
|
46 |
+
|
47 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
48 |
+
<tr>
|
49 |
+
<td>
|
50 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_1.mp4" width="100%" controls autoplay loop></video>
|
51 |
+
</td>
|
52 |
+
<td>
|
53 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_2.mp4" width="100%" controls autoplay loop></video>
|
54 |
+
</td>
|
55 |
+
<td>
|
56 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_3.mp4" width="100%" controls autoplay loop></video>
|
57 |
+
</td>
|
58 |
+
<td>
|
59 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_4.mp4" width="100%" controls autoplay loop></video>
|
60 |
+
</td>
|
61 |
+
</tr>
|
62 |
+
</table>
|
63 |
+
|
64 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
65 |
+
<tr>
|
66 |
+
<td>
|
67 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_5.mp4" width="100%" controls autoplay loop></video>
|
68 |
+
</td>
|
69 |
+
<td>
|
70 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_6.mp4" width="100%" controls autoplay loop></video>
|
71 |
+
</td>
|
72 |
+
<td>
|
73 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_7.mp4" width="100%" controls autoplay loop></video>
|
74 |
+
</td>
|
75 |
+
<td>
|
76 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/inp_8.mp4" width="100%" controls autoplay loop></video>
|
77 |
+
</td>
|
78 |
+
</tr>
|
79 |
+
</table>
|
80 |
+
|
81 |
+
### Wan2.2-Fun-A14B-Control
|
82 |
+
|
83 |
+
Generic Control Video + Reference Image:
|
84 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
85 |
+
<tr>
|
86 |
+
<td>
|
87 |
+
Reference Image
|
88 |
+
</td>
|
89 |
+
<td>
|
90 |
+
Control Video
|
91 |
+
</td>
|
92 |
+
<td>
|
93 |
+
Wan2.2-Fun-14B-Control
|
94 |
+
</td>
|
95 |
+
<tr>
|
96 |
+
<td>
|
97 |
+
<image src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/8.png" width="100%" controls autoplay loop></image>
|
98 |
+
</td>
|
99 |
+
<td>
|
100 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/pose.mp4" width="100%" controls autoplay loop></video>
|
101 |
+
</td>
|
102 |
+
<td>
|
103 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/14b_ref.mp4" width="100%" controls autoplay loop></video>
|
104 |
+
</td>
|
105 |
+
<tr>
|
106 |
+
</table>
|
107 |
+
|
108 |
+
Generic Control Video (Canny, Pose, Depth, etc.) and Trajectory Control:
|
109 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
110 |
+
<tr>
|
111 |
+
<td>
|
112 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/guiji.mp4" width="100%" controls autoplay loop></video>
|
113 |
+
</td>
|
114 |
+
<td>
|
115 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/guiji_out.mp4" width="100%" controls autoplay loop></video>
|
116 |
+
</td>
|
117 |
+
<tr>
|
118 |
+
</table>
|
119 |
+
|
120 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
121 |
+
<tr>
|
122 |
+
<td>
|
123 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/pose.mp4" width="100%" controls autoplay loop></video>
|
124 |
+
</td>
|
125 |
+
<td>
|
126 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/canny.mp4" width="100%" controls autoplay loop></video>
|
127 |
+
</td>
|
128 |
+
<td>
|
129 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/depth.mp4" width="100%" controls autoplay loop></video>
|
130 |
+
</td>
|
131 |
+
<tr>
|
132 |
+
<td>
|
133 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/pose_out.mp4" width="100%" controls autoplay loop></video>
|
134 |
+
</td>
|
135 |
+
<td>
|
136 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/canny_out.mp4" width="100%" controls autoplay loop></video>
|
137 |
+
</td>
|
138 |
+
<td>
|
139 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/depth_out.mp4" width="100%" controls autoplay loop></video>
|
140 |
+
</td>
|
141 |
+
</tr>
|
142 |
+
</table>
|
143 |
+
|
144 |
+
### Wan2.2-Fun-A14B-Control-Camera
|
145 |
+
|
146 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
147 |
+
<tr>
|
148 |
+
<td>
|
149 |
+
Pan Up
|
150 |
+
</td>
|
151 |
+
<td>
|
152 |
+
Pan Left
|
153 |
+
</td>
|
154 |
+
<td>
|
155 |
+
Pan Right
|
156 |
+
</td>
|
157 |
+
<td>
|
158 |
+
Zoom In
|
159 |
+
</td>
|
160 |
+
<tr>
|
161 |
+
<td>
|
162 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Pan_Up.mp4" width="100%" controls autoplay loop></video>
|
163 |
+
</td>
|
164 |
+
<td>
|
165 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Pan_Left.mp4" width="100%" controls autoplay loop></video>
|
166 |
+
</td>
|
167 |
+
<td>
|
168 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Pan_Right.mp4" width="100%" controls autoplay loop></video>
|
169 |
+
</td>
|
170 |
+
<td>
|
171 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Zoom_In.mp4" width="100%" controls autoplay loop></video>
|
172 |
+
</td>
|
173 |
+
<tr>
|
174 |
+
<td>
|
175 |
+
Pan Down
|
176 |
+
</td>
|
177 |
+
<td>
|
178 |
+
Pan Up + Pan Left
|
179 |
+
</td>
|
180 |
+
<td>
|
181 |
+
Pan Up + Pan Right
|
182 |
+
</td>
|
183 |
+
<td>
|
184 |
+
Zoom Out
|
185 |
+
</td>
|
186 |
+
<tr>
|
187 |
+
<td>
|
188 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Pan_Down.mp4" width="100%" controls autoplay loop></video>
|
189 |
+
</td>
|
190 |
+
<td>
|
191 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Pan_UL.mp4" width="100%" controls autoplay loop></video>
|
192 |
+
</td>
|
193 |
+
<td>
|
194 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Pan_UR.mp4" width="100%" controls autoplay loop></video>
|
195 |
+
</td>
|
196 |
+
<td>
|
197 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset_Wan2_2/v1.0/Zoom_Out.mp4" width="100%" controls autoplay loop></video>
|
198 |
+
</td>
|
199 |
+
</tr>
|
200 |
+
</table>
|
201 |
+
|
202 |
+
|
203 |
+
# 快速启动
|
204 |
+
### 1. 云使用: AliyunDSW/Docker
|
205 |
+
#### a. 通过阿里云 DSW
|
206 |
+
DSW 有免费 GPU 时间,用户可申请一次,申请后3个月内有效。
|
207 |
+
|
208 |
+
阿里云在[Freetier](https://free.aliyun.com/?product=9602825&crowd=enterprise&spm=5176.28055625.J_5831864660.1.e939154aRgha4e&scm=20140722.M_9974135.P_110.MO_1806-ID_9974135-MID_9974135-CID_30683-ST_8512-V_1)提供免费GPU时间,获取并在阿里云PAI-DSW中使用,5分钟内即可启动CogVideoX-Fun。
|
209 |
+
|
210 |
+
[](https://gallery.pai-ml.com/#/preview/deepLearning/cv/cogvideox_fun)
|
211 |
+
|
212 |
+
#### b. 通过ComfyUI
|
213 |
+
我们的ComfyUI界面如下,具体查看[ComfyUI README](comfyui/README.md)。
|
214 |
+

|
215 |
+
|
216 |
+
#### c. 通过docker
|
217 |
+
使用docker的情况下,请保证机器中已经正确安装显卡驱动与CUDA环境,然后以此执行以下命令:
|
218 |
+
|
219 |
+
```
|
220 |
+
# pull image
|
221 |
+
docker pull mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cuda:cogvideox_fun
|
222 |
+
|
223 |
+
# enter image
|
224 |
+
docker run -it -p 7860:7860 --network host --gpus all --security-opt seccomp:unconfined --shm-size 200g mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cuda:cogvideox_fun
|
225 |
+
|
226 |
+
# clone code
|
227 |
+
git clone https://github.com/aigc-apps/VideoX-Fun.git
|
228 |
+
|
229 |
+
# enter VideoX-Fun's dir
|
230 |
+
cd VideoX-Fun
|
231 |
+
|
232 |
+
# download weights
|
233 |
+
mkdir models/Diffusion_Transformer
|
234 |
+
mkdir models/Personalized_Model
|
235 |
+
|
236 |
+
# Please use the hugginface link or modelscope link to download the model.
|
237 |
+
# CogVideoX-Fun
|
238 |
+
# https://huggingface.co/alibaba-pai/CogVideoX-Fun-V1.1-5b-InP
|
239 |
+
# https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-5b-InP
|
240 |
+
|
241 |
+
# Wan
|
242 |
+
# https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-InP
|
243 |
+
# https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-InP
|
244 |
+
# https://huggingface.co/alibaba-pai/Wan2.2-Fun-A14B-InP
|
245 |
+
# https://modelscope.cn/models/PAI/Wan2.2-Fun-A14B-InP
|
246 |
+
```
|
247 |
+
|
248 |
+
### 2. 本地安装: 环境检查/下载/安装
|
249 |
+
#### a. 环境检查
|
250 |
+
我们已验证该库可在以下环境中执行:
|
251 |
+
|
252 |
+
Windows 的详细信息:
|
253 |
+
- 操作系统 Windows 10
|
254 |
+
- python: python3.10 & python3.11
|
255 |
+
- pytorch: torch2.2.0
|
256 |
+
- CUDA: 11.8 & 12.1
|
257 |
+
- CUDNN: 8+
|
258 |
+
- GPU: Nvidia-3060 12G & Nvidia-3090 24G
|
259 |
+
|
260 |
+
Linux 的详细信息:
|
261 |
+
- 操作系统 Ubuntu 20.04, CentOS
|
262 |
+
- python: python3.10 & python3.11
|
263 |
+
- pytorch: torch2.2.0
|
264 |
+
- CUDA: 11.8 & 12.1
|
265 |
+
- CUDNN: 8+
|
266 |
+
- GPU:Nvidia-V100 16G & Nvidia-A10 24G & Nvidia-A100 40G & Nvidia-A100 80G
|
267 |
+
|
268 |
+
我们需要大约 60GB 的可用磁盘空间,请检查!
|
269 |
+
|
270 |
+
#### b. 权重放置
|
271 |
+
我们最好将[权重](#model-zoo)按照指定路径进行放置:
|
272 |
+
|
273 |
+
**通过comfyui**:
|
274 |
+
将模型放入Comfyui的权重文件夹`ComfyUI/models/Fun_Models/`:
|
275 |
+
```
|
276 |
+
📦 ComfyUI/
|
277 |
+
├── 📂 models/
|
278 |
+
│ └── 📂 Fun_Models/
|
279 |
+
│ ├── 📂 CogVideoX-Fun-V1.1-2b-InP/
|
280 |
+
│ ├── 📂 CogVideoX-Fun-V1.1-5b-InP/
|
281 |
+
│ ├── 📂 Wan2.1-Fun-V1.1-14B-InP
|
282 |
+
│ └── 📂 Wan2.1-Fun-V1.1-1.3B-InP/
|
283 |
+
```
|
284 |
+
|
285 |
+
**运行自身的python文件或ui界面**:
|
286 |
+
```
|
287 |
+
📦 models/
|
288 |
+
├── 📂 Diffusion_Transformer/
|
289 |
+
│ ├── 📂 CogVideoX-Fun-V1.1-2b-InP/
|
290 |
+
│ ├── 📂 CogVideoX-Fun-V1.1-5b-InP/
|
291 |
+
│ ├── 📂 Wan2.1-Fun-V1.1-14B-InP
|
292 |
+
│ └── 📂 Wan2.1-Fun-V1.1-1.3B-InP/
|
293 |
+
├── 📂 Personalized_Model/
|
294 |
+
│ └── your trained trainformer model / your trained lora model (for UI load)
|
295 |
+
```
|
296 |
+
|
297 |
+
# 如何使用
|
298 |
+
|
299 |
+
<h3 id="video-gen">1. 生成 </h3>
|
300 |
+
|
301 |
+
#### a、显存节省方案
|
302 |
+
由于Wan2.2的参数非常大,我们需要考虑显存节省方案,以节省显存适应消费级显卡。我们给每个预测文件都提供了GPU_memory_mode,可以在model_cpu_offload,model_cpu_offload_and_qfloat8,sequential_cpu_offload中进行选择。该方案同样适用于CogVideoX-Fun的生成。
|
303 |
+
|
304 |
+
- model_cpu_offload代表整个模型在使用后会进入cpu,可以节省部分显存。
|
305 |
+
- model_cpu_offload_and_qfloat8代表整个模型在使用后会进入cpu,并且对transformer模型进行了float8的量化,可以节省更多的显存。
|
306 |
+
- sequential_cpu_offload代表模型的每一层在使用后会进入cpu,速度较慢,节省大量显存。
|
307 |
+
|
308 |
+
qfloat8会部分降低模型的性能,但可以节省更多的显存。如果显存足够,推荐使用model_cpu_offload。
|
309 |
+
|
310 |
+
#### b、通过comfyui
|
311 |
+
具体查看[ComfyUI README](https://github.com/aigc-apps/VideoX-Fun/tree/main/comfyui)。
|
312 |
+
|
313 |
+
#### c、运行python文件
|
314 |
+
- 步骤1:下载对应[权重](#model-zoo)放入models文件夹。
|
315 |
+
- 步骤2:根据不同的权重与预测目标使用不同的文件进行预测。当前该库支持CogVideoX-Fun、Wan2.1、Wan2.1-Fun、Wan2.2,在examples文件夹下用文件夹名以区分,不同模型支持的功能不同,请视具体情况予以区分。以CogVideoX-Fun为例。
|
316 |
+
- 文生视频:
|
317 |
+
- 使用examples/cogvideox_fun/predict_t2v.py文件中修改prompt、neg_prompt、guidance_scale和seed。
|
318 |
+
- 而后运行examples/cogvideox_fun/predict_t2v.py文件��等待生成结果,结果保存在samples/cogvideox-fun-videos文件夹中。
|
319 |
+
- 图生视频:
|
320 |
+
- 使用examples/cogvideox_fun/predict_i2v.py文件中修改validation_image_start、validation_image_end、prompt、neg_prompt、guidance_scale和seed。
|
321 |
+
- validation_image_start是视频的开始图片,validation_image_end是视频的结尾图片。
|
322 |
+
- 而后运行examples/cogvideox_fun/predict_i2v.py文件,等待生成结果,结果保存在samples/cogvideox-fun-videos_i2v文件夹中。
|
323 |
+
- 视频生视频:
|
324 |
+
- 使用examples/cogvideox_fun/predict_v2v.py文件中修改validation_video、validation_image_end、prompt、neg_prompt、guidance_scale和seed。
|
325 |
+
- validation_video是视频生视频的参考视频。您可以使用以下视频运行演示:[演示视频](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1/play_guitar.mp4)
|
326 |
+
- 而后运行examples/cogvideox_fun/predict_v2v.py文件,等待生成结果,结果保存在samples/cogvideox-fun-videos_v2v文件夹中。
|
327 |
+
- 普通控制生视频(Canny、Pose、Depth等):
|
328 |
+
- 使用examples/cogvideox_fun/predict_v2v_control.py文件中修改control_video、validation_image_end、prompt、neg_prompt、guidance_scale和seed。
|
329 |
+
- control_video是控制生视频的控制视频,是使用Canny、Pose、Depth等算子提取后的视频。您可以使用以下视频运行演示:[演示视频](https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/cogvideox_fun/asset/v1.1/pose.mp4)
|
330 |
+
- 而后运行examples/cogvideox_fun/predict_v2v_control.py文件,等待生成结果,结果保存在samples/cogvideox-fun-videos_v2v_control文件夹中。
|
331 |
+
- 步骤3:如果想结合自己训练的其他backbone与Lora,则看情况修改examples/{model_name}/predict_t2v.py中的examples/{model_name}/predict_i2v.py和lora_path。
|
332 |
+
|
333 |
+
#### d、通过ui界面
|
334 |
+
|
335 |
+
webui支持文生视频、图生视频、视频生视频和普通控制生视频(Canny、Pose、Depth等)。在examples文件夹下用文件夹名以区分,不同模型支持的功能不同,请视具体情况予以区分。以CogVideoX-Fun为例。
|
336 |
+
|
337 |
+
- 步骤1:下载对应[权重](#model-zoo)放入models文件夹。
|
338 |
+
- 步骤2:运行examples/cogvideox_fun/app.py文件,进入gradio页面。
|
339 |
+
- 步骤3:根据页面选择生成模型,填入prompt、neg_prompt、guidance_scale和seed等,点击生成,等待生成结果,结果保存在sample文件夹中。
|
340 |
+
|
341 |
+
# 参考文献
|
342 |
+
- CogVideo: https://github.com/THUDM/CogVideo/
|
343 |
+
- EasyAnimate: https://github.com/aigc-apps/EasyAnimate
|
344 |
+
- Wan2.1: https://github.com/Wan-Video/Wan2.1/
|
345 |
+
- Wan2.1: https://github.com/Wan-Video/Wan2.2/
|
346 |
+
- ComfyUI-KJNodes: https://github.com/kijai/ComfyUI-KJNodes
|
347 |
+
- ComfyUI-EasyAnimateWrapper: https://github.com/kijai/ComfyUI-EasyAnimateWrapper
|
348 |
+
- ComfyUI-CameraCtrl-Wrapper: https://github.com/chaojie/ComfyUI-CameraCtrl-Wrapper
|
349 |
+
- CameraCtrl: https://github.com/hehao13/CameraCtrl
|
350 |
+
|
351 |
+
# 许可证
|
352 |
+
本项目采用 [Apache License (Version 2.0)](https://github.com/modelscope/modelscope/blob/master/LICENSE).
|