Upload folder using huggingface_hub
Browse files- .DS_Store +0 -0
- .gitattributes +3 -0
- README.md +24 -2
- assets/arch.png +3 -0
- assets/eval_res_en.png +3 -0
- assets/results_show.png +3 -0
.DS_Store
CHANGED
|
Binary files a/.DS_Store and b/.DS_Store differ
|
|
|
.gitattributes
CHANGED
|
@@ -34,3 +34,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
assets/image_edit_demo.gif filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
assets/image_edit_demo.gif filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
assets/arch.png filter=lfs diff=lfs merge=lfs -text
|
| 38 |
+
assets/eval_res_en.png filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
assets/results_show.png filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -1,10 +1,15 @@
|
|
| 1 |
---
|
| 2 |
license: mit
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
| 4 |
|
| 5 |
## 🔥🔥🔥 News!!
|
| 6 |
-
* Apr 25, 2025: 👋 We release the
|
| 7 |
-
* Apr 25, 2025: 👋 We release the inference code and model weights of Step1X-Edit. [Download Step1X-Edit model](https://huggingface.co/stepfun-ai/Step1X-Edit)
|
| 8 |
* Apr 25, 2025: 🎉 We have made our technical report available as open source. [Read](https://arxiv.org/abs/2504.17761)
|
| 9 |
|
| 10 |
## Image Edit Demos
|
|
@@ -15,6 +20,23 @@ license: mit
|
|
| 15 |
</div>
|
| 16 |
|
| 17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
## Citation
|
| 19 |
```
|
| 20 |
@article{liu2025step1x-edit,
|
|
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
pipeline_tag: image-text-to-image
|
| 6 |
+
tags:
|
| 7 |
+
- multimodal
|
| 8 |
+
library_name: transformers
|
| 9 |
---
|
| 10 |
|
| 11 |
## 🔥🔥🔥 News!!
|
| 12 |
+
* Apr 25, 2025: 👋 We release the inference code and model weights of Step1X-Edit. [inference code](https://github.com/stepfun-ai/Step1X-Edit)
|
|
|
|
| 13 |
* Apr 25, 2025: 🎉 We have made our technical report available as open source. [Read](https://arxiv.org/abs/2504.17761)
|
| 14 |
|
| 15 |
## Image Edit Demos
|
|
|
|
| 20 |
</div>
|
| 21 |
|
| 22 |
|
| 23 |
+
## Model introduction
|
| 24 |
+
<div align="center">
|
| 25 |
+
<img width="720" alt="demo" src="assets/arch.png">
|
| 26 |
+
</div>
|
| 27 |
+
|
| 28 |
+
Framework of Step1X-Edit. Step1X-Edit leverages the image understanding capabilities
|
| 29 |
+
of MLLMs to parse editing instructions and generate editing tokens, which are then decoded into
|
| 30 |
+
images using a DiT-based network.More details please refer to our [technical report](https://arxiv.org/abs/2504.17761).
|
| 31 |
+
|
| 32 |
+
|
| 33 |
+
## Benchmark
|
| 34 |
+
We release [GEdit-Bench](https://huggingface.co/datasets/stepfun-ai/GEdit-Bench) as a new benchmark, grounded in real-world usages is developed to support more authentic and comprehensive evaluation. This benchmark, which is carefully curated to reflect actual user editing needs and a wide range of editing scenarios, enables more authentic and comprehensive evaluations of image editing models.
|
| 35 |
+
The evaluation process and related code can be found in [GEdit-Bench/EVAL.md](GEdit-Bench/EVAL.md). Part results of the benchmark are shown below:
|
| 36 |
+
<div align="center">
|
| 37 |
+
<img width="1080" alt="results" src="assets/eval_res_en.png">
|
| 38 |
+
</div>
|
| 39 |
+
|
| 40 |
## Citation
|
| 41 |
```
|
| 42 |
@article{liu2025step1x-edit,
|
assets/arch.png
ADDED
|
Git LFS Details
|
assets/eval_res_en.png
ADDED
|
Git LFS Details
|
assets/results_show.png
ADDED
|
Git LFS Details
|