File size: 836 Bytes
f9703a7 2542bb3 ee6782f 6c3f0d5 ee6782f c6db579 3b8b95b f9703a7 11414d8 2542bb3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
---
license: mit
---
[](https://discord.gg/2JhHVh7CGu)
A semi custom network based on the follow paper [Simpler Diffusion (SiD2)](https://arxiv.org/abs/2410.19324v1)
This network uses the optimal transport flow matching objective outlined [Flow Matching for Generative Modeling](https://arxiv.org/abs/2210.02747)
xATGLU Layers are used instead of linears for entry into the transformer MLP layer [Expanded Gating Ranges
Improve Activation Functions](https://arxiv.org/pdf/2405.20768)
`python train.py` will train a new image network on the provided dataset. A pretrained model is provided here, step_1799.safetensors. To test, use:
`python test_sample.py step_1799.safetensors`



 |