bghira commited on
Commit
9041c3a
·
verified ·
1 Parent(s): 9f27d38

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +273 -0
README.md ADDED
@@ -0,0 +1,273 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: "black-forest-labs/FLUX.1-dev"
4
+ tags:
5
+ - flux
6
+ - flux-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - simpletuner
10
+ - not-for-all-audiences
11
+ - lora
12
+ - template:sd-lora
13
+ - lycoris
14
+ inference: true
15
+ widget:
16
+ - text: 'unconditional (blank prompt)'
17
+ parameters:
18
+ negative_prompt: 'blurry, cropped, ugly'
19
+ output:
20
+ url: ./assets/image_0_0.png
21
+ - text: 'an iconic concept art image of domokun'
22
+ parameters:
23
+ negative_prompt: 'blurry, cropped, ugly'
24
+ output:
25
+ url: ./assets/image_1_0.png
26
+ - text: 'domokun, a minimalist masterpiece, is captured in a serene and elegant style, evoking a sense of simplicity and sophistication.'
27
+ parameters:
28
+ negative_prompt: 'blurry, cropped, ugly'
29
+ output:
30
+ url: ./assets/image_2_0.png
31
+ - text: 'what style of graffiti collage would you like to feature, and what themes or subjects would you like to include in the artwork?'
32
+ parameters:
33
+ negative_prompt: 'blurry, cropped, ugly'
34
+ output:
35
+ url: ./assets/image_3_0.png
36
+ - text: 'an image of a serene and elegant domokun, a traditional nigerian wooden mask, adorned with intricate art deco motifs, set against a neutral background to emphasize its cultural significance.'
37
+ parameters:
38
+ negative_prompt: 'blurry, cropped, ugly'
39
+ output:
40
+ url: ./assets/image_4_0.png
41
+ - text: 'a realistic portrayal of domokun, inspired by pixel art, with intricate details and textures, capturing the essence of a mysterious and ancient being, as if it were a character from a classic video game, with a focus on its facial features and body language, conveying a sense of otherworldliness and mystique, in a style reminiscent of retro video games, with a dash of modern'
42
+ parameters:
43
+ negative_prompt: 'blurry, cropped, ugly'
44
+ output:
45
+ url: ./assets/image_5_0.png
46
+ - text: 'what is the dominant color palette in a surrealist domokun artwork?'
47
+ parameters:
48
+ negative_prompt: 'blurry, cropped, ugly'
49
+ output:
50
+ url: ./assets/image_6_0.png
51
+ - text: 'domokun, in a renaissance composition, the artist''s brushstrokes would dance across the canvas, evoking the grandeur of a bygone era, as the subject, a young woman, is rendered in all her beauty, her features refined and delicate, yet strong and resilient, a testament to the timeless allure of the human form, set against a backdrop of rich'
52
+ parameters:
53
+ negative_prompt: 'blurry, cropped, ugly'
54
+ output:
55
+ url: ./assets/image_7_0.png
56
+ - text: 'detailed and atmospheric image that showcases domokun''s (i assume this refers to the japanese wooden puppet) intricate and ornate design, set against a backdrop of baroque elements, such as ornate patterns, grand architecture, or lavish furnishings.'
57
+ parameters:
58
+ negative_prompt: 'blurry, cropped, ugly'
59
+ output:
60
+ url: ./assets/image_8_0.png
61
+ - text: ' dystopian urban landscape where neon-lit skyscrapers pierce the smog-filled sky, and the streets are alive with the hum of augmented reality implants and the chatter of pedestrians in virtual reality contact lenses. in this gritty, high-tech world, a lone figure emerges from the shadows - a mysterious, masked vigilante known only as domokun, who uses their'
62
+ parameters:
63
+ negative_prompt: 'blurry, cropped, ugly'
64
+ output:
65
+ url: ./assets/image_9_0.png
66
+ - text: 'visually striking typography-themed artwork featuring the character domokun, incorporating bold, vibrant colors and intricate details to bring this beloved character to life in a captivating and dynamic composition.'
67
+ parameters:
68
+ negative_prompt: 'blurry, cropped, ugly'
69
+ output:
70
+ url: ./assets/image_10_0.png
71
+ - text: 'domokun, a steampunk inventor, sits amidst a cluttered workshop, surrounded by gears, clockwork devices, and various contraptions.'
72
+ parameters:
73
+ negative_prompt: 'blurry, cropped, ugly'
74
+ output:
75
+ url: ./assets/image_11_0.png
76
+ - text: 'visually striking image featuring a character named domokun, incorporating collage patterns that reflect their unique personality and style.'
77
+ parameters:
78
+ negative_prompt: 'blurry, cropped, ugly'
79
+ output:
80
+ url: ./assets/image_12_0.png
81
+ - text: 'what street art poster featuring domokun do you have in mind?'
82
+ parameters:
83
+ negative_prompt: 'blurry, cropped, ugly'
84
+ output:
85
+ url: ./assets/image_13_0.png
86
+ - text: 'visually appealing and emotive flat design portrait of domokun, focusing on capturing his personality and expression in a clean and minimalist style.'
87
+ parameters:
88
+ negative_prompt: 'blurry, cropped, ugly'
89
+ output:
90
+ url: ./assets/image_14_0.png
91
+ - text: 'domokun, rendered in pop surrealism tones, a dreamlike scene of a cityscape at dusk, with buildings and streets twisted and distorted, as if through a kaleidoscope of colors and patterns, evoking a sense of mystery and enchantment.'
92
+ parameters:
93
+ negative_prompt: 'blurry, cropped, ugly'
94
+ output:
95
+ url: ./assets/image_15_0.png
96
+ - text: 'what is the name of the artist who painted a domokun figure in an oil painting?'
97
+ parameters:
98
+ negative_prompt: 'blurry, cropped, ugly'
99
+ output:
100
+ url: ./assets/image_16_0.png
101
+ - text: ' constructivist representation of domokun, a figure embodying the fluidity and dynamism of modern life, as depicted in a dynamic and abstracted manner.'
102
+ parameters:
103
+ negative_prompt: 'blurry, cropped, ugly'
104
+ output:
105
+ url: ./assets/image_17_0.png
106
+ - text: 'an abstract representation of domokun in black and white, conveying the essence of this enigmatic figure, emphasizing the nuances of its presence, and inviting the viewer to contemplate its mysterious aura.'
107
+ parameters:
108
+ negative_prompt: 'blurry, cropped, ugly'
109
+ output:
110
+ url: ./assets/image_18_0.png
111
+ - text: 'vibrant oil painting collage that incorporates domokun, a traditional japanese woodblock print, as a central element, exploring the intersection of art and culture, and the beauty of imperfection in a mixed-media piece.'
112
+ parameters:
113
+ negative_prompt: 'blurry, cropped, ugly'
114
+ output:
115
+ url: ./assets/image_19_0.png
116
+ - text: 'domokun, the enigmatic figure, is often seen as a symbol of rebellion and nonconformity in the world of graffiti art.'
117
+ parameters:
118
+ negative_prompt: 'blurry, cropped, ugly'
119
+ output:
120
+ url: ./assets/image_20_0.png
121
+ - text: 'a realistic portrayal of domokun, inspired by avant-garde art forms, that seamlessly blend elements of the human experience with a touch of the unknown, exploring themes of identity and existential crisis, through a unique lens of surrealism and psychological complexity.'
122
+ parameters:
123
+ negative_prompt: 'blurry, cropped, ugly'
124
+ output:
125
+ url: ./assets/image_21_0.png
126
+ - text: 'create an image of domokun in the style of flat design.'
127
+ parameters:
128
+ negative_prompt: 'blurry, cropped, ugly'
129
+ output:
130
+ url: ./assets/image_22_0.png
131
+ - text: 'domokun, a figure from a bygone era, is captured through the lens of vintage photography, evoking a sense of nostalgia and timelessness.'
132
+ parameters:
133
+ negative_prompt: 'blurry, cropped, ugly'
134
+ output:
135
+ url: ./assets/image_23_0.png
136
+ - text: ' serene and tranquil scene featuring the iconic samurai warrior, domokun, standing alongside the renowned artist, ukiyo-e, in a traditional japanese setting, surrounded by vibrant and colorful woodblock prints, showcasing the beauty of japanese art and culture.'
137
+ parameters:
138
+ negative_prompt: 'blurry, cropped, ugly'
139
+ output:
140
+ url: ./assets/image_24_0.png
141
+ - text: 'domokun, as depicted in the pop art movement, a 20th-century art style characterized by bold, vibrant colors and a focus on consumer culture, often featuring everyday objects and icons in a stylized and exaggerated manner.'
142
+ parameters:
143
+ negative_prompt: 'blurry, cropped, ugly'
144
+ output:
145
+ url: ./assets/image_25_0.png
146
+ - text: 'A photo-realistic image of a cat'
147
+ parameters:
148
+ negative_prompt: 'blurry, cropped, ugly'
149
+ output:
150
+ url: ./assets/image_26_0.png
151
+ ---
152
+
153
+ # flux-domokun
154
+
155
+ This is a LyCORIS adapter derived from [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev).
156
+
157
+
158
+ The main validation prompt used during training was:
159
+
160
+
161
+
162
+ ```
163
+ A photo-realistic image of a cat
164
+ ```
165
+
166
+ ## Validation settings
167
+ - CFG: `3.0`
168
+ - CFG Rescale: `0.0`
169
+ - Steps: `20`
170
+ - Sampler: `None`
171
+ - Seed: `42`
172
+ - Resolution: `1024x1024`
173
+
174
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
175
+
176
+ You can find some example images in the following gallery:
177
+
178
+
179
+ <Gallery />
180
+
181
+ The text encoder **was not** trained.
182
+ You may reuse the base model text encoder for inference.
183
+
184
+
185
+ ## Training settings
186
+
187
+ - Training epochs: 0
188
+ - Training steps: 100
189
+ - Learning rate: 0.0002
190
+ - Effective batch size: 9
191
+ - Micro-batch size: 3
192
+ - Gradient accumulation steps: 1
193
+ - Number of GPUs: 3
194
+ - Prediction type: flow-matching
195
+ - Rescaled betas zero SNR: False
196
+ - Optimizer: bnb-lion8bit
197
+ - Precision: Pure BF16
198
+ - Quantised: Yes: nf4-bnb
199
+ - Xformers: Not used
200
+ - LyCORIS Config:
201
+ ```json
202
+ {
203
+ "bypass_mode": true,
204
+ "algo": "lokr",
205
+ "multiplier": 1.0,
206
+ "linear_dim": 10000,
207
+ "linear_alpha": 1,
208
+ "factor": 12,
209
+ "apply_preset": {
210
+ "target_module": [
211
+ "Attention",
212
+ "FeedForward"
213
+ ],
214
+ "module_algo_map": {
215
+ "Attention": {
216
+ "factor": 12
217
+ },
218
+ "FeedForward": {
219
+ "factor": 6
220
+ }
221
+ }
222
+ }
223
+ }
224
+ ```
225
+
226
+ ## Datasets
227
+
228
+ ### domokun-uncropped-512
229
+ - Repeats: 10
230
+ - Total number of images: ~36
231
+ - Total number of aspect buckets: 7
232
+ - Resolution: 0.262144 megapixels
233
+ - Cropped: False
234
+ - Crop style: None
235
+ - Crop aspect: None
236
+ ### domokun-cropped-512
237
+ - Repeats: 10
238
+ - Total number of images: ~36
239
+ - Total number of aspect buckets: 7
240
+ - Resolution: 0.262144 megapixels
241
+ - Cropped: False
242
+ - Crop style: None
243
+ - Crop aspect: None
244
+
245
+
246
+ ## Inference
247
+
248
+
249
+ ```python
250
+ import torch
251
+ from diffusers import DiffusionPipeline
252
+ from lycoris import create_lycoris_from_weights
253
+
254
+ model_id = 'black-forest-labs/FLUX.1-dev'
255
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
256
+ lora_scale = 1.0
257
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
258
+ wrapper.merge_to()
259
+
260
+ prompt = "A photo-realistic image of a cat"
261
+
262
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
263
+ image = pipeline(
264
+ prompt=prompt,
265
+ num_inference_steps=20,
266
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
267
+ width=1024,
268
+ height=1024,
269
+ guidance_scale=3.0,
270
+ ).images[0]
271
+ image.save("output.png", format="PNG")
272
+ ```
273
+