File size: 6,015 Bytes
82895b4
 
 
 
 
 
 
28a8179
 
 
 
 
 
9b5a595
82895b4
404ebbe
 
b391815
404ebbe
6b3c6e0
404ebbe
8bb7a05
 
 
 
 
 
 
 
1a2178d
404ebbe
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
59da807
404ebbe
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1078a9b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8bb7a05
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
404ebbe
 
 
 
 
 
 
 
 
 
 
 
 
 
 
82895b4
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
---
license: apache-2.0
language:
- en
base_model:
- Qwen/Qwen-Image
pipeline_tag: text-to-image
tags:
- lora
- qwen
- qwen-image
- realism
- realism-lora
library_name: diffusers
---
## LORA Qwen-Image example

Lora for [Qwen-Image](https://huggingface.co/Qwen/Qwen-Image)

**Realism** word required

# 📝 Updates

## August 13, 2025 - Realism v1.1 Release
- **Enhanced facial detail quality**: Improved fine-grained facial features and skin texture rendering
- **Better color reproduction**: More accurate and vibrant color representation across all scenes
- **Improved lighting and shadows for landscapes**: Enhanced natural lighting effects and shadow detail in outdoor scenes
- **Increased diversity across different ethnicities**: Better representation and quality for various racial and ethnic features

# 🧪 Usage
---
### 🔧 Initialization

```python
from diffusers import DiffusionPipeline
import torch

model_name = "Qwen/Qwen-Image"

# Load the pipeline
if torch.cuda.is_available():
    torch_dtype = torch.bfloat16
    device = "cuda"
else:
    torch_dtype = torch.float32
    device = "cpu"

pipe = DiffusionPipeline.from_pretrained(model_name, torch_dtype=torch_dtype)
pipe = pipe.to(device)
```

### 🔌 Load LoRA Weights

```python
# Load LoRA weights
pipe.load_lora_weights('flymy_realism.safetensors', adapter_name="lora")
```

### 🎨 Generate Image with lora trained on person

```python
prompt = '''Super Realism portrait of a teenager woman of African descent, serene calmness, arms crossed, illuminated by dramatic studio lighting, sunlit park in the background, adorned with delicate jewelry, three-quarter view, sun-kissed skin with natural imperfections, loose shoulder-length curls, slightly squinting eyes, environmental street portrait with text "FLYMY AI" on t-shirt.'''
negative_prompt =  " "
image = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    width=1024,
    height=1024,
    num_inference_steps=50,
    true_cfg_scale=5,
    generator=torch.Generator(device="cuda").manual_seed(346346)
)

# Display the image (in Jupyter or save to file)
image.show()
# or
image.save("output.png")
```

### 🖼️ Sample Output

![Sample Output](./assets/flymy_realism.png)

## 🎛️ Using with ComfyUI

We provide a ready-to-use ComfyUI workflow that works with our trained LoRA models. Follow these steps to set up and use the workflow:

### Setup Instructions

1. **Download the latest ComfyUI**:
   - Visit the [ComfyUI GitHub repository](https://github.com/comfyanonymous/ComfyUI)
   - Clone or download the latest version

2. **Install ComfyUI**:
   - Follow the installation instructions from the [ComfyUI repository](https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#installing)
   - Make sure all dependencies are properly installed

3. **Download Qwen-Image model weights**:
   - Go to [Qwen-Image ComfyUI weights](https://huggingface.co/Comfy-Org/Qwen-Image_ComfyUI/tree/main)
   - Download all the model files

4. **Place Qwen-Image weights in ComfyUI**:
   - Copy the downloaded Qwen-Image model files to the appropriate folders in `ComfyUI/models/`
   - Follow the folder structure as specified in the model repository

5. **Download our pre-trained LoRA weights**:
   - Visit [flymy-ai/qwen-image-lora](https://huggingface.co/flymy-ai/qwen-image-lora)
   - Download the LoRA `.safetensors` files

6. **Place LoRA weights in ComfyUI**:
   - Copy the LoRA file `flymy-ai/qwen-image-lora/pytorch_lora_weights.safetensors` to `ComfyUI/models/loras/`

7. **Load the workflow**:
   - Open ComfyUI in your browser
   - Load the workflow file `qwen_image_lora_example.json` located in this repository
   - The workflow is pre-configured to work with our LoRA models

### Workflow Features

- ✅ Pre-configured for Qwen-Image + LoRA inference
- ✅ Optimized settings for best quality output
- ✅ Easy prompt and parameter adjustment
- ✅ Compatible with all our trained LoRA models

The ComfyUI workflow provides a user-friendly interface for generating images with our trained LoRA models without needing to write Python code.

### 🖼️ Workflow Screenshot

![ComfyUI Workflow](./assets/comfyui_workflow.png)

## 📊 Side-by-Side Comparisons

Below are comparison images showing the improvements of our LoRA model across different scenarios:

### Mountain Landscape
**Prompt**: *"realism, mountain landscape at golden hour, low sun grazing alpine meadows, crisp ridgelines, thin mist in valley, high dynamic range sky, 24mm f/8, ISO 100, tripod, ultra-sharp foreground grass, micro-contrast on rocks"*

![Mountain Landscape Comparison](./assets/prompt_004_comparison.png)

### Rainy Night City Scene
**Prompt**: *"realism, rainy night city scene with neon reflections, person holding a transparent umbrella, water droplets sharp on umbrella surface, shallow DOF, 55mm f/1.8, ISO 1600, blue and magenta neon, storefront sign reads "OPEN 24/7""*

![Rainy Night City Comparison](./assets/prompt_006_comparison.png)

### Athletic Motion
**Prompt**: *"realism, athlete mid-sprint on track, strong sunlight, backlit dust particles, frozen motion at 1/2000s, 200mm f/2.8, ISO 400, muscle definition and sweat droplets detailed, stadium banner says "FINALS""*

![Athletic Motion Comparison](./assets/prompt_010_comparison.png)

### Corporate Portrait
**Prompt**: *"realism, corporate headshot of a CTO in glass-walled office, city skyline bokeh, balanced key/fill lighting, 85mm f/2, ISO 100, crisp lapel and hair detail, subtle reflection on glasses"*

![Corporate Portrait Comparison](./assets/prompt_013_comparison.png)

## 🤝 Support

If you have questions or suggestions, join our community:
- 🌐 [FlyMy.AI](https://flymy.ai)
- 💬 [Discord Community](https://discord.com/invite/t6hPBpSebw)
- 🐦 [Follow us on X](https://x.com/flymyai)
- 💼 [Connect on LinkedIn](https://linkedin.com/company/flymyai)
- 📧 [Support](mailto:[email protected])

**⭐ Don't forget to star the repository if you like it!**


---
license: apache-2.0
---