| --- |
| pipeline_tag: text-to-image |
| --- |
| |
|  |
|
|
| Lumina-Image-2.0 is a 2 billion parameter flow-based diffusion transformer capable of generating images from text descriptions. |
|
|
|
|
| ## Usage |
|
|
| ```python |
| import torch |
| from diffusers import LuminaText2ImgPipeline |
| |
| pipe = Lumina2Text2ImgPipeline.from_pretrained("Alpha-VLLM/Lumina-Image-2.0", torch_dtype=torch.bfloat16) |
| pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power |
| |
| prompt = "A dog holding a sign that says hello world" |
| image = pipe( |
| prompt, |
| height=1024, |
| width=1024, |
| guidance_scale=4.0, |
| num_inference_steps=50, |
| cfg_trunc_ratio=0.25, |
| cfg_normalization=True, |
| generator=torch.Generator("cpu").manual_seed(0) |
| ).images[0] |
| image.save("lumina_demo.png") |
| ``` |