Share code to convert model

#1
by williamljx - opened

Hi, could you share code for converting the Medgemma-4b-pt model (.safetensors) to ONNX?

I convert the models to ONNX using onnxruntime-genai:

pip install onnxruntime-genai --pre
pip install onnx onnx-ir safetensors torch transformers gguf

then run the following:

from onnxruntime_genai.models.builder import create_model

create_model(
    "hf id name",    # model_name: Hugging Face model id
    "",              # input_path: local model dir or GGUF file ("" to download by id)
    "output dir",    # output_dir: where the ONNX model is written
    "int4",          # precision: e.g. int4, fp16, fp32
    "cpu",           # execution_provider: cpu, cuda, or webgpu
    "cache_dir",     # cache_dir: where downloaded files are cached
    **extra)         # extra_options passed through to the builder
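For MedGemma specifically, a concrete call might look like the sketch below. The model id, output directory, and cache directory are placeholder assumptions, and the actual conversion line is commented out so the sketch runs without the package installed:

```python
# Hypothetical argument set for converting MedGemma with the builder.
# Positional order follows create_model(model_name, input_path, output_dir,
# precision, execution_provider, cache_dir, **extra_options).
builder_args = {
    "model_name": "google/medgemma-4b-it",  # HF id (gated; needs an access token)
    "input_path": "",                       # empty -> download by model_name
    "output_dir": "./medgemma-4b-onnx",     # where the ONNX model is written
    "precision": "int4",                    # or "fp16" / "fp32"
    "execution_provider": "cpu",            # or "cuda" / "webgpu"
    "cache_dir": "./hf_cache",              # download cache
}

# Uncomment to run the actual conversion (downloads several GB of weights):
# from onnxruntime_genai.models.builder import create_model
# create_model(*builder_args.values())

for name, value in builder_args.items():
    print(f"{name}={value}")
```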


Just change the inputs to match your model.

Not all model types are supported.

The supported model types are listed in onnxruntime_genai\models\builder.py.
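Before running the builder, a quick pre-flight check of the model's declared architecture can save a failed conversion. The mapping below is a sketch for illustration only; the authoritative list of supported architectures lives in your installed copy of builder.py:

```python
import json

# Illustrative notes per architecture; check your installed
# onnxruntime_genai/models/builder.py for the real supported list.
SUPPORT_NOTES = {
    "Gemma3ForCausalLM": "text model",
    "Gemma3ForConditionalGeneration": "text component only (vision still in development)",
    "LlamaForCausalLM": "text model",
    "MistralForCausalLM": "text model",
}

def support_note(architecture: str) -> str:
    """Return a note for the given architecture name, based on the
    illustrative table above."""
    return SUPPORT_NOTES.get(architecture, "not in this sketch's table")

def note_from_config(config_json: str) -> str:
    """Read a model's config.json text and look up its first declared
    architecture."""
    arch = json.loads(config_json)["architectures"][0]
    return support_note(arch)

# MedGemma declares Gemma3ForConditionalGeneration in its config.json:
print(note_from_config('{"architectures": ["Gemma3ForConditionalGeneration"]}'))
```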

Thanks for your reply! Since both MedGemma-4b-pt and MedGemma-4b-it use the Gemma3ForConditionalGeneration architecture, the builder will generate only the text component of the model.

The vision ONNX model is still in development.
