WaltonFuture
/

Qwen2.5-VL-7B-MM-UPT-MMR1

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions

WaltonFuture commited on May 29

Commit

c15d4b4

·

verified ·

1 Parent(s): f561612

Update README.md

Files changed (1) hide show

README.md +11 -3

README.md CHANGED Viewed

@@ -1,3 +1,11 @@
----
-license: mit
----

+---
+license: mit
+datasets:
+- MMR1/MMR1-Math-RL-Data-v0
+base_model:
+- Qwen/Qwen2.5-VL-7B-Instruct
+---
+* 🐙 **GitHub Repo:** [waltonfuture/MM-UPT](https://github.com/waltonfuture/MM-UPT)
+* 📜 **Paper (arXiv):** [Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO (arXiv:2505.22453)](https://arxiv.org/abs/2505.22453)
+* 💾 **Dataset:** [WaltonFuture/MMR1-direct-synthesizing on Hugging Face](https://huggingface.co/datasets/WaltonFuture/MMR1-direct-synthesizing)