A Math ehanched Qwen 2.5VL 3B with VLM-R1 reinforcement learning.
cite: arxiv.org/abs/2504.07615
Chat template
Files info
Base model
Totally Free + Zero Barriers + No Login Required