Image-Text-to-Text
Transformers
Safetensors
English
internvl_chat
feature-extraction
mathematics
reasoning
multi-modal-qa
math-qa
figure-qa
geometry-qa
math-word-problem
textbook-qa
vqa
geometry-diagram
synthetic-scene
chart
plot
scientific-figure
table
function-plot
abstract-scene
puzzle-test
document-image
science
conversational
custom_code
Add link to paper and sample usage
This PR links the model to https://huggingface.co/papers/2505.10557 and adds sample usage to the model card, making it easier for users to get started with the model.
README.md CHANGED

````diff
@@ -1,14 +1,15 @@
 ---
-license: apache-2.0
+base_model:
+- OpenGVLab/Mini-InternVL-Chat-2B-V1-5
 language:
-- en
+- en
+library_name: transformers
+license: apache-2.0
 metrics:
-- accuracy
+- accuracy
 pipeline_tag: image-text-to-text
-library_name: transformers
-base_model:
-- OpenGVLab/Mini-InternVL-Chat-2B-V1-5
 ---
+
 # MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning
 
 Repo: [https://github.com/mathllm/MathCoder](https://github.com/mathllm/MathCoder)
@@ -26,8 +27,22 @@ We introduce MathCoder-VL, a series of open-source large multimodal models (LMMs
 
 
 ## Usage
+
 For training and inference code, please refer to [InternVL](https://github.com/OpenGVLab/InternVL).
 
+**Example:** (Illustrative - adapt to your specific needs and refer to InternVL for details)
+
+```python
+from transformers import pipeline
+
+pipe = pipeline("image-text-to-text", model="MathLLMs/MathCoder-VL-2B", device=0, trust_remote_code=True)  # replace with your preferred model and device; trust_remote_code is required for the custom internvl_chat code
+
+image = "path/to/your/image.png"  # replace with your image path
+prompt = "What is the area of the shape in this image?"
+
+result = pipe(images=image, text=prompt)
+print(result)
+```
 
 ## Motivation
 
````
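On newer `transformers` releases, the `image-text-to-text` pipeline also accepts chat-style messages, which keeps the image and the question in a single user turn. Below is a minimal sketch of that variant; it assumes the pipeline's chat path works with the model's custom `internvl_chat` code (not verified here), and the image URL is a placeholder:

```python
from transformers import pipeline

# trust_remote_code is required because the model ships custom internvl_chat code
pipe = pipeline(
    "image-text-to-text",
    model="MathLLMs/MathCoder-VL-2B",
    device=0,
    trust_remote_code=True,
)

# Chat-style input: one user turn carrying both the image and the question.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/geometry_figure.png"},  # placeholder URL
            {"type": "text", "text": "What is the area of the shape in this image?"},
        ],
    }
]

result = pipe(text=messages, max_new_tokens=256)
print(result)
```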