Improve model card with pipeline tag and library name
This PR improves the model card by:
- Adding `pipeline_tag: image-text-to-text` so the model is categorized correctly on the Hugging Face Hub and shows up when users filter by the image-text-to-text task.
- Specifying `library_name: transformers` to make clear that the model is used with the Transformers library.
This enhances discoverability and provides crucial context for users.
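To check that a card actually carries the two fields, a minimal sketch along these lines could be used. It is an illustration only: `read_front_matter` is a hypothetical helper written for this example, it handles only flat `key: value` pairs, and the `CARD` string simply mirrors the front matter proposed in this PR. For real use, a proper YAML parser or the `ModelCard` class from `huggingface_hub` would be the safer choice.

```python
def read_front_matter(card_text: str) -> dict:
    """Return top-level `key: value` pairs from a card's YAML front matter.

    Deliberately simplified (hypothetical helper, not a YAML parser):
    list items and indented lines are skipped, so a key such as
    `base_model` maps to an empty string.
    """
    lines = card_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the card
    meta = {}
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter of the front matter
            break
        if line.startswith(("-", " ")) or ":" not in line:
            continue  # skip list items and nested lines
        key, _, value = line.partition(":")
        meta[key.strip()] = value.strip()
    return meta


# Front matter as proposed in this PR (copied here for illustration).
CARD = """---
base_model:
- Qwen/Qwen2-VL-2B-Instruct
license: mit
library_name: transformers
pipeline_tag: image-text-to-text
---

# GUI-Actor-2B with Qwen2-VL-2B as backbone VLM
"""

meta = read_front_matter(CARD)
# Collect any of the two discoverability fields that are absent or empty.
missing = [k for k in ("pipeline_tag", "library_name") if not meta.get(k)]
print(meta["pipeline_tag"], meta["library_name"], missing)
```

An empty `missing` list means both fields are present in the card.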
README.md
CHANGED
@@ -1,15 +1,17 @@
 ---
-license: mit
 base_model:
 - Qwen/Qwen2-VL-2B-Instruct
+license: mit
+library_name: transformers
+pipeline_tag: image-text-to-text
 ---

 # GUI-Actor-2B with Qwen2-VL-2B as backbone VLM

-This model was introduced in the paper [
+This model was introduced in the paper [GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents](https://www.arxiv.org/pdf/2506.03143).
 It is developed based on [Qwen2-VL-2B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().

-For more details on model design and evaluation, please check: [🏠 Project Page](https://
+For more details on model design and evaluation, please check: [🏠 Project Page](https://microsoft.github.io/GUI-Actor/) | [💻 Github Repo](https://github.com/microsoft/GUI-Actor) | [📑 Paper](https://www.arxiv.org/pdf/2506.03143).

 | Model Name | Hugging Face Link |
 |--------------------------------------------|--------------------------------------------|