Update README.md
README.md
CHANGED
@@ -23,7 +23,7 @@ pipeline_tag: image-text-to-text
 
 ## Overview
 
-**AgentCPM-GUI** is an open-source on-device LLM agent model jointly developed by [THUNLP](https://nlp.csai.tsinghua.edu.cn) and [ModelBest](https://modelbest.cn/en). Built on [MiniCPM-V](https://github.com/OpenBMB/MiniCPM-V) with 8 billion parameters, it accepts smartphone screenshots as input and autonomously executes user-specified tasks.
+**AgentCPM-GUI** is an open-source on-device LLM agent model jointly developed by [THUNLP](https://nlp.csai.tsinghua.edu.cn), Renmin University of China and [ModelBest](https://modelbest.cn/en). Built on [MiniCPM-V](https://github.com/OpenBMB/MiniCPM-V) with 8 billion parameters, it accepts smartphone screenshots as input and autonomously executes user-specified tasks.
 
 Key features include:
 
@@ -42,7 +42,7 @@ https://github.com/user-attachments/assets/5472a659-cd71-4bce-a181-0981129c6a81
 
 ```bash
 git clone https://github.com/OpenBMB/AgentCPM-GUI
-cd
+cd AgentCPM-GUI
 conda create -n gui_agent python=3.11
 conda activate gui_agent
 pip install -r requirements.txt
@@ -225,7 +225,7 @@ print(response)
 
 ## Fine-tuning
 
-Source code for SFT and RFT training is provided — see [
+Source code for SFT and RFT training is provided — see [GitHub](https://github.com/OpenBMB/AgentCPM-GUI).
 
 ## Performance Evaluation
 
@@ -261,7 +261,9 @@ Source code for SFT and RFT training is provided — see [SFT](sft/readme.md) an
 
 > \*Different train/test splits
 
-All evaluation data and code are open-sourced — see [here](eval) for details.
+TM and EM stand for the **Type Match** and **Exact Match**, respectively. All evaluation data and code are open-sourced — see [here](eval) for details.
+
+All evaluation data and code are open-sourced — see [here](https://github.com/OpenBMB/AgentCPM-GUI/tree/main/eval) for details.
 
 ## Evaluation Data
 