Update README.md
Browse files
README.md
CHANGED
@@ -19,6 +19,35 @@ Trained for enhanced general capabilities—including logic, coding, math, scien
|
|
19 |
**Input**: Text prompts—questions, commands, code tasks—without any special thinking mode flags.
|
20 |
**Output**: Direct, context-aware responses—answers, explanations, code—with no internal thought annotations.
|
21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
22 |
## License
|
23 |
- Licensed under **Apache-2.0**
|
24 |
|
|
|
19 |
**Input**: Text prompts—questions, commands, code tasks—without any special thinking mode flags.
|
20 |
**Output**: Direct, context-aware responses—answers, explanations, code—with no internal thought annotations.
|
21 |
|
22 |
+
---
|
23 |
+
|
24 |
+
## How to use
|
25 |
+
|
26 |
+
> ⚠️ **Hardware requirement:** the model currently runs **only on Qualcomm NPUs** (e.g., Snapdragon-powered AIPC).
|
27 |
+
> Apple NPU support is planned next.
|
28 |
+
|
29 |
+
### 1) Install Nexa-SDK
|
30 |
+
|
31 |
+
- Download and follow the steps under "Deploy Section" Nexa's model page: [Download Windows arm64 SDK](https://sdk.nexa.ai/model/Qwen3-4B-Instruct-2507)
|
32 |
+
- (Other platforms coming soon)
|
33 |
+
|
34 |
+
### 2) Get an access token
|
35 |
+
Create a token in the Model Hub, then log in:
|
36 |
+
|
37 |
+
```bash
|
38 |
+
nexa config set license '<access_token>'
|
39 |
+
```
|
40 |
+
|
41 |
+
### 3) Run the model
|
42 |
+
Running:
|
43 |
+
|
44 |
+
```bash
|
45 |
+
nexa infer NexaAI/Qwen3-4B-Instruct-2507-npu
|
46 |
+
```
|
47 |
+
|
48 |
+
---
|
49 |
+
|
50 |
+
|
51 |
## License
|
52 |
- Licensed under **Apache-2.0**
|
53 |
|