Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,26 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Qwen3-4B-Instruct-2507
|
2 |
+
|
3 |
+
## Model Description
|
4 |
+
**Qwen3-4B-Instruct-2507** is an updated non-thinking variant in the Qwen3 family, designed for instruction-following tasks without generating `<think></think>` reasoning blocks.
|
5 |
+
Trained for enhanced general capabilities—including logic, coding, math, science, and long-tail multilingual knowledge—while natively supporting sprawling 256K-token contexts.
|
6 |
+
|
7 |
+
## Features
|
8 |
+
- **Instruction-tuned performance**: Strong at prompts, logic, comprehension, coding.
|
9 |
+
- **Multilingual strength**: Expanded long-tail coverage across many languages.
|
10 |
+
- **Massive context window**: Handles up to 262,144 tokens natively.
|
11 |
+
- **Clean output**: No thinking-mode parsing needed—just straight responses.
|
12 |
+
|
13 |
+
## Use Cases
|
14 |
+
- High-quality conversational agents and instruction following
|
15 |
+
- Processing long documents, books, legal texts, and source code
|
16 |
+
- Multilingual tasks or low-resource language scenarios
|
17 |
+
|
18 |
+
## Inputs and Outputs
|
19 |
+
**Input**: Text prompts—questions, commands, code tasks—without any special thinking mode flags.
|
20 |
+
**Output**: Direct, context-aware responses—answers, explanations, code—with no internal thought annotations.
|
21 |
+
|
22 |
+
## License
|
23 |
+
- Licensed under **Apache-2.0**
|
24 |
+
|
25 |
+
## References
|
26 |
+
- Model card: [https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507)
|