alanzhuly commited on
Commit
158f497
·
verified ·
1 Parent(s): 644fca6

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -0
README.md ADDED
@@ -0,0 +1,26 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Qwen3-4B-Instruct-2507
2
+
3
+ ## Model Description
4
+ **Qwen3-4B-Instruct-2507** is an updated non-thinking variant in the Qwen3 family, designed for instruction-following tasks without generating `<think></think>` reasoning blocks.
5
+ Trained for enhanced general capabilities—including logic, coding, math, science, and long-tail multilingual knowledge—while natively supporting sprawling 256K-token contexts.
6
+
7
+ ## Features
8
+ - **Instruction-tuned performance**: Strong at prompts, logic, comprehension, coding.
9
+ - **Multilingual strength**: Expanded long-tail coverage across many languages.
10
+ - **Massive context window**: Handles up to 262,144 tokens natively.
11
+ - **Clean output**: No thinking-mode parsing needed—just straight responses.
12
+
13
+ ## Use Cases
14
+ - High-quality conversational agents and instruction following
15
+ - Processing long documents, books, legal texts, and source code
16
+ - Multilingual tasks or low-resource language scenarios
17
+
18
+ ## Inputs and Outputs
19
+ **Input**: Text prompts—questions, commands, code tasks—without any special thinking mode flags.
20
+ **Output**: Direct, context-aware responses—answers, explanations, code—with no internal thought annotations.
21
+
22
+ ## License
23
+ - Licensed under **Apache-2.0**
24
+
25
+ ## References
26
+ - Model card: [https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507)