Update README.md
Browse files
README.md
CHANGED
@@ -1,4 +1,4 @@
|
|
1 |
-
# Qwen3-8B
|
2 |
|
3 |
## Qwen3 Highlights
|
4 |
|
@@ -12,9 +12,9 @@ Building upon extensive advancements in training data, model architecture, and o
|
|
12 |
|
13 |
## Model Overview
|
14 |
|
15 |
-
**Qwen3-8B** has the following features:
|
16 |
- Type: Causal Language Models
|
17 |
-
- Training Stage: Pretraining
|
18 |
- Number of Parameters: 8.2B
|
19 |
- Number of Paramaters (Non-Embedding): 6.95B
|
20 |
- Number of Layers: 36
|
|
|
1 |
+
# Qwen3-8B-Base
|
2 |
|
3 |
## Qwen3 Highlights
|
4 |
|
|
|
12 |
|
13 |
## Model Overview
|
14 |
|
15 |
+
**Qwen3-8B-Base** has the following features:
|
16 |
- Type: Causal Language Models
|
17 |
+
- Training Stage: Pretraining
|
18 |
- Number of Parameters: 8.2B
|
19 |
- Number of Paramaters (Non-Embedding): 6.95B
|
20 |
- Number of Layers: 36
|