Update README.md
Browse files
README.md
CHANGED
@@ -15,12 +15,18 @@ Uploading this since I'm using it to calculate imatrix, figured might as well pr
|
|
15 |
Remember, this is a **BASE** model, so it likely will not chat properly unless you give it multiple turns of examples, for instance I've had success with:
|
16 |
|
17 |
```
|
18 |
-
./llama-cli -m /models/deepseek-ai_DeepSeek-V3.1-Base-Q4_K_M-00001-of-00011.gguf -p "You are a helpful assistant
|
19 |
```
|
20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
21 |
This resulted in a completely coherent reply:
|
22 |
|
23 |
-
>
|
24 |
|
25 |
The idea is that you need to teach the base model what a conversation looks like first, base models aren't usually capable of one-shotting a conversation since it hasn't been tuned to understand roles.
|
26 |
|
|
|
15 |
Remember, this is a **BASE** model, so it likely will not chat properly unless you give it multiple turns of examples, for instance I've had success with:
|
16 |
|
17 |
```
|
18 |
+
./llama-cli -m /models/deepseek-ai_DeepSeek-V3.1-Base-Q4_K_M-00001-of-00011.gguf -p "You are a helpful assistant.<User>Hello, who are you?<Assistant>I am DeepSeek, a helpful AI assistant.<User>How are you today?<Assistant>I'm doing well! Is there anything I can assist you with?<User>Can you explain the laws of thermodynamics?<Assistant>" -no-cnv -ngl 0 --reverse-prompt "<User>"
|
19 |
```
|
20 |
|
21 |
+
Prompt for easier viewing:
|
22 |
+
|
23 |
+
`You are a helpful assistant.<User>Hello, who are you?<Assistant>I am DeepSeek, a helpful AI assistant.<User>How are you today?<Assistant>I'm doing well! Is there anything I can assist you with?<User>Can you explain the laws of thermodynamics?<Assistant>" -no-cnv -ngl 0 --reverse-prompt "<User>"`
|
24 |
+
|
25 |
+
*Yes*, I am using `<User>` and `<Assistant>` as opposed to the special tokens `<|User|>` and `<|Assistant|>`, for some reason this seems to be more stable?
|
26 |
+
|
27 |
This resulted in a completely coherent reply:
|
28 |
|
29 |
+
> Sure, here's a brief explanation of the laws of thermodynamics: 1. Zeroth Law of Thermodynamics: If two thermodynamic systems are each in thermal equilibrium with a third system, then they are in thermal equilibrium with each other. 2. First Law of Thermodynamics: The total energy of an isolated system is constant; energy can be transformed from one form to another, but cannot be created or destroyed. 3. Second Law of Thermodynamics: The entropy of an isolated system not in equilibrium will tend to increase over time, approaching a maximum value at equilibrium. 4. Third Law of Thermodynamics: As the temperature of a system approaches absolute zero, the entropy of the system approaches a minimum value. Would you like more details on any of these laws?
|
30 |
|
31 |
The idea is that you need to teach the base model what a conversation looks like first, base models aren't usually capable of one-shotting a conversation since it hasn't been tuned to understand roles.
|
32 |
|