Text Generation
GGUF
English
gpt_oss
gpt-oss
openai
mxfp4
programming
code generation
code
coding
coder
chat
reasoning
thinking
r1
cot
deepseek
128k context
general usage
problem solving
brainstorming
solve riddles
uncensored
abliterated
Neo
MOE
Mixture of Experts
24 experts
NEO Imatrix
Imatrix
DI-Matrix
Tri-Matrix
imatrix
conversational
Update README.md
Browse files
README.md
CHANGED
@@ -123,13 +123,13 @@ Strongest Imatrix effect(s) are IQ quants and the strength of the effect is inve
|
|
123 |
DI-Matrix and TRI-Matrix are "averages" of 2 and 3 imatrix datasets (generated specifically for a model, separately). This averaging
|
124 |
can "trim" some effects and/or add some "traits" and make better quants.
|
125 |
|
126 |
-
In the case of abliterated model(s), I find
|
127 |
|
128 |
Depending on your use case(s) regular imatrix, and/or DI/TRI imatrix quant(s) may meet different use case(s) requirement(s).
|
129 |
|
130 |
To test: Try 2-5 generations per quant (same prompt, exact same settings), then evaluate output/thinking.
|
131 |
|
132 |
-
The Imatrix effect itself depends on the model being imatrixed, strength of the imatrix dataset and the quant(s) targeted.
|
133 |
|
134 |
The Q8 quants (only) have been modified to allow limited imatrix effect(s) in this case: the output tensor only.
|
135 |
|
|
|
123 |
DI-Matrix and TRI-Matrix are "averages" of 2 and 3 imatrix datasets (generated specifically for a model, separately). This averaging
|
124 |
can "trim" some effects and/or add some "traits" and make better quants.
|
125 |
|
126 |
+
In the case of abliterated model(s), I find "imatrixing" quants can fix minor issues caused by the abliteration process in some cases.
|
127 |
|
128 |
Depending on your use case(s) regular imatrix, and/or DI/TRI imatrix quant(s) may meet different use case(s) requirement(s).
|
129 |
|
130 |
To test: Try 2-5 generations per quant (same prompt, exact same settings), then evaluate output/thinking.
|
131 |
|
132 |
+
The Imatrix effect itself depends on the model being imatrixed, strength of the imatrix dataset(s) and the quant(s) targeted.
|
133 |
|
134 |
The Q8 quants (only) have been modified to allow limited imatrix effect(s) in this case: the output tensor only.
|
135 |
|