docs: fix typos
README.md
CHANGED
````diff
@@ -28,7 +28,7 @@ base_model: openai/gpt-oss-20b
 * **Native MXFP4 quantization:** The models are trained with native MXFP4 precision for the MoE layer, making `gpt-oss-120b` run on a single 80GB GPU (like NVIDIA H100 or AMD MI300X) and the `gpt-oss-20b` model run within 16GB of memory.
 
 > [!NOTE]
-> Refer to the [original model card](https://huggingface.co/openai/gpt-oss-20b) for more details on the model
+> Refer to the [original model card](https://huggingface.co/openai/gpt-oss-20b) for more details on the model
 
 # Quants
 | Link | [URI](https://node-llama-cpp.withcat.ai/cli/pull) | Size |
@@ -87,7 +87,7 @@ console.log("AI: " + a1);
 ```
 
 > [!TIP]
-> Read the [getting started guide](https://node-llama-cpp.withcat.ai/guide/) to quickly scaffold a new `node-llama-cpp` project
+> Read the [getting started guide](https://node-llama-cpp.withcat.ai/guide/) to quickly scaffold a new `node-llama-cpp` project
 
 #### Customize inference options
 Set [Harmony](https://cookbook.openai.com/articles/openai-harmony) options using [`HarmonyChatWrapper`](https://node-llama-cpp.withcat.ai/api/classes/HarmonyChatWrapper):
````