Qwen3-30B-A3B-Instruct-2507-GGUF
Original Model
Qwen/Qwen3-30B-A3B-Instruct-2507
Run with LlamaEdge
- LlamaEdge version: coming soon
Prompt template
Prompt type:
chatml
Prompt string
<|im_start|>system {system_message}<|im_end|> <|im_start|>user {prompt}<|im_end|> <|im_start|>assistant
Context size:
256000
Run as LlamaEdge service
wasmedge --dir .:. --nn-preload default:GGML:AUTO:Qwen3-30B-A3B-Instruct-2507-Q5_K_M.gguf \ llama-api-server.wasm \ --model-name Qwen3-30B-A3B-Instruct-2507 \ --prompt-template chatml \ --ctx-size 256000
Quantized with llama.cpp b6031
- Downloads last month
- 558
Hardware compatibility
Log In
to view the estimation
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for second-state/Qwen3-30B-A3B-Instruct-2507-GGUF
Base model
Qwen/Qwen3-30B-A3B-Instruct-2507