Unable to download the generated code
#21 opened 4 days ago
by
ssfarzad
Fixed 🔨 GGUF Tool calling ✅MCP working ✅
1
#19 opened 14 days ago
by
xbruce22

Will there still be 32B dense models?
➕
👀
7
1
#18 opened 19 days ago
by
lingyezhixing
Upload Marginal adapttaion.pdf
#17 opened 21 days ago
by
thenunabdo
Please create 8-bit MLX - No one has it anywhere...
#16 opened 23 days ago
by
Darkslayerofdark
Questions on FP8 inference, parallel requests, and context length with 4x H200s
2
#15 opened 24 days ago
by
sultan93
Does its API support format?
#14 opened 24 days ago
by
Connde
Impressive Broad Knowledge
👍
👀
5
4
#12 opened 25 days ago
by
phil111
Thinking tokens issue
👍
2
11
#9 opened 26 days ago
by
iyanello
Benchmarks for non-thinking mode
👍
4
2
#8 opened 26 days ago
by
PSM24
Thank you GLM Team for the wonderful MoE Model
🔥
6
#7 opened 26 days ago
by
Narutoouz

AWQ 4-bit / GPTQ with full-precision gates and head? Please
8
#4 opened 27 days ago
by
chriswritescode

We Have Gemini At Home
4
#1 opened 27 days ago
by
MarinaraSpaghetti
