FP8 Quant Please
#11 opened about 1 month ago
by
rjmehta
Best model for SD?
#10 opened about 1 month ago
by
darkstar3537
AWQ Quantization plz.
👍
1
#9 opened 2 months ago
by
hyunw55
can use vllm deploy this model?
2
#8 opened 2 months ago
by
tianyer
How do I separate the reasoning from the reply?
1
#7 opened 2 months ago
by
Lockout

TTFT deteriorates rapidly after Concurrency reaches 72.
1
#5 opened 2 months ago
by
theGreatGuy

update metadata
#3 opened 2 months ago
by
nickname100231
Can we expect you to open source older versions of Kimi that you developed in-house?
#2 opened 2 months ago
by
win10

gguf model?
👍
2
3
#1 opened 2 months ago
by
segmond