Hi, the licence of this model should be Apache 2.0 according to the blog.
#11 opened 25 days ago by Enigrand
Prompt for translation
#9 opened 2 months ago by joncam14
Thanks for not grossly overfitting this model.
#4 opened 3 months ago by phil111
Hi, could you consider training a 34B model using the RWKV architecture and comparing it with Transformers + Mamba?
1
#3 opened 3 months ago by win10

Could you please tell me which 18 languages you mainly support?
1
#2 opened 3 months ago by FantastyZhou
Was Qwen 3 tested with thinking on or off?
2
#1 opened 3 months ago by drmcbride