Wow, amazing response time

#1
by AlexPradas - opened

I am impressed by how responsive you are in making the models available. Thank you!

I agree, and again thank you so much to both of you at Unsloth - you are such an amazing part of this open source community.

It's not up yet, guys. Just be patient. 🤞😋

Unsloth AI org

Hey guys we're still working on uploading them! Stay tuned!

The one featured by LM Studio is too large for my 16 GB M1 Pro, but my PC with a 12 GB 4070 can chat with it just fine... patiently waiting for the upload so I can download it!

@testosterones Throw this in your terminal. It tells macOS to make 14 GB of unified memory accessible to the GPU - you'll be able to run the 12.11 GB MXFP4.

 sudo sysctl iogpu.wired_limit_mb=14336
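(Side note, a rough sketch assuming the stock macOS sysctl tool: you can read the current limit back with the same key, and the setting doesn't survive a reboot, so you'd re-run the command after restarting.)

 sysctl iogpu.wired_limit_mb                # print the current GPU wired-memory limit in MB
 sudo sysctl iogpu.wired_limit_mb=14336     # re-apply after a reboot
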
Unsloth AI org

It's uploaded now!! The FP4 version. Please update whichever inference engine you're using!

Dynamic GGUFs with different sizes will come later - thanks to llama.cpp, once they update it!

CC: @AlexPradas @Joseph717171 @sandlercaleb @testosterones @adhish @TobDeBer @alt909 @owao @Metricon @drjabaka @Tom-Neverwinter

@shimmyshimmer That a boy, Mikey!!! You and Dan fucking rock! 🚀 🚀
