My AI startup that's all about the `mesh` architecture: a novel solution to the problems of MoE
aquiffoo
aquiffoo
AI & ML interests
small but capable open models.
Recent Activity
updated
a model
8 minutes ago
mesh-labs/v0.1-2x2-stage002
updated
a Space
about 2 hours ago
mesh-labs/README
reacted
to
danielhanchen's
post
with 🔥
about 6 hours ago
Run DeepSeek-V3.1 locally on 170GB RAM with Dynamic 1-bit GGUFs!🐋
GGUFs: https://huggingface.co/unsloth/DeepSeek-V3.1-GGUF
The 715GB model gets reduced to 170GB (-80% size) by smartly quantizing layers.
The 1-bit GGUF passes all our code tests & we fixed the chat template for llama.cpp supported backends.
Guide: https://docs.unsloth.ai/basics/deepseek-v3.1