- pankajpandey-dev/Carbon-3B-GGUF
  Text Generation • 3B • Updated • 630 • 3
- pankajpandey-dev/MiniCPM5-1B-Hindi-Instruct-v1-GGUF
  Text Generation • 1B • Updated • 589 • 1
- pankajpandey-dev/Qwen3-0.6B-Hindi-Instruct-v1-GGUF
  Text Generation • 0.6B • Updated • 586 • 1
- pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2-GGUF
  Text Generation • 4B • Updated • 223 • 1
Pankaj Pandey
pankajpandey-dev
AI & ML interests
Natural Language Processing, Text Generation, Large Language Models, Quantization, Fine-Tuning, RLHF, Model Merging
Recent Activity
reacted to their post with 🔥 5 days ago
🇮🇳 Gemma-3-1B Hindi Instruct — a Hindi LLM that runs fully offline, anywhere.
Last week I shipped Qwen3-4B Hindi. This week I went the other direction: how tiny can a useful Hindi model get? So I fine-tuned Gemma-3-1B on quality-filtered Hindi instruction data and shipped the full GGUF ladder.
✅ Fine-tune (16-bit): https://huggingface.co/pankajpandey-dev/gemma-3-1b-hindi-instruct
✅ GGUF (Q4/Q5/Q8): https://huggingface.co/pankajpandey-dev/gemma-3-1b-hindi-instruct-GGUF
Runs in Ollama, llama.cpp, and LM Studio. The Q4_K_M is just 806 MB — runs on CPU, a cheap laptop, even a Raspberry Pi.
What I tried this round: chrF-filtered the training data to drop weak translations, and used response-only loss so the model learns how to answer, not how to repeat prompts.
Honest note: at 1B, Hindi fluency is strong but coherence is bounded by size — it's a lightweight/edge experiment, not a 4B replacement. Gemma-3-4B Hindi is next.
Part of my Hindi LLM Series — openly-licensed Indic models for local & edge use. Feedback welcome 🙏
#Hindi #IndicNLP #GGUF #LocalLLM #Gemma #EdgeAI
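The response-only loss mentioned in the post can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's training code: the function names `build_labels` and `masked_mean_loss` are hypothetical, and the ignore value of -100 follows the default `ignore_index` of PyTorch's `CrossEntropyLoss`, which the post does not confirm was used.

```python
# Assumption: -100 mirrors the default ignore_index of torch.nn.CrossEntropyLoss.
IGNORE_INDEX = -100


def build_labels(prompt_ids, response_ids):
    """Concatenate prompt and response tokens; mask the prompt in the labels.

    Hypothetical helper: prompt positions get IGNORE_INDEX so they contribute
    nothing to the loss, while response positions keep their token ids.
    """
    input_ids = list(prompt_ids) + list(response_ids)
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(response_ids)
    return input_ids, labels


def masked_mean_loss(per_token_losses, labels):
    """Average per-token losses over response positions only."""
    kept = [
        loss
        for loss, label in zip(per_token_losses, labels)
        if label != IGNORE_INDEX
    ]
    # Guard against an example with an empty response.
    return sum(kept) / len(kept) if kept else 0.0


# Toy example: three prompt tokens, two response tokens.
input_ids, labels = build_labels([1, 2, 3], [4, 5])
# Large losses on the prompt positions are ignored by the mask.
loss = masked_mean_loss([9.0, 9.0, 9.0, 1.0, 3.0], labels)
```

In the toy example only the last two positions count toward the loss, so the model is not rewarded or penalised for reproducing the prompt.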
liked a Space 7 days ago
pankajpandey-dev/qwen3-4b-hindi-demo
updated a Space 8 days ago
pankajpandey-dev/qwen3-4b-hindi-demo