lucyknada
/

janhq_Jan-v1-4B-exl3

Text Generation

English

Model card Files Files and versions

xet

Community

lucyknada commited on 13 days ago

Commit

8e8797c

verified ·

1 Parent(s): 3c123f9

Upload ./README.md with huggingface_hub

Browse files

Files changed (1) hide show

README.md +92 -0

README.md ADDED Viewed

	@@ -0,0 +1,92 @@

+---
+license: apache-2.0
+language:
+- en
+base_model:
+- Qwen/Qwen3-4B-Thinking-2507
+pipeline_tag: text-generation
+---
+### exl3 quant
+---
+### check revisions for quants
+---
+# Jan-v1: Advanced Agentic Language Model
+[![GitHub](https://img.shields.io/badge/GitHub-Repository-blue?logo=github)](https://github.com/menloresearch/deep-research)
+[![License](https://img.shields.io/badge/License-Apache%202.0-yellow)](https://opensource.org/licenses/Apache-2.0)
+[![Jan App](https://img.shields.io/badge/Powered%20by-Jan%20App-purple?style=flat&logo=android)](https://jan.ai/)
+<!-- Optional: If you have a GIF for Jan-v1, include it here like Lucy's. -->
+<!-- ![image/gif](jan_v1_demo.gif) -->
+## Overview
+**Jan-v1** is the first release in the **Jan Family**, designed for agentic reasoning and problem-solving within the [Jan App](https://jan.ai/). Based on our [**Lucy**](https://huggingface.co/Menlo/Lucy) model, Jan-v1 achieves improved performance through model scaling.
+Jan-v1 uses the [Qwen3-4B-thinking](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507) model to provide enhanced reasoning capabilities and tool utilization. This architecture delivers better performance on complex agentic tasks.
+## Performance
+### Question Answering (SimpleQA)
+For question-answering, Jan-v1 shows a significant performance gain from model scaling, achieving 91.1% accuracy.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/6CaETynCW18MXgDrbp_N9.png)
+*The 91.1% SimpleQA accuracy represents a significant milestone in factual question answering for models of this scale, demonstrating the effectiveness of our scaling and fine-tuning approach.*
+### Chat Benchmarks
+These benchmarks evaluate the model's conversational and instructional capabilities.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/f3bzNYRuA_iTFQIcvu6Rr.png)
+## Quick Start
+### Integration with Jan App
+Jan-v1 is optimized for direct integration with the [Jan App](https://jan.ai/). Simply select the model from the Jan App interface for immediate access to its full capabilities.
+![image/gif](demo.gif)
+### Local Deployment
+**Using vLLM:**
+```bash
+vllm serve janhq/Jan-v1-4B \
+    --host 0.0.0.0 \
+    --port 1234 \
+    --enable-auto-tool-choice \
+    --tool-call-parser hermes
+```
+**Using llama.cpp:**
+```bash
+llama-server --model Jan-v1-4B-Q4_K_M.gguf \
+    --host 0.0.0.0 \
+    --port 1234 \
+    --jinja \
+    --no-context-shift
+```
+### Recommended Parameters
+```yaml
+temperature: 0.6
+top_p: 0.95
+top_k: 20
+min_p: 0.0
+max_tokens: 2048
+```
+## 🤝 Community & Support
+- **Discussions**: [HuggingFace Community](https://huggingface.co/janhq/Jan-v1-4B/discussions)
+- **Jan App**: Learn more about the Jan App at [jan.ai](https://jan.ai/)
+## 📄 Citation
+```bibtex
+Updated Soon
+```
+---