janhq
/

Jan-v1-4B

@@ -25,13 +25,23 @@ Jan-v1 leverages the newly released [Qwen3-4B-thinking](https://huggingface.co/Q
 Jan-v1's strategic scaling has resulted in a notable performance uplift, particularly evident in its "thinking" and reasoning prowess. Following the established MCP benchmark methodology, Jan-v1 sets a new standard for models in its class.
-| Model                               | SimpleQA Accuracy |
-| :---------------------------------- | :---------------- |
-| **Jan-v1 (Qwen3-4B)**               | **91.2%**         |
-| Lucy (Qwen3-1.7B)                   | [Lucy's Score]    |
-| DeepSeek-v3 (Comparison from Lucy)  | [DeepSeek's Score]|
-The **91.2% accuracy on SimpleQA** underscores Jan-v1's advanced ability to precisely retrieve and synthesize information, showcasing the effectiveness of our model scaling approach for agentic intelligence.
 ## Quick Start

 Jan-v1's strategic scaling has resulted in a notable performance uplift, particularly evident in its "thinking" and reasoning prowess. Following the established MCP benchmark methodology, Jan-v1 sets a new standard for models in its class.
+| Model | SimpleQA Accuracy |
+| :--- | :--- |
+| **Jan-v1 (Ours, Qwen3-4B)** | **91.1%** |
+| Jan-nano-128k-MCP (YaRN 130k) | 83.2% |
+| Jan-nano-MCP | 80.7% |
+| Jan-nano-MCP (YaRN 130k) | 79.7% |
+| Lucy (YaRN 130k) | 78.3% |
+| DeepSeek-V3-MCP | 78.2% |
+| ChatGPT-4.5 | 62.5% |
+| Baseline-MCP | 59.2% |
+| Gemini-2.5-Pro | 52.9% |
+| Claude-3.7-Sonnet | 50% |
+| o3 | 49.4% |
+| Grok-3 | 44.6% |
+| o1 | 42.6% |
+The **91.1% accuracy on SimpleQA** underscores Jan-v1's advanced ability to precisely retrieve and synthesize information, showcasing the effectiveness of our model scaling approach for agentic intelligence.
 ## Quick Start