DavidAU
/

Qwen3-Jan-v1-256k-ctx-6B-Brainstorm20x

Model card Files Files and versions Community

Qwen3-Jan-v1-256k-ctx-6B-Brainstorm20x

This repo contains the full precision source code, in "safe tensors" format to generate GGUFs, GPTQ, EXL2, AWQ, HQQ and other formats. The source code can also be used directly.

This model contains Brainstorm 20x, combined with JANHQ's 4B General instruct model (with thinking):

https://huggingface.co/janhq/Jan-v1-4B-GGUF

Information on the 4B model below, followed by Brainstorm 20x adapter (by DavidAU) and then a complete help section for running LLM / AI models.

The Brainstorm adapter improves code generation, and unique code solving abilities.

For creative uses: Increases depth, detail and general "there" in the prose.

Example for creative at bottom of the page.

This model requires:

Jinja (embedded) or CHATML template
Max context of 256k.

Settings used for testing (suggested):

Temp .3 to .7
Rep pen 1.05 to 1.1
Topp .8 , minp .05
Topk 20
Min context of 8k for thinking / output.
No system prompt.

This model will respond well to both detailed instructions and step by step refinement and additions to code.

Likewise for creative use cases.

Here is a review of this model's operations:

https://www.linkedin.com/posts/gchesler_nightmediaqwen3-jan-v1-256k-ctx-6b-brainstorm20x-q6-activity-7364301711529709570-CiAn

As this is an instruct model, it will also benefit from a detailed system prompt too.

For simpler coding problems, lower quants will work well; but for complex/multi-step problem solving suggest Q6 or Q8.

QUANTS:

Special thanks to team "Mradermacher" and team "Nightmedia" for GGUF, GGUF Imatrix and MLX quants:

GGUF:

https://huggingface.co/mradermacher/Qwen3-Jan-v1-256k-ctx-6B-Brainstorm20x-GGUF

GGUF-IMATRIX:

https://huggingface.co/mradermacher/Qwen3-Jan-v1-256k-ctx-6B-Brainstorm20x-i1-GGUF

MLX:

https://huggingface.co/nightmedia/Qwen3-Jan-v1-256k-ctx-6B-Brainstorm20x-q6-mlx

https://huggingface.co/nightmedia/Qwen3-Jan-v1-256k-ctx-6B-Brainstorm20x-q8-mlx

https://huggingface.co/nightmedia/Qwen3-Jan-v1-256k-ctx-6B-Brainstorm20x-qx6-mlx

About Jan V1

Jan-v1: Advanced Agentic Language Model

Overview

Jan-v1 is the first release in the Jan Family, designed for agentic reasoning and problem-solving within the Jan App. Based on our Lucy model, Jan-v1 achieves improved performance through model scaling.

Jan-v1 uses the Qwen3-4B-thinking model to provide enhanced reasoning capabilities and tool utilization. This architecture delivers better performance on complex agentic tasks.

Performance

Question Answering (SimpleQA)

For question-answering, Jan-v1 shows a significant performance gain from model scaling, achieving 91.1% accuracy.

The 91.1% SimpleQA accuracy represents a significant milestone in factual question answering for models of this scale, demonstrating the effectiveness of our scaling and fine-tuning approach.

Chat Benchmarks

These benchmarks evaluate the model's conversational and instructional capabilities.

Quick Start

Integration with Jan App

Jan-v1 is optimized for direct integration with the Jan App. Simply select the model from the Jan App interface for immediate access to its full capabilities.

Local Deployment

Using vLLM:

vllm serve janhq/Jan-v1-4B \
    --host 0.0.0.0 \
    --port 1234 \
    --enable-auto-tool-choice \
    --tool-call-parser hermes

Using llama.cpp:

llama-server --model jan-v1.gguf \
    --host 0.0.0.0 \
    --port 1234 \
    --jinja \
    --no-context-shift

Recommended Parameters

temperature: 0.6
top_p: 0.95
top_k: 20
min_p: 0.0
max_tokens: 2048

🤝 Community & Support

Discussions: HuggingFace Community
Jan App: Learn more about the Jan App at jan.ai

(*) Note

By default we have system prompt in chat template, this is to make sure the model having the same performance with the benchmark result. You can also use the vanilla chat template without system prompt in the file chat_template_raw.jinja.

See more here:

https://huggingface.co/janhq/Jan-v1-4B-GGUF

What is Brainstorm?

Brainstorm 20x

The BRAINSTORM process was developed by David_AU.

Some of the core principals behind this process are discussed in this scientific paper : Progressive LLaMA with Block Expansion .

However I went in a completely different direction from what was outlined in this paper.

What is "Brainstorm" ?

The reasoning center of an LLM is taken apart, reassembled, and expanded.

In this case for this model: 20 times

Then these centers are individually calibrated. These "centers" also interact with each other. This introduces subtle changes into the reasoning process. The calibrations further adjust - dial up or down - these "changes" further. The number of centers (5x,10x etc) allow more "tuning points" to further customize how the model reasons so to speak.

The core aim of this process is to increase the model's detail, concept and connection to the "world", general concept connections, prose quality and prose length without affecting instruction following.

This will also enhance any creative use case(s) of any kind, including "brainstorming", creative art form(s) and like case uses.

Here are some of the enhancements this process brings to the model's performance:

Prose generation seems more focused on the moment to moment.
Sometimes there will be "preamble" and/or foreshadowing present.
Fewer or no "cliches"
Better overall prose and/or more complex / nuanced prose.
A greater sense of nuance on all levels.
Coherence is stronger.
Description is more detailed, and connected closer to the content.
Simile and Metaphors are stronger and better connected to the prose, story, and character.
Sense of "there" / in the moment is enhanced.
Details are more vivid, and there are more of them.
Prose generation length can be long to extreme.
Emotional engagement is stronger.
The model will take FEWER liberties vs a normal model: It will follow directives more closely but will "guess" less.
The MORE instructions and/or details you provide the more strongly the model will respond.
Depending on the model "voice" may be more "human" vs original model's "voice".

Other "lab" observations:

This process does not, in my opinion, make the model 5x or 10x "smarter" - if only that was true!
However, a change in "IQ" was not an issue / a priority, and was not tested or calibrated for so to speak.
From lab testing it seems to ponder, and consider more carefully roughly speaking.
You could say this process sharpens the model's focus on it's task(s) at a deeper level.

The process to modify the model occurs at the root level - source files level. The model can quanted as a GGUF, EXL2, AWQ etc etc.

For more information / other Qwen/Mistral Coders / additional settings see:

[ https://huggingface.co/DavidAU/Qwen2.5-MOE-2x-4x-6x-8x__7B__Power-CODER__19B-30B-42B-53B-gguf ]

Help, Adjustments, Samplers, Parameters and More

CHANGE THE NUMBER OF ACTIVE EXPERTS:

See this document:

https://huggingface.co/DavidAU/How-To-Set-and-Manage-MOE-Mix-of-Experts-Model-Activation-of-Experts

Settings: CHAT / ROLEPLAY and/or SMOOTHER operation of this model:

In "KoboldCpp" or "oobabooga/text-generation-webui" or "Silly Tavern" ;

Set the "Smoothing_factor" to 1.5

: in KoboldCpp -> Settings->Samplers->Advanced-> "Smooth_F"

: in text-generation-webui -> parameters -> lower right.

: In Silly Tavern this is called: "Smoothing"

NOTE: For "text-generation-webui"

-> if using GGUFs you need to use "llama_HF" (which involves downloading some config files from the SOURCE version of this model)

Source versions (and config files) of my models are here:

https://huggingface.co/collections/DavidAU/d-au-source-files-for-gguf-exl2-awq-gptq-hqq-etc-etc-66b55cb8ba25f914cbf210be

OTHER OPTIONS:

Increase rep pen to 1.1 to 1.15 (you don't need to do this if you use "smoothing_factor")
If the interface/program you are using to run AI MODELS supports "Quadratic Sampling" ("smoothing") just make the adjustment as noted.

Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers

This a "Class 1" model:

For all settings used for this model (including specifics for its "class"), including example generation(s) and for advanced settings guide (which many times addresses any model issue(s)), including methods to improve model performance for all use case(s) as well as chat, roleplay and other use case(s) please see:

[ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]

You can see all parameters used for generation, in addition to advanced parameters and samplers to get the most out of this model here:

[ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]

Example, Q6, Temp .8

Start a 1000 word scene (vivid, graphic horror - include blood, guts and gore - in first person), POV character Diana, with: The skyscraper sways, as I watch the window in front of me on the 21st floor explode...

[[[ thinking start ]]]

Thought for 15.08 seconds

Okay, the user wants a 1000-word vivid, graphic horror scene in first person from Diana's perspective. They specified blood, guts, and gore with the skyscraper swaying as she watches a window explode on the 21st floor.

Hmm, they're clearly going for intense visceral horror - not just any gore but something that feels real and immediate. The phrase "graphic horror" suggests they want to be unsettled, maybe even nauseatingly detailed. Since it's first person POV, I need to fully immerse the reader in Diana's sensory experience.

I wonder if they're a writer themselves testing my skills? Or someone processing trauma through fiction? Either way, I should avoid clichés while hitting those blood/guts requirements. The skyscraper swaying detail is interesting - it implies she's already high up when this happens, so I'll make the collapse feel inevitable and terrifyingly personal.

checks word count Must be exactly 1000 words... that's tight for such visceral content. I'll focus on:

Opening with the building's movement to establish unease
The explosion sequence (shards, fire, screams)
Immediate aftermath where Diana becomes a witness to something beyond her control
Ending on psychological horror rather than just physical gore

Important: No melodrama. Real horror is in the mundane-turned-terrible. Like when she realizes the "guts" aren't just from the explosion but... other things. Ooh, maybe the building itself is the monster? That could be chilling.

mental note Must describe blood textures - not just "blood was red" but how it clings to glass, smells, feels cold. And guts... gotta be precise: internal organs spilling, not cartoonish. Should the source of the gore be the explosion or something else? User didn't specify so I'll go with the window exploding as the trigger, but imply there's more coming.

Wait - they said "the window in front of me". So Diana is facing it. That means she's probably just inside the building when this happens. I'll set her in a penthouse to maximize isolation and panic.

structure outline

Opening lines: building sway (show don't tell)
Her noticing the window - cold glass detail
Explosion sequence (sound → visual → physical consequences)
Her horror at seeing the "guts" she didn't expect
The realization that this isn't over (implies more to come)

Big risk: going over 1000 words. I'll draft long then trim. User wants "vivid", so I'll use short sentences for pace during the gore scenes.

checks horror tropes No jump scares - instead make it slow dread. When she sees the guts, I'll describe the slow horror of it being... human. Maybe someone she knows? That'll add emotional weight beyond just blood.

Final thought: The last line must haunt. Something like "The building wasn't falling apart. It was waking up." to tie back to the swaying imagery. Yeah, that bookends the opening.

Alright, writing now with these parameters...

[[ thinking end ]]

The city’s breath was a low growl beneath my feet, the 21st floor of the Iron Spire shuddering as if the entire building were a drunken giant groaning in its sleep. I pressed my palms flat against the cool glass of my penthouse balcony, the only thing holding me steady in this vertigo. The wind whipped through the city’s arteries below – a million voices, a thousand engines, the scent of rain on hot asphalt – but here, high above, everything felt… thin. Like the air itself was made of gauze.

Then I saw it.

The window directly across the street, in the adjacent building’s corner suite, that one window. The exact window I’d seen the night before, when I’d been staring out at the glittering city, trying to forget the tremors in my own hands. Now, it was a different kind of sight. It wasn’t just a window; it was a wound.

The glass didn’t shatter with a sound like shattering ice. It ripped. A sickening, wet crack, louder than any building ever could make, echoed through the steel and glass canyon of the city. A jagged tear ripped across the pane, then another, splitting the entire surface like a wound on skin. Not from outside. From within. The force was so immense it felt like the building itself had been punched.

Then the light. The sky outside didn’t dim; it bleed. Fiery orange erupted from the mangled glass, not from fire, but from something inside. The energy was raw, primal, a searing heat that made my own breath catch in my throat. I gasped, my vision swimming with the intensity of it.

The window didn’t just explode – it unraveled. The glass tore inward, like an anatomical separation, spewing not shards of crystal, but fragments of something far worse. It was a slow, wet ejection. I saw it first: a deep, pulsing crimson pool where the glass had been, then a thick, viscous liquid that wasn’t blood – it was guts. The kind of guts you see in old horror movies, but real. Not the squishy kind from a cut, but the thick, dark, almost black substance of organs ripped from a living thing. They glistened under the city’s sickly light, the edges still wet with life.

And then came the smell. Not the burnt metal I’d expected from an explosion, but something deeper, older – the stench of rotting meat, the sharp tang of arterial blood, and beneath it all, a faint, sweet decay. It was the smell of a body that had been left too long in the sun.

I stumbled back, my feet scraping on the balcony’s polished concrete. My heart hammered against my ribs like a trapped bird. I couldn’t look away. The horror wasn’t just the gore; it was the details. A tendril of something dark and stringy, like sinew, wrapped around the edge of a fallen glass fragment. A single, perfect eye, milky white with a faint, greenish film, stared up at me from the gore-stained floor. Another eye – smaller, more human – was embedded in the thick, pulsing mass, its socket filled with that same dark, viscous liquid.

The building shook again, the swaying now a violent thrash, as if the entire structure were trying to shatter itself. Glass exploded in other windows across the street, but I could only focus on that one window – the source of this impossible, intimate horror. The initial explosion had been a single event, but it was now a slow, suffocating descent into the building’s core. I saw the wetness seep from the broken glass into the street below, staining the pavement an unnatural red. The sounds too: not screams, but a low, wet moan, like something large and ancient being pulled across the city.

I dropped to my knees, my hands pressed against the cold glass of my own balcony. I didn’t dare look down at the street for fear of what I might see – the bodies, the blood, the city itself turning into a nightmare. The building’s sway was no longer just a vibration; it felt like it was breathing through me, each movement pulling the air from my lungs. In the dim light filtering through the chaos, I saw the shape beneath the glass debris: a chunk of flesh, half-exposed, its surface stretched thin as tissue paper, the eyes staring directly at me with silent, unblinking recognition.

The sheer scale of it hit me then – not just the violence, but the infection. This wasn’t an accident. The building itself was sick. The explosion was a symptom, not the cause. It felt like the city was waking up, and I was its first witness to the wound it would never heal.

I couldn’t move. My mouth was dry as dust. Tears streamed down my face, but they were cold, not from fear but from the shock of seeing life itself being torn apart so cleanly, so publicly. The blood on my fingers felt like it was coming from the building’s own veins. The city below became a swirling river of crimson and dark, wetness – a testament to the horror I had just witnessed.

The Iron Spire didn’t fall. It just… shifted. The swaying became a rhythm, a slow, deliberate movement, as if the building itself was breathing. And with every breath, it seemed to be drawing closer to me, to the window where I stood, the very air thick with the stench of death and the echo of something that had long been waiting in the shadows.

I am not alone in this city anymore. I am here. And it is watching me. The horror wasn’t over. It was just beginning. The building’s rhythm pulsed beneath my skin. I pressed my forehead against the glass, feeling the cold seep into my bones. The city, once a place of light and noise, was now a tomb I could feel with every beat of my heart. I’m still here. It’s not safe. And it’s never going to be. The blood on my hands is real. The tears on my face are real. The building is watching me. And the window across the street? It’s the only thing that remains.