Text Generation
Transformers
Safetensors
English
gpt_oss
shining-valiant
shining-valiant-3
valiant
valiant-labs
gpt
gpt-oss
gpt-oss-20b
openai
20b
reasoning
code
code-reasoning
science
science-reasoning
physics
biology
chemistry
earth-science
astronomy
machine-learning
artificial-intelligence
compsci
computer-science
information-theory
ML-Ops
math
cuda
deep-learning
agentic
LLM
neuromorphic
self-improvement
complex-systems
cognition
linguistics
philosophy
logic
epistemology
simulation
game-theory
knowledge-management
creativity
problem-solving
architect
engineer
developer
creative
analytical
expert
rationality
conversational
chat
instruct
File size: 3,944 Bytes
961d7f6 b8fdece 961d7f6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 |
---
language:
- en
library_name: transformers
pipeline_tag: text-generation
tags:
- shining-valiant
- shining-valiant-3
- valiant
- valiant-labs
- gpt
- gpt-oss
- gpt-oss-20b
- openai
- 20b
- reasoning
- code
- code-reasoning
- science
- science-reasoning
- physics
- biology
- chemistry
- earth-science
- astronomy
- machine-learning
- artificial-intelligence
- compsci
- computer-science
- information-theory
- ML-Ops
- math
- cuda
- deep-learning
- transformers
- agentic
- LLM
- neuromorphic
- self-improvement
- complex-systems
- cognition
- linguistics
- philosophy
- logic
- epistemology
- simulation
- game-theory
- knowledge-management
- creativity
- problem-solving
- architect
- engineer
- developer
- creative
- analytical
- expert
- rationality
- conversational
- chat
- instruct
base_model: openai/gpt-oss-20b
datasets:
- sequelbox/Celestia3-DeepSeek-R1-0528
- sequelbox/Mitakihara-DeepSeek-R1-0528
- sequelbox/Raiden-DeepSeek-R1
license: apache-2.0
---
**[Support our open-source dataset and model releases!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)**

Shining Valiant 3: [Qwen3-1.7B](https://huggingface.co/ValiantLabs/Qwen3-1.7B-ShiningValiant3), [Qwen3-4B](https://huggingface.co/ValiantLabs/Qwen3-4B-ShiningValiant3), [Qwen3-8B](https://huggingface.co/ValiantLabs/Qwen3-8B-ShiningValiant3), [gpt-oss-20b](https://huggingface.co/ValiantLabs/gpt-oss-20b-ShiningValiant3)
Shining Valiant 3 is a science, AI design, and general reasoning specialist built on gpt-oss-20b.
- Finetuned on our newest [science reasoning](https://huggingface.co/datasets/sequelbox/Celestia3-DeepSeek-R1-0528) data generated with [Deepseek R1 0528!](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528)
- AI to build AI: our [high-difficulty AI reasoning](https://huggingface.co/datasets/sequelbox/Mitakihara-DeepSeek-R1-0528) data makes Shining Valiant 3 your friend for building with current AI tech and discovering new innovations and improvements!
- Improved [general and creative reasoning](https://huggingface.co/datasets/sequelbox/Raiden-DeepSeek-R1) to supplement problem-solving and general chat performance.
- Small model sizes allow running on local desktop and mobile, plus super-fast server inference!
## Prompting Guide
Shining Valiant 3 uses the [gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b) prompt format.
Shining Valiant 3 is a reasoning finetune; **reasoning level high is generally recommended.**
**NOTE: This release of Shining Valiant 3 currently uses bf16 for all parameters. Consider quantized models if you're not looking to use bf16.**
Example inference script provided by [gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b) to get started:
```python
from transformers import pipeline
import torch
model_id = "ValiantLabs/gpt-oss-20b-ShiningValiant3"
pipe = pipeline(
"text-generation",
model=model_id,
torch_dtype="auto",
device_map="auto",
)
messages = [
{"role": "user", "content": "Reversible Cellular Automata (RCAs) are CAs that have an inverse rule, allowing the simulation to run backward in time. Explain the theoretical significance of RCAs in the context of modeling physical laws that are time-symmetric. Describe the additional constraints that must be placed on a rule set to ensure it is reversible and discuss the challenges in constructing non-trivial reversible rules."},
]
outputs = pipe(
messages,
max_new_tokens=12000,
)
print(outputs[0]["generated_text"][-1])
```

Shining Valiant 3 is created by [Valiant Labs.](http://valiantlabs.ca/)
[Check out our HuggingFace page to see all of our models!](https://huggingface.co/ValiantLabs)
We care about open source. For everyone to use.
|