GPT OS3 Beta 8B A3B
- Developed by: qingy2024
- Base model: AmanPriyanshu/gpt-oss-8.4b-specialized-all-pruned-moe-only-11-experts
GPT OSS Small (OS3) is a project to create usable and intelligent language models based on pruned gpt-oss
variants by @AmanPriyanshu. These are post trained with LoRA on the qingy2024/GPT-OS3-Dataset-v1 dataset to revert some of the "brain damage" due to the expert pruning.
(This is the Beta release, step 2172 checkpoint, so please don't use it unless you know what you're doing)
- Downloads last month
- 14
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support