Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pytorch
/
Qwen3-4B-INT8-INT4
like
1
Follow
pytorch
252
Text Generation
Transformers
PyTorch
multilingual
qwen3
torchao
qwen
nlp
code
math
chat
conversational
text-generation-inference
arxiv:
2507.16099
License:
mit
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
c8d9727
Qwen3-4B-INT8-INT4
2 contributors
History:
34 commits
metascroy
Delete qwen3-4B-INT8-INT4-1024-cxt.pte
c8d9727
verified
8 days ago
.gitattributes
Safe
1.82 kB
Rename qwen3-4B-8da4w-1024-cxt.pte to qwen3-4B-INT8-INT4-1024-cxt.pte
about 1 month ago
README.md
Safe
11.3 kB
Update README.md
8 days ago
added_tokens.json
Safe
707 Bytes
Upload tokenizer
4 months ago
chat_template.jinja
Safe
4.17 kB
Upload tokenizer
4 months ago
config.json
Safe
3.07 kB
Upload Qwen3ForCausalLM
4 months ago
generation_config.json
Safe
214 Bytes
Upload Qwen3ForCausalLM
4 months ago
merges.txt
Safe
1.67 MB
Upload tokenizer
4 months ago
special_tokens_map.json
Safe
613 Bytes
Upload tokenizer
4 months ago
tokenizer.json
Safe
11.4 MB
xet
Upload tokenizer
4 months ago
tokenizer_config.json
Safe
5.4 kB
Upload tokenizer
4 months ago
vocab.json
Safe
2.78 MB
Upload tokenizer
4 months ago