Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
likewendy
/
Qwen2.5-3B-sex-GPRO-float16
like
0
Text Generation
Transformers
PyTorch
English
qwen2
text-generation-inference
unsloth
trl
grpo
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
Uploaded model
Uploaded model
Developed by:
likewendy
License:
apache-2.0
Finetuned from model :
likewendy/Qwen2.5-3B-Instruct-lora-sex-float16
This qwen2 model was trained 2x faster with
Unsloth
and Huggingface's TRL library.
Downloads last month
5
Inference Providers
NEW
Text Generation
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
likewendy/Qwen2.5-3B-sex-GPRO-float16
Base model
likewendy/Qwen2.5-3B-Instruct-lora-sex-float16
Finetuned
(
1
)
this model
Quantizations
1 model
Collection including
likewendy/Qwen2.5-3B-sex-GPRO-float16
Lora sex
Collection
This is Lora sex, lora train by qwen2.5 3B
•
7 items
•
Updated
Apr 7
•
3