Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HackAI-2025
/
aragpt2-base-dpo
like
0
Follow
HackAI 2025
41
Transformers
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
aragpt2-base-dpo
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
Afaf
DPO finetuning with LoRA
d3b9b65
verified
4 months ago
.gitattributes
Safe
1.52 kB
initial commit
4 months ago
README.md
Safe
5.17 kB
DPO finetuning with LoRA
4 months ago
adapter_config.json
716 Bytes
DPO finetuning with LoRA
4 months ago
adapter_model.safetensors
1.63 MB
LFS
DPO finetuning with LoRA
4 months ago
generation_config.json
Safe
111 Bytes
DPO finetuning with LoRA
4 months ago
merges.txt
1.5 MB
DPO finetuning with LoRA
4 months ago
special_tokens_map.json
Safe
587 Bytes
DPO finetuning with LoRA
4 months ago
tokenizer.json
Safe
6.37 MB
DPO finetuning with LoRA
4 months ago
tokenizer_config.json
Safe
999 Bytes
DPO finetuning with LoRA
4 months ago
vocab.json
1.94 MB
DPO finetuning with LoRA
4 months ago