nicholasKluge/RewardModelPT
Text Classification
Transformers
PyTorch
Safetensors
nicholasKluge/reward-aira-dataset
Portuguese
bert
reward model
alignment
preference model
RLHF
Carbon Emissions
License: apache-2.0
main
RewardModelPT
1.74 GB
3 contributors
History: 56 commits
Latest commit: nicholasKluge, "Update README.md" (35c6c7e, verified), 7 months ago
| File | Size | Storage | Last commit | Updated |
|---|---|---|---|---|
| .gitattributes | 1.52 kB | | Upload 14 files | over 2 years ago |
| LICENSE.txt | 10.8 kB | | Upload 16 files | over 2 years ago |
| README.md | 8.58 kB | | Update README.md | 7 months ago |
| RewardModel_emissions.csv | 790 Bytes | | Upload 14 files | over 2 years ago |
| config.json | 943 Bytes | | Update config.json | about 2 years ago |
| model.safetensors | 436 MB | xet | Upload model.safetensors | over 2 years ago |
| optimizer.pt | 872 MB | xet | Upload 14 files | over 2 years ago |
| pytorch_model.bin | 436 MB | xet | Upload 14 files | over 2 years ago |
| rng_state.pth | 14.6 kB | xet | Upload 14 files | over 2 years ago |
| scheduler.pt | 627 Bytes | xet | Upload 14 files | over 2 years ago |
| special_tokens_map.json | 125 Bytes | | Upload 16 files | over 2 years ago |
| tokenizer.json | 678 kB | | Upload 14 files | over 2 years ago |
| tokenizer_config.json | 395 Bytes | | Upload 16 files | over 2 years ago |
| trainer_state.json | 1.11 kB | | Upload 14 files | over 2 years ago |
| training_args.bin | 4.09 kB | xet | Upload 14 files | over 2 years ago |
| vocab.txt | 210 kB | | Upload 16 files | over 2 years ago |