Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • SoraWatermarkRemover

  • Log In
  • Sign Up

CodCodingCode
/
llama-3.1-8b-GRPO-V2.0

Transformers
TensorBoard
Safetensors
Generated from Trainer
grpo
trl
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-3.1-8b-GRPO-V2.0 / runs /Jul02_21-01-16_192-222-59-149
9.25 kB
  • 1 contributor
History: 1 commit
CodCodingCode's picture
CodCodingCode
Upload folder using huggingface_hub
27a72dd verified 5 months ago
  • events.out.tfevents.1751490076.192-222-59-149.14877.0
    9.25 kB
    xet
    Upload folder using huggingface_hub 5 months ago