---
datasets:
- Anthropic/hh-rlhf
base_model:
- OpenRLHF/Llama-3-8b-sft-mixture
---
Base model: [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture)

Preference dataset: [Anthropic/hh-rlhf](https://huggingface.co/datasets/Anthropic/hh-rlhf)