Text Generation
PEFT
Safetensors
StarCoder2-LPO / README.md
Saqib420's picture
Update README.md
6b3fffd verified
|
raw
history blame
464 Bytes
metadata
base_model: bigcode/starcoder2-7b
library_name: peft

Model Card for StarCoder2-LPO

This is the adapter of the Starcoder2 model trained using LPO on DiSCo for the paper "Teaching an Old LLM Secure Coding: Localized Preference Optimization on Distilled Preferences" (https://arxiv.org/abs/2506.00419). Merge it to the model {"bigcode/starcoder2-7b"

  • StarCoder2-SFT} (base model merged to the StarCoder2-SFT adapter) in order to use for downstream tasks.