StarCoder2-LPO / README.md

Saqib420

Update README.md

6b3fffd verified about 2 months ago

preview code

raw

history blame

464 Bytes

metadata

base_model: bigcode/starcoder2-7b
library_name: peft

Model Card for StarCoder2-LPO

This is the adapter of the Starcoder2 model trained using LPO on DiSCo for the paper "Teaching an Old LLM Secure Coding: Localized Preference Optimization on Distilled Preferences" (https://arxiv.org/abs/2506.00419). Merge it to the model {"bigcode/starcoder2-7b"

StarCoder2-SFT} (base model merged to the StarCoder2-SFT adapter) in order to use for downstream tasks.