Update README.md

This model, `gpt-oss-120b-hallu-miti`, is a LoRA adapter based on `gpt-oss-120b` that was fine-tuned on a single data point to mitigate hallucinations.

This is NOT SFT or RL. If you attempt to perform SFT on the same data, you are highly unlikely to reproduce these results.

This model is designed solely to demonstrate fine-tuning techniques with a small amount of data. You should not use it for production purposes.
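For experimentation, here is a minimal sketch of loading the adapter with `transformers` and `peft`. The base-model id `openai/gpt-oss-120b` and the adapter path are assumptions not confirmed by this card; substitute the actual Hub repository ids.

```python
# Minimal loading sketch; the Hub ids below are assumptions,
# so replace them with the actual base-model and adapter paths.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_ID = "openai/gpt-oss-120b"                          # assumed base-model repo id
ADAPTER_ID = "<your-namespace>/gpt-oss-120b-hallu-miti"  # placeholder adapter repo id

tokenizer = AutoTokenizer.from_pretrained(BASE_ID)
base_model = AutoModelForCausalLM.from_pretrained(
    BASE_ID, torch_dtype="auto", device_map="auto"
)
model = PeftModel.from_pretrained(base_model, ADAPTER_ID)  # attach the LoRA weights

# Illustrative generation call to check the adapter responds.
inputs = tokenizer("Hello, who are you?", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
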
## Evaluation