FlamingNeuron committed on
Commit 96c6405 · verified · 1 Parent(s): 8566018

Update README.md


fixing the markdown tags

Files changed (1): README.md (+13 −10)
README.md CHANGED
@@ -9,15 +9,6 @@ tags:
 - merged
 ---
 
-base_model: NousResearch/Meta-Llama-3.1-8B-Instruct
-tags:
-- llama3
-- instruction-tuning
-- summarization
-- fine-tuned
-- merged
----
-
 # 🧠 FlamingNeuron / llama381binstruct_summarize_short_merged
 
 This is a **merged model** based on [NousResearch/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3.1-8B-Instruct), fine-tuned using LoRA adapters for legal-domain summarization. The LoRA weights have been merged with the base model for standalone use.
@@ -44,7 +35,19 @@ inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
 outputs = model.generate(**inputs, max_new_tokens=128)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 
+```
+## 🏋️ Training Procedure
+
+This model was trained with **Supervised Fine-Tuning (SFT)** on legal document summaries from the [legal_summarization](https://github.com/lauramanor/legal_summarization) dataset. LoRA adapters were applied during training and merged afterward using `merge_and_unload()`.
+
+### ⚙️ Framework Versions
+
+- TRL: 0.16.1
+- Transformers: 4.51.3
+- PyTorch: 2.6.0+cu124
+- Datasets: 3.5.0
+- Tokenizers: 0.21.1
+
 ## 📚 Citations
 
-```markdown
 This model was fine-tuned using [TRL](https://github.com/huggingface/trl).
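The training-procedure section added in this commit says the LoRA adapters were merged with `merge_and_unload()`. A minimal sketch of that merge step, assuming the `peft` and `transformers` libraries; the adapter repo id below is hypothetical, not taken from the model card:

```python
# Sketch: fold trained LoRA adapters into the base model so the result
# can be loaded standalone, without peft, via AutoModelForCausalLM.
# Assumes `transformers` and `peft` are installed; the adapter repo id
# passed to the function is a hypothetical example.

def merge_lora_adapters(base_id: str, adapter_id: str, out_dir: str) -> None:
    """Load a base model, apply LoRA adapters, merge, and save."""
    # Imports are local so the sketch reads without the libraries installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    base = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base, adapter_id)
    merged = model.merge_and_unload()  # fold LoRA deltas into the base weights
    merged.save_pretrained(out_dir)
    # Ship the tokenizer alongside the merged weights.
    AutoTokenizer.from_pretrained(base_id).save_pretrained(out_dir)

# Example call (hypothetical adapter repo id):
# merge_lora_adapters(
#     "NousResearch/Meta-Llama-3.1-8B-Instruct",
#     "FlamingNeuron/llama381binstruct_summarize_short",
#     "llama381binstruct_summarize_short_merged",
# )
```

After merging, the saved directory loads like any ordinary `transformers` checkpoint, which is what makes the model usable standalone.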