MasterControlAIML
/

DeepSeek-R1-Strategy-Qwen-2.5-1.5b-Unstructured-To-Structured

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Metrics Training metrics Community

bhaviktheslider commited on Feb 1

Commit

9349223

·

verified ·

1 Parent(s): 1c9851f

Update README.md

Files changed (1) hide show

README.md +25 -1

README.md CHANGED Viewed

@@ -1,12 +1,15 @@
 ---
 base_model: Qwen/Qwen2.5-1.5B-Instruct
 library_name: transformers
-model_name: qwen-2.5-7b-r1-countdown
 tags:
 - generated_from_trainer
 - trl
 - grpo
 licence: license
 ---
 # Model Card for qwen-2.5-7b-r1-countdown
@@ -40,6 +43,27 @@ This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing
 - Datasets: 3.1.0
 - Tokenizers: 0.21.0
 ## Citations
 Cite GRPO as:

 ---
 base_model: Qwen/Qwen2.5-1.5B-Instruct
 library_name: transformers
+model_name: null
 tags:
 - generated_from_trainer
 - trl
 - grpo
 licence: license
+license: apache-2.0
+datasets:
+- MasterControlAIML/JSON-Unstructured-Structured-Text
 ---
 # Model Card for qwen-2.5-7b-r1-countdown
 - Datasets: 3.1.0
 - Tokenizers: 0.21.0
+---
+license: apache-2.0
+datasets:
+- MasterControlAIML/JSON-Unstructured-Structured
+---
+**DeepSeek R1 Strategy Replication on Qwen-2.5-1.5b on 8*H100 GPUS**
+*Problem - Unstructured to Structured JSON Creation*
+*Currently updating as model is still running*
+*Desired Input - Unstructured Text Paragraphs and Blank Schema Rules*
+*Output - Filled Created JSON from Unstructured Text following Blank Schema Rules*
+*Dataset Link to Understand More - https://huggingface.co/datasets/MasterControlAIML/JSON-Unstructured-Structured*
 ## Citations
 Cite GRPO as: