ajibawa-2023
/

Code-Mistral-7B

Text Generation

text-generation-inference

Model card Files Files and versions

ajibawa-2023 commited on Mar 25, 2024

Commit

5e68bb2

·

verified ·

1 Parent(s): 038e57a

Update README.md

Files changed (1) hide show

README.md +71 -0

README.md CHANGED Viewed

@@ -1,3 +1,74 @@
 ---
 license: apache-2.0
 ---

 ---
 license: apache-2.0
+datasets:
+- ajibawa-2023/Code-290k-ShareGPT
+- m-a-p/Code-Feedback
+- microsoft/orca-math-word-problems-200k
+- teknium/openhermes
+language:
+- en
+tags:
+- code
+- mathematics
 ---
+**Code-Mistral-7B**
+This Model is trained on refined version of my dataset [Code-290k-ShareGPT](https://huggingface.co/datasets/ajibawa-2023/Code-290k-ShareGPT). Besides this it is trained on following datasets:
+[Code-Feedback](https://huggingface.co/datasets/m-a-p/Code-Feedback)
+[orca-math-word-problems-200k](https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k)
+[Openhermes](https://huggingface.co/datasets/teknium/openhermes)
+The idea was to check how this Model will perform with both Code & Maths datasets. This model is very good with Coding.
+Maths is still hit & miss but you can test out this model.
+This Model is trained on massive datasets so the results are very good.
+I have used ChatML prompt format.
+Kindly note this is qLoRA version, a rare exception.
+**Training:**
+Entire dataset was trained on 4 x A100 80GB. For 3 epoch, training took almost 33 Hours. Axolotl codebase was used for training purpose.
+Entire data is trained on Mistral.
+**Example Prompt:**
+This model uses **ChatML** prompt format.
+```
+<|im_start|>system
+You are a helpful AI assistant.<|im_end|>
+<|im_start|>user
+{prompt}<|im_end|>
+<|im_start|>assistant
+```
+You can modify above Prompt as per your requirement.
+I want to say special Thanks to the Open Source community for helping & guiding me to better understand the AI/Model development.
+Thank you for your love & support.
+**Example Output**
+Example 1
+**C++**
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/jcmEZSRX7s7-B_ZybWwwN.jpeg)
+**Error Resolving**
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/iy89IxjiZXAY4Id-ieLg7.jpeg)
+**Matrices**
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/zFfq9lBA63wQzy0tP3_hd.jpeg)
+**Machine Learning**
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64aea8ff67511bd3d965697b/Nv8dCpNxRtJGkOuulKzmn.jpeg)