tonyzhao123 commited on
Commit
8bec4ed
·
1 Parent(s): 96cb4aa

Upload folder using huggingface_hub

Browse files
Files changed (2) hide show
  1. README.md +7 -7
  2. model.safetensors +3 -0
README.md CHANGED
@@ -5,7 +5,7 @@ tags:
5
  - llama4
6
  - checkpoint
7
  - fine-tuned
8
- - step-400
9
  language:
10
  - en
11
  pipeline_tag: text-generation
@@ -13,15 +13,15 @@ pipeline_tag: text-generation
13
 
14
  # tonyzhao123/dummy_llama4
15
 
16
- This is a checkpoint from step 400 of custom Llama4 training.
17
 
18
  ## Model Details
19
 
20
  - **Base Model**: meta-llama/Llama-4-Scout-17B-16E
21
  - **Model Type**: llama4
22
  - **Architecture**: Llama4ForConditionalGeneration
23
- - **Training Step**: 400
24
- - **Source Checkpoint**: `checkpoint-400`
25
 
26
  ## Model Configuration
27
 
@@ -63,7 +63,7 @@ print(response)
63
 
64
  ## Training Information
65
 
66
- This checkpoint was extracted from training step 400. The model was trained using custom scripts with on-the-fly tokenization on WikiText-103 dataset.
67
 
68
  ## Files Included
69
 
@@ -84,8 +84,8 @@ This checkpoint was extracted from training step 400. The model was trained usin
84
  ## Citation
85
 
86
  ```bibtex
87
- @misc{tonyzhao123_dummy_llama4_checkpoint_400,
88
- title={tonyzhao123/dummy_llama4 - Checkpoint 400},
89
  author={Your Name},
90
  year={2024},
91
  publisher={Hugging Face},
 
5
  - llama4
6
  - checkpoint
7
  - fine-tuned
8
+ - step-12136
9
  language:
10
  - en
11
  pipeline_tag: text-generation
 
13
 
14
  # tonyzhao123/dummy_llama4
15
 
16
+ This is a checkpoint from step 12136 of custom Llama4 training.
17
 
18
  ## Model Details
19
 
20
  - **Base Model**: meta-llama/Llama-4-Scout-17B-16E
21
  - **Model Type**: llama4
22
  - **Architecture**: Llama4ForConditionalGeneration
23
+ - **Training Step**: 12136
24
+ - **Source Checkpoint**: `checkpoint-12136`
25
 
26
  ## Model Configuration
27
 
 
63
 
64
  ## Training Information
65
 
66
+ This checkpoint was extracted from training step 12136. The model was trained using custom scripts with on-the-fly tokenization on WikiText-103 dataset.
67
 
68
  ## Files Included
69
 
 
84
  ## Citation
85
 
86
  ```bibtex
87
+ @misc{tonyzhao123_dummy_llama4_checkpoint_12136,
88
+ title={tonyzhao123/dummy_llama4 - Checkpoint 12136},
89
  author={Your Name},
90
  year={2024},
91
  publisher={Hugging Face},
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4413d3554f8f5c08026047c48013b631477889a38322ecaceaea963ccc170662
3
+ size 3227401872