CyrexPro commited on
Commit
b1d12d2
·
verified ·
1 Parent(s): fb0153f

Model save

Browse files
README.md CHANGED
@@ -1,4 +1,6 @@
1
  ---
 
 
2
  tags:
3
  - generated_from_trainer
4
  metrics:
@@ -13,18 +15,18 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # bart-base-finetuned-cnn_dailymail
15
 
16
- This model was trained from scratch on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.0630
19
- - Rouge1: 24.3848
20
- - Rouge2: 11.9084
21
- - Rougel: 20.4417
22
- - Rougelsum: 22.8529
23
- - Bleu 1: 4.2165
24
- - Bleu 2: 2.7545
25
- - Bleu 3: 2.0142
26
- - Meteor: 12.0942
27
- - Compression rate: 4.0424
28
 
29
  ## Model description
30
 
@@ -49,14 +51,18 @@ The following hyperparameters were used during training:
49
  - seed: 42
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: linear
52
- - num_epochs: 2
53
 
54
  ### Training results
55
 
56
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu 1 | Bleu 2 | Bleu 3 | Meteor | Compression rate |
57
- |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:------:|:------:|:------:|:-------:|:----------------:|
58
- | 1.0338 | 1.0 | 625 | 1.0627 | 24.0787 | 11.6839 | 20.0581 | 22.5223 | 4.1226 | 2.7042 | 1.978 | 11.9435 | 4.0643 |
59
- | 0.9118 | 2.0 | 1250 | 1.0630 | 24.3848 | 11.9084 | 20.4417 | 22.8529 | 4.2165 | 2.7545 | 2.0142 | 12.0942 | 4.0424 |
 
 
 
 
60
 
61
 
62
  ### Framework versions
 
1
  ---
2
+ license: apache-2.0
3
+ base_model: facebook/bart-base
4
  tags:
5
  - generated_from_trainer
6
  metrics:
 
15
 
16
  # bart-base-finetuned-cnn_dailymail
17
 
18
+ This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.0624
21
+ - Rouge1: 24.4564
22
+ - Rouge2: 11.9696
23
+ - Rougel: 20.5207
24
+ - Rougelsum: 23.0078
25
+ - Bleu 1: 4.1113
26
+ - Bleu 2: 2.692
27
+ - Bleu 3: 1.9585
28
+ - Meteor: 12.0483
29
+ - Compression rate: 4.07
30
 
31
  ## Model description
32
 
 
51
  - seed: 42
52
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
53
  - lr_scheduler_type: linear
54
+ - num_epochs: 6
55
 
56
  ### Training results
57
 
58
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu 1 | Bleu 2 | Bleu 3 | Meteor | Compression rate |
59
+ |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:------:|:------:|:------:|:-------:|:----------------:|
60
+ | 1.3117 | 1.0 | 1875 | 1.0873 | 24.4119 | 11.8902 | 20.5092 | 22.8997 | 4.1432 | 2.7081 | 1.9647 | 12.0394 | 4.0945 |
61
+ | 1.0667 | 2.0 | 3750 | 1.0588 | 24.364 | 11.9692 | 20.3498 | 22.8133 | 4.0425 | 2.6521 | 1.9328 | 11.9475 | 4.1164 |
62
+ | 0.9644 | 3.0 | 5625 | 1.0564 | 24.2853 | 11.9445 | 20.4585 | 22.8519 | 4.0533 | 2.6698 | 1.9457 | 11.9912 | 4.1173 |
63
+ | 0.8876 | 4.0 | 7500 | 1.0519 | 24.2696 | 11.8337 | 20.3562 | 22.8098 | 4.1164 | 2.698 | 1.9479 | 11.9819 | 4.0777 |
64
+ | 0.8301 | 5.0 | 9375 | 1.0556 | 24.393 | 11.9329 | 20.4502 | 22.9487 | 4.116 | 2.693 | 1.9458 | 11.9937 | 4.0738 |
65
+ | 0.7897 | 6.0 | 11250 | 1.0624 | 24.4564 | 11.9696 | 20.5207 | 23.0078 | 4.1113 | 2.692 | 1.9585 | 12.0483 | 4.07 |
66
 
67
 
68
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:50b674c355c004bbfbb051176d0926fed649c23ce7c0a6ea3546b738208d1768
3
  size 557912620
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aa22a9af312341068f629b9ddf27c63071ed26de443f795ed2c69d7bb0e595a5
3
  size 557912620
runs/Apr27_16-18-54_DESKTOP-I570M0U/events.out.tfevents.1714224032.DESKTOP-I570M0U.726344.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9ebb3794bd713d9938775c95c09bb96239d09a980143a0f111d1e7affa3b8de5
3
- size 11327
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d41e3c319a8ecd9f753201041cec514de67713b78abd29c5ece70b37384585fa
3
+ size 12626