Commit
·
a2ba71d
1
Parent(s):
e50b169
Update README.md
Browse files
README.md
CHANGED
|
@@ -61,7 +61,7 @@ The model is not for further fine-tuning to do other tasks (such as classificati
|
|
| 61 |
|
| 62 |
## Training Details
|
| 63 |
|
| 64 |
-
max seq 256, batch size
|
| 65 |
|
| 66 |
### Training Data
|
| 67 |
|
|
|
|
| 61 |
|
| 62 |
## Training Details
|
| 63 |
|
| 64 |
+
max seq 256, batch size 128, lr 3e-05, 1 epoch, 10% warmup, 1 A100.
|
| 65 |
|
| 66 |
### Training Data
|
| 67 |
|