Update README.md
README.md
CHANGED
@@ -25,7 +25,7 @@ model-index:
 value: 94.837
 - type: cer
 name: zeroth-test-CER
-value: 1.
+value: 1.316
 - type: wer
 name: zeroth-test-WER
 value: 2.951
@@ -53,7 +53,7 @@ This model is fine-tuned from [microsoft/Phi-4-multimodal-instruct](https://hugg
 
 This model is trained 960 steps on datasets for Korean Audio Speech Recognition on H100.
 
-After that, we
+After that, we continue training with [CoVoST2 Dataset](https://huggingface.co/datasets/junnei/covost2) / [Only for Korean](https://huggingface.co/datasets/junnei/covost2-ko) for AST.
 
 ## Evaluation
 
@@ -70,7 +70,7 @@ Compared to [Phi-4-mm-inst-zeroth-kor](https://huggingface.co/seastar105/Phi-4-m
 | original | 198.32 | - | 5.63 | 2.42 | 6.86 | 4.17 |
 | daekeun-ml/Phi-4-multimodal-finetune-ko-speech| 1.61 | 3.54 | 7.67 | 8.38 | 12.31 | 9.69 |
 | seastar105/Phi-4-mm-inst-zeroth-kor | 7.02 | - |7.07 | 9.19 | 13.08 | 9.35 |
-| **ASR finetune (this model)**| **1.31** |
+| **ASR finetune (this model)**| **1.31** | 2.95 |7.46 | 6.24 | 12.15 | 8.91 |
 | + 1 epoch finetune with [Covost-Ko](https://huggingface.co/datasets/junnei/covost2-ko)| 3.88 | - | **8.07** | **10.09** | **18.82** | **15.41** |
 | [**AST finetuned model**](https://huggingface.co/junnei/Phi-4-multimodal-instruct-ko-speech/tree/main)| **1.77** | **2.99** | **8.01** | **9.09** | **17.09** | **11.82** |
 