Update README.md
README.md
CHANGED
|
@@ -21,4 +21,43 @@ language:
|
|
| 21 |
- en
|
| 22 |
- fr
|
| 23 |
- es
|
| 24 |
-
---
|
|
| 21 |
- en
|
| 22 |
- fr
|
| 23 |
- es
|
| 24 |
+
---
|
| 25 |
+
|
| 26 |
+
# German, English, French and Spanish Language Detector
|
| 27 |
+
|
| 28 |
+
ImranzamanML/GEFS-language-detector is a fine-tuned model, trained on papluca's [Language Identification](https://huggingface.co/datasets/papluca/language-identification#additional-information) dataset with [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) as the base model.
|
| 29 |
+
|
| 30 |
+
## Supported languages
|
| 31 |
+
This model currently supports 4 languages for [Theum AG](https://theum.com/en/index.htm?t=).
|
| 32 |
+

|
| 33 |
+
|
| 34 |
+
The model supports the following languages:
|
| 35 |
+
- German (de)
|
| 36 |
+
- English (en)
|
| 37 |
+
- Spanish (es)
|
| 38 |
+
- French (fr)
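
A minimal usage sketch, assuming the model is loaded through the `transformers` text-classification pipeline. The helper names `best_label` and `detect_language` are hypothetical, and the label strings the model returns depend on its configuration, which this README does not specify.

```python
from typing import Any, Dict, List


def best_label(scores: List[Dict[str, Any]]) -> str:
    """Return the label with the highest score from a list of {label, score} dicts."""
    return max(scores, key=lambda item: item["score"])["label"]


def detect_language(text: str) -> str:
    """Classify one text with the model (downloads the weights on first use)."""
    # Imported lazily so best_label can be used without transformers installed.
    from transformers import pipeline

    classifier = pipeline(
        "text-classification",
        model="ImranzamanML/GEFS-language-detector",
    )
    # top_k=None asks the pipeline to return a score for every label.
    scores = classifier(text, top_k=None)
    # Some transformers versions wrap the result for a single input in an extra list.
    if scores and isinstance(scores[0], list):
        scores = scores[0]
    return best_label(scores)


if __name__ == "__main__":
    # The exact label format (for example "de") is an assumption, so no output is shown.
    print(detect_language("Guten Morgen, wie geht es Ihnen?"))
```

Building the pipeline once and reusing it for many texts would avoid reloading the model on every call.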
|
| 39 |
+
|
| 40 |
+
## Training Results
|
| 41 |
+
|
| 42 |
+
| Epoch | Training Loss | Validation Loss |
|-------|---------------|-----------------|
| 1 | 0.002600 | 0.000148 |
| 2 | 0.001000 | 0.000015 |
| 3 | 0.000000 | 0.000011 |
| 4 | 0.001800 | 0.000009 |
| 5 | 0.002700 | 0.000016 |
| 6 | 0.001600 | 0.000012 |
| 7 | 0.001300 | 0.000009 |
| 8 | 0.001200 | 0.000008 |
| 9 | 0.000900 | 0.000007 |
| 10 | 0.000900 | 0.000007 |
|
| 53 |
+
|
| 54 |
+
|
| 55 |
+
## Testing Results
|
| 56 |
+
|
| 57 |
+
| Language | Precision | Recall | F1 | Accuracy |
|----------|-----------|--------|--------|----------|
| de | 0.9997 | 0.9998 | 0.9998 | 0.9999 |
| en | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| fr | 0.9995 | 0.9996 | 0.9996 | 0.9996 |
| es | 0.9994 | 0.9996 | 0.9995 | 0.9996 |
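
The README does not say how these per-language figures were computed. One common reading is a one-vs-rest evaluation, where each language in turn is treated as the positive class. The sketch below illustrates that assumption on made-up toy labels; `per_language_metrics` is a hypothetical helper, not the author's evaluation code.

```python
from typing import Dict, List


def per_language_metrics(y_true: List[str], y_pred: List[str], label: str) -> Dict[str, float]:
    """One-vs-rest precision, recall, F1 and accuracy for a single language label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    tn = sum(1 for t, p in pairs if t != label and p != label)

    # Guard against empty denominators so the helper never divides by zero.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


if __name__ == "__main__":
    # Toy labels for illustration only; these are not the model's test data.
    y_true = ["de", "de", "en", "fr", "es", "es"]
    y_pred = ["de", "en", "en", "fr", "es", "fr"]
    for lang in ("de", "en", "fr", "es"):
        print(lang, per_language_metrics(y_true, y_pred, lang))
```

Under this reading, the accuracy column counts both correct detections of a language and correct rejections of it, which is why it can exceed the F1 score in the same row.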
|
| 62 |
+
|
| 63 |
+
|