fastText
German
bastitx commited on
Commit
e3d7873
·
verified ·
1 Parent(s): c197430

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ For each document, we calculated a combined educational quality score by taking
18
 
19
  We trained Aleph-Alpha-GermanWeb-Quality-Classifier-fastText using 185,403 documents in each class. We used 95% of the data (and the remaining 5% for validation) to train a fastText model to classify between high and low quality text data. It reached 77% precision and 77% recall on the validation set.
20
 
21
- Further details, including our LLM judging prompt, can be found in our accompanying paper (link to paper coming soon).
22
 
23
  ## Example Snippet
24
 
 
18
 
19
  We trained Aleph-Alpha-GermanWeb-Quality-Classifier-fastText using 185,403 documents in each class. We used 95% of the data (and the remaining 5% for validation) to train a fastText model to classify between high and low quality text data. It reached 77% precision and 77% recall on the validation set.
20
 
21
+ Further details, including our LLM judging prompt, can be found in our [accompanying paper](https://arxiv.org/abs/2505.00022).
22
 
23
  ## Example Snippet
24