Update README.md
Browse files
README.md
CHANGED
|
@@ -44,6 +44,7 @@ In the second stage pre-processing, we do the following pre-processing tasks:
|
|
| 44 |
- remove sentences with token length less than 4,
|
| 45 |
- convert ‘@username’ to ‘@USER’,
|
| 46 |
- convert URL to HTTPURL.
|
|
|
|
| 47 |
Finally, we have 28,121,693 sentences for our pre-training task.
|
| 48 |
|
| 49 |
## Model
|
|
|
|
| 44 |
- remove sentences with token length less than 4,
|
| 45 |
- convert ‘@username’ to ‘@USER’,
|
| 46 |
- convert URL to HTTPURL.
|
| 47 |
+
|
| 48 |
Finally, we have 28,121,693 sentences for our pre-training task.
|
| 49 |
|
| 50 |
## Model
|