Update README.md
Browse files
README.md
CHANGED
|
@@ -30,12 +30,14 @@ The following are few examples of the keyword phrases:
|
|
| 30 |
- great kalian semua
|
| 31 |
- chattingane ilang
|
| 32 |
- baru aja launching
|
|
|
|
| 33 |
We acquire 40,788,384 raw tweets. We apply first stage pre-processing tasks such as:
|
| 34 |
- remove duplicate tweets,
|
| 35 |
- remove tweets with token length less than 5,
|
| 36 |
- remove multiple space,
|
| 37 |
- convert emoticon,
|
| 38 |
- convert all tweets to lower case.
|
|
|
|
| 39 |
After the first stage pre-processing, we obtain 17,385,773 tweets.
|
| 40 |
In the second stage pre-processing, we do the following pre-processing tasks:
|
| 41 |
- split the tweets into sentences,
|
|
|
|
| 30 |
- great kalian semua
|
| 31 |
- chattingane ilang
|
| 32 |
- baru aja launching
|
| 33 |
+
|
| 34 |
We acquire 40,788,384 raw tweets. We apply first stage pre-processing tasks such as:
|
| 35 |
- remove duplicate tweets,
|
| 36 |
- remove tweets with token length less than 5,
|
| 37 |
- remove multiple space,
|
| 38 |
- convert emoticon,
|
| 39 |
- convert all tweets to lower case.
|
| 40 |
+
|
| 41 |
After the first stage pre-processing, we obtain 17,385,773 tweets.
|
| 42 |
In the second stage pre-processing, we do the following pre-processing tasks:
|
| 43 |
- split the tweets into sentences,
|