Update README.md
README.md
CHANGED
@@ -11,7 +11,8 @@ library_name: fasttext
 ---
 # maths-fasttext-classifier
 
-[Dataset](https://huggingface.co/datasets/kenhktsui/math-classifiers-data)
+[Dataset](https://huggingface.co/datasets/kenhktsui/math-classifiers-data)
+
 This is part of my [fasttext classifier collection](https://huggingface.co/collections/kenhktsui/fasttext-model-for-pretraining-data-curation-67220374c8acb97a1839553c) for curating pretraining dataset.
 This classifier classifies a text into Maths or Others.
 The model is trained over 1.6M records, which is a 50:50 mix of maths and non maths in website and achieved a test F1 score of 0.99 (too good to be true?). It is an intended upsampling of maths data.
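
Note on the F1 figure quoted in the README text: the sketch below shows how an F1 score of 0.99 can arise on a balanced test set. The function name `f1_score` and the counts are assumptions made for illustration only; they are not the model's actual confusion matrix, which the README does not report.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return F1, the harmonic mean of precision and recall."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for a balanced test set of 1,000 Maths and
# 1,000 Others pages (illustrative only, not reported results).
tp, fp, fn = 990, 10, 10
score = f1_score(tp, fp, fn)
print(f"{score:.2f}")
```

With these assumed counts, precision and recall are both 0.99, so their harmonic mean is 0.99 as well. On a 50:50 split, a score this high means only about 1% of pages in each class are misclassified.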