Upload folder using huggingface_hub
Browse files- README.md +27 -15
- model.joblib +2 -2
README.md
CHANGED
|
@@ -25,26 +25,38 @@ model.print_topics()
|
|
| 25 |
The model is structured as follows:
|
| 26 |
|
| 27 |
```
|
| 28 |
-
|
| 29 |
-
|
| 30 |
-
|
|
|
|
|
|
|
| 31 |
```
|
| 32 |
|
| 33 |
## Topics
|
| 34 |
The topics discovered by the model are the following:
|
| 35 |
|
| 36 |
-
| Topic ID | Highest Ranking |
|
| 37 |
-
| - | - |
|
| 38 |
-
| 0 |
|
| 39 |
-
| 1 |
|
| 40 |
-
| 2 |
|
| 41 |
-
| 3 |
|
| 42 |
-
| 4 |
|
| 43 |
-
| 5 |
|
| 44 |
-
| 6 |
|
| 45 |
-
| 7 |
|
| 46 |
-
| 8 |
|
| 47 |
-
| 9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
|
| 49 |
## Package versions
|
| 50 |
|
|
|
|
| 25 |
The model is structured as follows:
|
| 26 |
|
| 27 |
```
|
| 28 |
+
ClusteringTopicModel(clustering=KMeans(n_clusters=20),
|
| 29 |
+
dimensionality_reduction=PCA(n_components=5),
|
| 30 |
+
feature_importance='c-tf-idf',
|
| 31 |
+
vectorizer=CountVectorizer(min_df=10,
|
| 32 |
+
stop_words='english'))
|
| 33 |
```
|
| 34 |
|
| 35 |
## Topics
|
| 36 |
The topics discovered by the model are the following:
|
| 37 |
|
| 38 |
+
| Topic ID | Highest Ranking |
|
| 39 |
+
| - | - |
|
| 40 |
+
| 0 | ax, max, g9v, b8f, jpeg, pl, a86, db, 1d9, file |
|
| 41 |
+
| 1 | drive, scsi, price, card, sale, 00, shipping, ram, pc, offer |
|
| 42 |
+
| 2 | pathetic, path, patient, patience, paths, pathology, patrick, patent, patently, patriot |
|
| 43 |
+
| 3 | key, encryption, government, clipper, chip, keys, law, use, nsa, escrow |
|
| 44 |
+
| 4 | people, right, don, think, just, government, like, say, does, rights |
|
| 45 |
+
| 5 | game, team, year, 25, play, games, players, 10, 55, season |
|
| 46 |
+
| 6 | dos, windows, image, file, edu, ftp, version, files, available, program |
|
| 47 |
+
| 7 | god, jesus, bible, people, christ, believe, christians, christian, faith, say |
|
| 48 |
+
| 8 | mr, president, people, fbi, gun, think, did, don, batf, know |
|
| 49 |
+
| 9 | space, use, new, launch, used, like, don, know, just, 00 |
|
| 50 |
+
| 10 | god, jews, people, church, does, did, christian, greek, just, israel |
|
| 51 |
+
| 11 | car, just, like, don, people, think, money, insurance, make, time |
|
| 52 |
+
| 12 | software, windows, thanks, know, version, does, ftp, available, xfree86, pc |
|
| 53 |
+
| 13 | ax, edu, information, pub, space, ftp, data, mail, file, entry |
|
| 54 |
+
| 14 | hockey, game, games, team, season, nhl, la, league, don, pts |
|
| 55 |
+
| 15 | armenian, armenians, turkish, people, said, israel, jews, genocide, israeli, armenia |
|
| 56 |
+
| 16 | 00, car, new, 50, price, bike, good, like, 1st, 10 |
|
| 57 |
+
| 17 | like, just, time, problem, don, use, know, vitamin, good, think |
|
| 58 |
+
| 18 | drive, scsi, card, disk, windows, controller, drives, use, bus, ide |
|
| 59 |
+
| 19 | ax, max, edu, com, b8f, ah, 145, a86, pl, air |
|
| 60 |
|
| 61 |
## Package versions
|
| 62 |
|
model.joblib
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7c44d3bc14662cd8548ed94cbb5294496d2d8c20874fb2f5189797b0a0fbce92
|
| 3 |
+
size 139475171
|