AngelPanizo committed on
Commit 6dbeff3 · verified · 1 Parent(s): e840459

Add BERTopic model

README.md ADDED
@@ -0,0 +1,80 @@
+
+ ---
+ tags:
+ - bertopic
+ library_name: bertopic
+ pipeline_tag: text-classification
+ ---
+
+ # MARTINI_enrich_BERTopic_UKcitizen2021
+
+ This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model.
+ BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
+
+ ## Usage
+
+ To use this model, please install BERTopic:
+
+ ```
+ pip install -U bertopic
+ ```
+
+ You can use the model as follows:
+
+ ```python
+ from bertopic import BERTopic
+ topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_UKcitizen2021")
+
+ topic_model.get_topic_info()
+ ```
+
+ ## Topic overview
+
+ * Number of topics: 11
+ * Number of training documents: 721
+
+ <details>
+ <summary>Click here for an overview of all topics.</summary>
+
+ | Topic ID | Topic Keywords | Topic Frequency | Label |
+ |----------|----------------|-----------------|-------|
+ | -1 | lockdown - wef - monkeypox - agenda - nuremberg | 22 | -1_lockdown_wef_monkeypox_agenda |
+ | 0 | vaccinated - pfizer - vaxx - worldcouncilforhealth - injections | 313 | 0_vaccinated_pfizer_vaxx_worldcouncilforhealth |
+ | 1 | ukcitizen2021 - amendments - supranational - pandemic - wgihr | 89 | 1_ukcitizen2021_amendments_supranational_pandemic |
+ | 2 | constable - arrested - victims - allegations - thamesvalley | 73 | 2_constable_arrested_victims_allegations |
+ | 3 | bbc - matt - tomorrow - southampton - everywhere | 51 | 3_bbc_matt_tomorrow_southampton |
+ | 4 | vaccines - mhra - parliamentary - claims - chope | 42 | 4_vaccines_mhra_parliamentary_claims |
+ | 5 | ukcitizen2021 - councils - mobilise - responses - nottinghamshire | 35 | 5_ukcitizen2021_councils_mobilise_responses |
+ | 6 | ivermectin - hydroxychloroquine - quercetin - iodine - denied | 26 | 6_ivermectin_hydroxychloroquine_quercetin_iodine |
+ | 7 | solicitors - mhra - regulatory - bayliss - allegations | 25 | 7_solicitors_mhra_regulatory_bayliss |
+ | 8 | vaccination - nhs - consent - compulsory - jobsnotjabs | 23 | 8_vaccination_nhs_consent_compulsory |
+ | 9 | digital - controligarchs - england - passport - currency | 22 | 9_digital_controligarchs_england_passport |
+
+ </details>
+
+ ## Training hyperparameters
+
+ * calculate_probabilities: True
+ * language: None
+ * low_memory: False
+ * min_topic_size: 10
+ * n_gram_range: (1, 1)
+ * nr_topics: None
+ * seed_topic_list: None
+ * top_n_words: 10
+ * verbose: False
+ * zeroshot_min_similarity: 0.7
+ * zeroshot_topic_list: None
+
+ ## Framework versions
+
+ * Numpy: 1.26.4
+ * HDBSCAN: 0.8.40
+ * UMAP: 0.5.7
+ * Pandas: 2.2.3
+ * Scikit-Learn: 1.5.2
+ * Sentence-transformers: 3.3.1
+ * Transformers: 4.46.3
+ * Numba: 0.60.0
+ * Plotly: 5.24.1
+ * Python: 3.10.12
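The `Label` column in the README table appears to follow a simple pattern: the topic ID, then the first four keywords, joined by underscores. A minimal sketch of that pattern, assuming the keyword cell uses ` - ` as its separator as it does in the table; the helper name `default_label` is hypothetical and is not a BERTopic function:

```python
def default_label(topic_id: int, keywords: str, n_words: int = 4) -> str:
    """Rebuild a label such as '6_ivermectin_hydroxychloroquine_quercetin_iodine'."""
    # Keywords in the README table are separated by " - ".
    words = [word.strip() for word in keywords.split(" - ")]
    # Only the first n_words keywords end up in the label.
    return "_".join([str(topic_id)] + words[:n_words])


row_keywords = "ivermectin - hydroxychloroquine - quercetin - iodine - denied"
label = default_label(6, row_keywords)
print(label)
```

The fifth keyword of each row (here `denied`) is listed in the table but does not appear in the label.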
config.json ADDED
@@ -0,0 +1,16 @@
+ {
+ "calculate_probabilities": true,
+ "language": null,
+ "low_memory": false,
+ "min_topic_size": 10,
+ "n_gram_range": [
+ 1,
+ 1
+ ],
+ "nr_topics": null,
+ "seed_topic_list": null,
+ "top_n_words": 10,
+ "verbose": false,
+ "zeroshot_min_similarity": 0.7,
+ "zeroshot_topic_list": null
+ }
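JSON has no tuple type, so `n_gram_range` is stored here as the array `[1, 1]`, while the README lists it as `(1, 1)`. A minimal sketch of reading the file by hand and restoring the tuple, assuming the config is available as a string; the variable names are illustrative only:

```python
import json

# config.json contents from this commit, inlined so the example is self-contained.
CONFIG_TEXT = """
{
  "calculate_probabilities": true,
  "language": null,
  "low_memory": false,
  "min_topic_size": 10,
  "n_gram_range": [1, 1],
  "nr_topics": null,
  "seed_topic_list": null,
  "top_n_words": 10,
  "verbose": false,
  "zeroshot_min_similarity": 0.7,
  "zeroshot_topic_list": null
}
"""

params = json.loads(CONFIG_TEXT)
# JSON arrays decode to Python lists; convert back to the tuple form shown in the README.
params["n_gram_range"] = tuple(params["n_gram_range"])
print(params["n_gram_range"], params["min_topic_size"])
```

When the model is loaded with `BERTopic.load`, the library reads this file itself; the conversion matters only when inspecting the config manually.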
ctfidf.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5f68c6e440c8ed3004fc2e332cc56cd13f965f14148adef8c1c6957759020f63
+ size 242628
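The three added lines are a Git LFS pointer, not the tensor data itself: `version` names the pointer spec, `oid` carries the SHA-256 digest of the real file, and `size` is its length in bytes. A minimal sketch of parsing such a pointer, assuming the space-separated key/value layout shown in this diff; the function `parse_lfs_pointer` is hypothetical:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:5f68c6e440c8ed3004fc2e332cc56cd13f965f14148adef8c1c6957759020f63
size 242628
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["hash_algo"], pointer["size"])
```

The same layout applies to the `topic_embeddings.safetensors` pointer in this commit.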
ctfidf_config.json ADDED
The diff for this file is too large to render. See raw diff
 
topic_embeddings.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a3a6b7805c8752c5f803df248835bece1eb56aa1545ae230675f6876e2b56646
+ size 45144
topics.json ADDED
@@ -0,0 +1,1055 @@
+ {
+ "topic_representations": {
+ "-1": [
+ [
+ "lockdown",
+ 0.5454896688461304
+ ],
+ [
+ "wef",
+ 0.5361734628677368
+ ],
+ [
+ "monkeypox",
+ 0.4691636264324188
+ ],
+ [
+ "agenda",
+ 0.4670596122741699
+ ],
+ [
+ "nuremberg",
+ 0.4627874493598938
+ ]
+ ],
+ "0": [
+ [
+ "vaccinated",
+ 0.6241074800491333
+ ],
+ [
+ "pfizer",
+ 0.5701377391815186
+ ],
+ [
+ "vaxx",
+ 0.5671300888061523
+ ],
+ [
+ "worldcouncilforhealth",
+ 0.5152227878570557
+ ],
+ [
+ "injections",
+ 0.5127284526824951
+ ]
+ ],
+ "1": [
+ [
+ "ukcitizen2021",
+ 0.5685036182403564
+ ],
+ [
+ "amendments",
+ 0.543997585773468
+ ],
+ [
+ "supranational",
+ 0.49074500799179077
+ ],
+ [
+ "pandemic",
+ 0.4897991418838501
+ ],
+ [
+ "wgihr",
+ 0.48632997274398804
+ ]
+ ],
+ "2": [
+ [
+ "constable",
+ 0.503520131111145
+ ],
+ [
+ "arrested",
+ 0.49651211500167847
+ ],
+ [
+ "victims",
+ 0.4883367419242859
+ ],
+ [
+ "allegations",
+ 0.4740084409713745
+ ],
+ [
+ "thamesvalley",
+ 0.4643774628639221
+ ]
+ ],
+ "3": [
+ [
+ "bbc",
+ 0.6183251738548279
+ ],
+ [
+ "matt",
+ 0.5457966327667236
+ ],
+ [
+ "tomorrow",
+ 0.542050838470459
+ ],
+ [
+ "southampton",
+ 0.5249444842338562
+ ],
+ [
+ "everywhere",
+ 0.5166743993759155
+ ]
+ ],
+ "4": [
+ [
+ "vaccines",
+ 0.5754696130752563
+ ],
+ [
+ "mhra",
+ 0.5523267388343811
+ ],
+ [
+ "parliamentary",
+ 0.4844176769256592
+ ],
+ [
+ "claims",
+ 0.47689953446388245
+ ],
+ [
+ "chope",
+ 0.45652931928634644
+ ]
+ ],
+ "5": [
+ [
+ "ukcitizen2021",
+ 0.6426631212234497
+ ],
+ [
+ "councils",
+ 0.5268024802207947
+ ],
+ [
+ "mobilise",
+ 0.5205245018005371
+ ],
+ [
+ "responses",
+ 0.4929749369621277
+ ],
+ [
+ "nottinghamshire",
+ 0.4862140417098999
+ ]
+ ],
+ "6": [
+ [
+ "ivermectin",
+ 0.68222975730896
+ ],
+ [
+ "hydroxychloroquine",
+ 0.6154346466064453
+ ],
+ [
+ "quercetin",
+ 0.46053260564804077
+ ],
+ [
+ "iodine",
+ 0.4282894730567932
+ ],
+ [
+ "denied",
+ 0.42202115058898926
+ ]
+ ],
+ "7": [
+ [
+ "solicitors",
+ 0.599446177482605
+ ],
+ [
+ "mhra",
+ 0.4934234619140625
+ ],
+ [
+ "regulatory",
+ 0.49167096614837646
+ ],
+ [
+ "bayliss",
+ 0.4903753697872162
+ ],
+ [
+ "allegations",
+ 0.4804745316505432
+ ]
+ ],
+ "8": [
+ [
+ "vaccination",
+ 0.6087295413017273
+ ],
+ [
+ "nhs",
+ 0.5676723718643188
+ ],
+ [
+ "consent",
+ 0.506838858127594
+ ],
+ [
+ "compulsory",
+ 0.4858030676841736
+ ],
+ [
+ "jobsnotjabs",
+ 0.4713505506515503
+ ]
+ ],
+ "9": [
+ [
+ "digital",
+ 0.5332129001617432
+ ],
+ [
+ "controligarchs",
+ 0.4792712926864624
+ ],
+ [
+ "england",
+ 0.46954020857810974
+ ],
+ [
+ "passport",
+ 0.4463684856891632
+ ],
+ [
+ "currency",
+ 0.4433884024620056
+ ]
+ ]
+ },
+ "topics": [
+ 1,
+ -1,
+ 8,
+ 3,
+ -1,
+ -1,
+ 4,
+ -1,
+ 9,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ -1,
+ -1,
+ -1,
+ 1,
+ 3,
+ 1,
+ 1,
+ 1,
+ -1,
+ -1,
+ 3,
+ 1,
+ -1,
+ -1,
+ 0,
+ -1,
+ 4,
+ -1,
+ 1,
+ 1,
+ 4,
+ -1,
+ 1,
+ -1,
+ 1,
+ 5,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ -1,
+ -1,
+ -1,
+ 4,
+ -1,
+ 1,
+ 1,
+ -1,
+ 1,
+ -1,
+ -1,
+ -1,
+ 1,
+ 3,
+ 1,
+ 1,
+ 1,
+ -1,
+ 3,
+ 1,
+ -1,
+ 3,
+ -1,
+ 1,
+ 1,
+ 1,
+ -1,
+ -1,
+ 1,
+ 1,
+ 1,
+ 9,
+ 1,
+ 9,
+ 2,
+ 2,
+ -1,
+ 1,
+ 2,
+ 1,
+ 1,
+ -1,
+ -1,
+ 1,
+ -1,
+ 5,
+ 1,
+ -1,
+ 0,
+ 2,
+ -1,
+ 1,
+ 1,
+ 9,
+ 9,
+ 9,
+ -1,
+ 3,
+ 9,
+ -1,
+ -1,
+ 0,
+ 1,
+ -1,
+ -1,
+ -1,
+ 9,
+ 5,
+ 9,
+ 9,
+ 1,
+ 5,
+ -1,
+ 5,
+ 1,
+ 9,
+ 5,
+ 1,
+ -1,
+ 3,
+ 5,
+ 3,
+ 5,
+ -1,
+ 5,
+ -1,
+ 1,
+ 7,
+ 1,
+ 4,
+ 9,
+ 4,
+ 4,
+ 4,
+ 3,
+ 3,
+ 0,
+ 3,
+ 2,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ 3,
+ 9,
+ 7,
+ 9,
+ 2,
+ 7,
+ 3,
+ -1,
+ 3,
+ 1,
+ -1,
+ 3,
+ 8,
+ -1,
+ 3,
+ 9,
+ 7,
+ 7,
+ -1,
+ -1,
+ -1,
+ 7,
+ 3,
+ -1,
+ 2,
+ -1,
+ -1,
+ 3,
+ -1,
+ 4,
+ 2,
+ 4,
+ -1,
+ 9,
+ 2,
+ -1,
+ -1,
+ -1,
+ 3,
+ 1,
+ 7,
+ -1,
+ -1,
+ 4,
+ 3,
+ -1,
+ -1,
+ 8,
+ -1,
+ -1,
+ -1,
+ 0,
+ 7,
+ 0,
+ 3,
+ 4,
+ 7,
+ 0,
+ 4,
+ 7,
+ 7,
+ -1,
+ -1,
+ 7,
+ 7,
+ 9,
+ 0,
+ -1,
+ -1,
+ 4,
+ 3,
+ 3,
+ -1,
+ -1,
+ -1,
+ 7,
+ 7,
+ 0,
+ 7,
+ 7,
+ -1,
+ 0,
+ 3,
+ 3,
+ 3,
+ -1,
+ -1,
+ 0,
+ -1,
+ 0,
+ 7,
+ 4,
+ 0,
+ 7,
+ -1,
+ -1,
+ -1,
+ 0,
+ -1,
+ 4,
+ 7,
+ 7,
+ 7,
+ -1,
+ -1,
+ 4,
+ -1,
+ 4,
+ 4,
+ -1,
+ 4,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ 6,
+ 0,
+ -1,
+ 8,
+ -1,
+ 5,
+ 0,
+ 6,
+ -1,
+ 4,
+ 0,
+ -1,
+ -1,
+ 9,
+ 3,
+ 0,
+ -1,
+ -1,
+ -1,
+ 9,
+ -1,
+ -1,
+ -1,
+ 4,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ 4,
+ -1,
+ 0,
+ 4,
+ -1,
+ -1,
+ 3,
+ -1,
+ 4,
+ -1,
+ 0,
+ -1,
+ -1,
+ 2,
+ 0,
+ -1,
+ 5,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ 0,
+ -1,
+ 9,
+ 2,
+ -1,
+ 2,
+ 0,
+ -1,
+ -1,
+ 8,
+ 2,
+ -1,
+ 2,
+ 2,
+ 2,
+ 0,
+ 1,
+ 0,
+ 0,
+ -1,
+ -1,
+ 0,
+ 0,
+ 0,
+ -1,
+ 9,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 1,
+ 9,
+ -1,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ 2,
+ -1,
+ 0,
+ -1,
+ -1,
+ 1,
+ 4,
+ 4,
+ -1,
+ -1,
+ 2,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ 3,
+ -1,
+ 8,
+ 5,
+ -1,
+ -1,
+ 3,
+ -1,
+ -1,
+ -1,
+ 3,
+ -1,
+ 2,
+ 2,
+ -1,
+ 4,
+ 2,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 0,
+ 1,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ 2,
+ 2,
+ -1,
+ -1,
+ -1,
+ -1,
+ 3,
+ -1,
+ -1,
+ -1,
+ 3,
+ 0,
+ -1,
+ 6,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 3,
+ -1,
+ -1,
+ -1,
+ 1,
+ -1,
+ -1,
+ 4,
+ -1,
+ -1,
+ -1,
+ -1,
+ 1,
+ 1,
+ -1,
+ -1,
+ 6,
+ 6,
+ -1,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ 1,
+ -1,
+ -1,
+ 1,
+ 0,
+ -1,
+ 0,
+ -1,
+ 0,
+ -1,
+ 0,
+ -1,
+ 0,
+ -1,
+ -1,
+ 1,
+ -1,
+ -1,
+ 1,
+ 1,
+ -1,
+ -1,
+ -1,
+ 0,
+ 5,
+ -1,
+ 1,
+ -1,
+ -1,
+ -1,
+ 8,
+ -1,
+ 0,
+ -1,
+ 6,
+ -1,
+ 0,
+ 0,
+ -1,
+ 2,
+ 3,
+ -1,
+ 0,
+ 8,
+ 3,
+ -1,
+ 3,
+ 0,
+ 5,
+ -1,
+ 4,
+ 4,
+ -1,
+ 2,
+ 8,
+ -1,
+ 0,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 2,
+ 2,
+ -1,
+ 0,
+ 5,
+ 0,
+ 0,
+ -1,
+ 0,
+ -1,
+ 2,
+ -1,
+ 6,
+ 6,
+ 4,
+ 2,
+ 2,
+ 2,
+ -1,
+ 2,
+ 2,
+ 0,
+ 0,
+ -1,
+ -1,
+ 2,
+ 1,
+ 2,
+ -1,
+ 2,
+ -1,
+ 2,
+ 0,
+ 0,
+ -1,
+ 4,
+ -1,
+ -1,
+ 2,
+ 8,
+ 2,
+ 2,
+ -1,
+ 2,
+ 2,
+ 0,
+ 0,
+ -1,
+ -1,
+ 8,
+ -1,
+ 2,
+ 2,
+ -1,
+ 2,
+ -1,
+ 2,
+ 2,
+ -1,
+ -1,
+ -1,
+ 2,
+ 6,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 5,
+ -1,
+ 4,
+ -1,
+ 5,
+ 5,
+ -1,
+ -1,
+ 5,
+ 5,
+ 5,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 3,
+ -1,
+ -1,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ 5,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ 6,
+ -1,
+ -1,
+ 0,
+ 0,
+ -1,
+ -1,
+ 0,
+ -1,
+ 0,
+ 0,
+ 8,
+ 8,
+ -1,
+ 8,
+ -1,
+ 5,
+ 8,
+ 0,
+ 0,
+ 8,
+ 0,
+ 3,
+ 8,
+ 6,
+ 6,
+ 0,
+ 0,
+ 6,
+ 8,
+ 8,
+ -1,
+ 6,
+ -1,
+ 0,
+ 6,
+ 5,
+ 0,
+ 0,
+ -1,
+ -1,
+ -1,
+ 0,
+ 8,
+ -1,
+ 6,
+ -1,
+ 0,
+ 6,
+ 7,
+ 3,
+ -1,
+ -1,
+ -1,
+ 0,
+ 2,
+ 6,
+ 6,
+ 6,
+ 8,
+ 6,
+ 0,
+ 0,
+ 2,
+ -1,
+ 6,
+ 2,
+ 8,
+ 6,
+ 0,
+ 0,
+ -1,
+ -1,
+ 0,
+ -1,
+ -1,
+ -1,
+ -1,
+ -1,
+ 6,
+ -1,
+ 0,
+ 0,
+ 6,
+ -1,
+ 0,
+ -1,
+ 3,
+ 4,
+ 5,
+ -1,
+ 5
+ ],
+ "topic_sizes": {
+ "1": 73,
+ "-1": 313,
+ "8": 22,
+ "3": 42,
+ "4": 35,
+ "9": 22,
+ "0": 89,
+ "5": 26,
+ "2": 51,
+ "7": 23,
+ "6": 25
+ },
+ "topic_mapper": [
+ [
+ -1,
+ -1,
+ -1
+ ],
+ [
+ 0,
+ 0,
+ 7
+ ],
+ [
+ 1,
+ 1,
+ 9
+ ],
+ [
+ 2,
+ 2,
+ 8
+ ],
+ [
+ 3,
+ 3,
+ 6
+ ],
+ [
+ 4,
+ 4,
+ 0
+ ],
+ [
+ 5,
+ 5,
+ 4
+ ],
+ [
+ 6,
+ 6,
+ 2
+ ],
+ [
+ 7,
+ 7,
+ 3
+ ],
+ [
+ 8,
+ 8,
+ 5
+ ],
+ [
+ 9,
+ 9,
+ 1
+ ]
+ ],
+ "topic_labels": {
+ "-1": "-1_lockdown_wef_monkeypox_agenda",
+ "0": "0_vaccinated_pfizer_vaxx_worldcouncilforhealth",
+ "1": "1_ukcitizen2021_amendments_supranational_pandemic",
+ "2": "2_constable_arrested_victims_allegations",
+ "3": "3_bbc_matt_tomorrow_southampton",
+ "4": "4_vaccines_mhra_parliamentary_claims",
+ "5": "5_ukcitizen2021_councils_mobilise_responses",
+ "6": "6_ivermectin_hydroxychloroquine_quercetin_iodine",
+ "7": "7_solicitors_mhra_regulatory_bayliss",
+ "8": "8_vaccination_nhs_consent_compulsory",
+ "9": "9_digital_controligarchs_england_passport"
+ },
+ "custom_labels": null,
+ "_outliers": 1,
+ "topic_aspects": {}
+ }
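`topic_sizes` looks like a plain tally of the `topics` array, with keys in order of first appearance (1, -1, 8, 3, ...). Recounting the full 721-entry list is a reasonable way to cross-check frequencies, since `topic_sizes` records 313 documents for topic -1 and 89 for topic 0, whereas the README table lists 22 and 313. A minimal sketch of that tally, using only the first twelve assignments from the array so the example stays short; the excerpt and variable names are illustrative:

```python
from collections import Counter

# First twelve per-document topic assignments from the "topics" array.
first_assignments = [1, -1, 8, 3, -1, -1, 4, -1, 9, 1, 1, 1]

# Counter keeps keys in order of first appearance, like "topic_sizes" does.
sizes = Counter(first_assignments)

# JSON object keys are strings, so stringify the topic IDs for comparison.
size_by_key = {str(topic): count for topic, count in sizes.items()}
print(size_by_key)
```

Replacing `first_assignments` with the complete array loaded from `topics.json` would give counts to compare against both `topic_sizes` and the README's Topic Frequency column.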