This collection contains currated text similarity datasets that are available in huggingface dataset
-
jakartaresearch/id-paraphrase-detection
Viewer • Updated • 5.8k • 37 • 3 -
andreaschandra/quora-question-pairs-id
Viewer • Updated • 1k • 10 • 1 -
sentence-transformers/parallel-sentences-global-voices
Viewer • Updated • 2.2M • 326 -
sentence-transformers/parallel-sentences-opensubtitles
Viewer • Updated • 274M • 756 • 3