dataset plagiarism
#3
by
breadlicker45
- opened
I suspect you of dataset plagiarism by pretraining on my toast midi dataset without crediting me. When my dataset is cleaned it comes to 1.6 millon midi files. I manually downloaded most of the dataset and trained a model on it. So I am pretty good at recognizing when something is trained on my dataset.
https://huggingface.co/datasets/breadlicker45/toast-midi-dataset
breadlicker45
changed discussion title from
dataset plagiarism
to *dataset plagiarism*
breadlicker45
changed discussion title from
*dataset plagiarism*
to dataset plagiarism
breadlicker45
changed discussion status to
closed
breadlicker45
changed discussion status to
open