dataset plagiarism

#3
by breadlicker45 - opened

I suspect you of dataset plagiarism by pretraining on my toast midi dataset without crediting me. When my dataset is cleaned it comes to 1.6 millon midi files. I manually downloaded most of the dataset and trained a model on it. So I am pretty good at recognizing when something is trained on my dataset.
https://huggingface.co/datasets/breadlicker45/toast-midi-dataset

breadlicker45 changed discussion title from dataset plagiarism to *dataset plagiarism*
breadlicker45 changed discussion title from *dataset plagiarism* to dataset plagiarism
breadlicker45 changed discussion status to closed
breadlicker45 changed discussion status to open

Sign up or log in to comment