Which dataset did you use to train?

by ashutoshsaboo - opened Mar 1

Interesting that you used ModernBERT for the punctuation restoration task. I've been training similar models (which I will open-source soon) and wanted to baseline against yours. Which dataset did you pretrain your model on? @whooray

whooray

Owner Mar 3

Hi @ashutoshsaboo,
I trained the model using:
Training data: libriheavy & mls-eng datasets
Evaluation data: mls-eng dataset

Thanks for asking.
