Training questions
#7, opened by deoxykev
Roughly how many extraction examples was this model fine-tuned on? Was any grokking observed during the run?
The model was trained on ~50k examples. See the blog post for details: https://numind.ai/blog/nuextract-a-foundation-model-for-structured-extraction