This is the default data we used for DPO training, sourced from textvqa and ocrvqa. The data are constructed in an unsupervised way by SeVa. You can download them for reproduce purpose.
This is the default data we used for DPO training, sourced from textvqa and ocrvqa. The data are constructed in an unsupervised way by SeVa. You can download them for reproduce purpose.