Question regarding the Stage 1 training procedure
#1
by
floschne
- opened
Hi, and first of all thanks for making your models and datasets open-source!
I just read your paper and was wondering how, i.e., with which data, you trained the MLP projector in Stage 1? Did you use multilingual image captions or english-only?