Question regarding the Stage 1 training procedure

#1
by floschne - opened

Hi, and first of all thanks for making your models and datasets open-source!

I just read your paper and was wondering how, i.e., with which data, you trained the MLP projector in Stage 1? Did you use multilingual image captions or english-only?

Sign up or log in to comment