Towards Scalable Pre-training of Visual Tokenizers for Generation
Submitted by Jingfeng Yao (MiniMax)