Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • May 21 • 174
Article Gotchas in Tokenizer Behavior Every Developer Should Know By qgallouedec • Apr 18 • 37
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 152
Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 59