view article Article How to train a new language model from scratch using Transformers and Tokenizers By julien-c • Feb 14, 2020 • 37
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 39
OpenLLMTurkishLeadboard Datasets Collection This Collection contains a mix of benchmarks. used for evaluation in the openllm lead-board for Turkish LLMs • 6 items • Updated Apr 26, 2024 • 4
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.25k