Parallel Sentences Datasets Collection Every dataset in this collection has "english" and "non_english" columns. They can be used to make embedding models multilingual. • 14 items • Updated Feb 25 • 19
Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 14 items • Updated Apr 7 • 11
Article Visual Document Retrieval Goes Multilingual By marco and 1 other • Jan 10 • 74
Article ColPali: Efficient Document Retrieval with Vision Language Models By manu • Jul 5, 2024 • 266
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S* By xhluca • Jul 9, 2024 • 55
Running 2.72k The Ultra-Scale Playbook The ultimate guide to training LLM on large GPU Clusters
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities Paper • 2410.07722 • Published Oct 10, 2024 • 13
lsr42/uniir-sf-vit-large-patch14-336-best Zero-Shot Image Classification • 0.4B • Updated Aug 12, 2024 • 10
lsr42/uniir-sf-vit-large-patch14-336-epoch16 Zero-Shot Image Classification • 0.4B • Updated Aug 10, 2024 • 15
lsr42/uniir-sf-vit-large-patch14-336-epoch12 Zero-Shot Image Classification • 0.4B • Updated Aug 9, 2024 • 14
lsr42/uniir-sf-vit-large-patch14-336 Zero-Shot Image Classification • 0.4B • Updated Aug 9, 2024 • 37