afrideva

afri_deva

AI & ML interests

None yet

afrideva's activity

upvoted an article 3 months ago

Building an African Cultural Dataset with SmoLAgents: Experimental

Article • Feb 7 • 4

upvoted an article 8 months ago

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Article • Oct 14, 2024 • 91

upvoted a paper 11 months ago

Conciseness: An Overlooked Language Task

Paper • 2211.04126 • Published Nov 8, 2022 • 2

upvoted a collection about 1 year ago

IrokoBench

a human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU and MGSM • 6 items • Updated May 31, 2024 • 20

upvoted a paper about 1 year ago

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20, 2024 • 20

upvoted 2 collections over 1 year ago

Medical Evaluation Datasets

46 items • Updated 5 days ago • 8

Foundation Text-Generation Models Below 360M Parameters

Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 36 items • Updated Apr 6 • 32

upvoted 2 papers over 1 year ago

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16, 2024 • 37

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 149

upvoted 2 collections over 1 year ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard • 65 items • Updated Mar 20 • 600

Trained Models 🏋️

They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 20

upvoted 2 papers over 1 year ago

EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation

Paper • 2310.08185 • Published Oct 12, 2023 • 8

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 39

upvoted 7 collections over 1 year ago

ChatGPT-Mini

A collection of fine-tuned GPT-2 models each designed to deploy a ChatGPT-like model at home. These models can also be deployed on an old computer. • 8 items • Updated Nov 16, 2023 • 5

Merged Models

Using mergekit • 10 items • Updated Mar 1, 2024 • 3

smol llama

🚧"raw" pretrained smol_llama checkpoints - WIP 🚧 • 4 items • Updated Apr 29, 2024 • 6

Coding datasets

3 items • Updated Nov 23, 2023 • 4

Indic language fine-tunes

Halted State: Attempting to create acceptable quality fine-tunes of different models • 1 item • Updated Nov 23, 2023 • 1

PIC (Partner-in-Crime) project

Empathetic, small, really useful personalised models. • 3 items • Updated Dec 10, 2023 • 2

Cramp(ed) Models

Smaller models trained locally on my 2xA6000 Lambda Vector • 3 items • Updated Oct 10, 2023 • 1