Yosef Worku Alemneh
rasyosef
AI & ML interests
Pretraining, Supervised Fine Tuning, Direct Preference Optimization, Retrieval Augmented Generation (RAG), Function Calling
Recent Activity
updated a collection 6 days ago: Phi 1.5 Chat Models
new activity 19 days ago in rasyosef/flores_english_amharic_mt: Update README.md to include ISO code for English
rasyosef's activity
Update README.md to include ISO code for English (#2, opened 4 months ago by weezygeezer)
Update model metadata to set pipeline tag to the new `text-ranking` and tags to `sentence-transformers` (#1, opened about 2 months ago by tomaarsen)
Using hard negatives VS query, pos pair to train embedding models (4 replies, #2, opened 3 months ago by rasyosef)
Adding Evaluation Results (#1, opened 8 months ago by leaderboard-pr-bot)
Adding Evaluation Results (#3, opened 9 months ago by leaderboard-pr-bot)
Phi-2-Instruct-APO: aligned with Anchored Preference Optimization (16 replies, #3, opened 8 months ago by rasyosef)
[Query-ISSUE] tokenizer.vocab_size is 128000, however len(tokenizer) is 128256, which prevents me from using those other tokens. (1 reply, #34, opened 7 months ago by HV-Khurdula)
What are the start and stop tokens of this model? (1 reply, #40, opened 7 months ago by aryaash)
Is the BOS token id of 128000 hardcoded into the llama 3.2 tokenizer? (2 replies, #17, opened 8 months ago by rasyosef)
Mistral-NeMo-Minitron-8B-Chat (🚀 1, 5 replies, #5, opened 9 months ago by rasyosef)
APO Trainer in TRL? (1 reply, #2, opened 9 months ago by rasyosef)
ChatML template does not work properly (10 replies, #2, opened 9 months ago by WasamiKirua)
Collaboration (1 reply, #1, opened 9 months ago by deleted)
Error when trying to run (1 reply, #1, opened 9 months ago by ctranslate2-4you)
What changed for people using this model in english? (👍 1, 3 replies, #3, opened 9 months ago by migueltalka)
What should a finetuned model's license be if the model is MIT but the datasets are Apache 2.0 and cc-by-4.0 (5 replies, #866, opened 10 months ago by rasyosef)
Update README.md (1 reply, #2, opened 10 months ago by seyyaw)
Duplicate? (❤️ 1, 1 reply, #2, opened about 1 year ago by israel)