
Cagatay Demirbas

Cagatayd

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago
Cagatayd/Llama3.2-doker
published a model 3 days ago
Cagatayd/Llama3.2-doker
updated a model 10 days ago
Cagatayd/llama3.2-1B-Instruct-Egitim

Organizations

None yet

Cagatayd's activity

replied to grimjim's post 4 months ago

Hi, also, some prompts in DPO datasets end with "\nAnswer:" or "\nOutput:". Should we include that suffix in the prompt or not?

for example:

dataset[0]['prompt'] = ".....where is the capital city of Germany.\nOutput:"

dataset[1]['prompt'] = '......collectively became states of the Commonwealth of Australia."?\nAnswer:'
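
For concreteness, these are the two options I'm weighing (a minimal sketch; `TRAILING_CUES` and `strip_trailing_cue` are just names I made up for illustration, not library APIs):

```python
# Two ways to handle trailing cues like "\nAnswer:" / "\nOutput:" in DPO prompts.
# TRAILING_CUES and strip_trailing_cue are hypothetical names, not from any library.

TRAILING_CUES = ("\nAnswer:", "\nOutput:")

def strip_trailing_cue(prompt: str) -> str:
    """Option A: drop the cue and let the chat template mark the prompt/response boundary."""
    for cue in TRAILING_CUES:
        if prompt.endswith(cue):
            return prompt[: -len(cue)].rstrip()
    return prompt

# Option B: keep the prompt exactly as-is, so the model is conditioned on the
# same cue the original dataset was written with.
prompt = "Where is the capital city of Germany?\nOutput:"
print(strip_trailing_cue(prompt))  # Option A -> "Where is the capital city of Germany?"
```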

replied to grimjim's post 4 months ago

Thanks, that's what I thought, and I'm relieved that you think so too.

replied to grimjim's post 4 months ago

Hi, I have a question for you; @John6666 mentioned you in the comments of my topic.

When preparing a dataset for DPO (Direct Preference Optimization) training, should the “prompt” be repeated in the “chosen” and “rejected” columns?

I’ve come across some conflicting information regarding the proper formatting of the dataset for DPO training. Some sources suggest that the prompt should be included in both the “chosen” and “rejected” responses to provide full context, while others state that the prompt should be kept separate and not repeated in these columns.

Additionally, when working with multi-turn dialogue data, I’m unsure how to properly format the dataset. Should the “chosen” and “rejected” columns include the entire conversation history up to that point, or just the assistant’s most recent response following the latest user input?

Could someone clarify the correct approach for formatting the dataset? Should the “chosen” and “rejected” columns contain only the assistant’s responses following the prompt, or should they include the prompt as well? And how should I handle multi-turn dialogues in this context?
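
To make the question concrete, this is the flat layout I'm currently assuming, based on my reading of TRL's DPOTrainer docs (prompt kept in its own column, chosen/rejected holding only the assistant responses), plus the message-list layout I'd guess applies to multi-turn data; please correct me if either is wrong:

```python
from datasets import Dataset

# Flat layout I'm assuming: the prompt lives only in its own column, and
# "chosen"/"rejected" hold just the two competing assistant responses.
flat = Dataset.from_dict({
    "prompt":   ["What is the capital city of Germany?"],
    "chosen":   ["The capital city of Germany is Berlin."],
    "rejected": ["Germany has no capital city."],
})
print(flat[0])

# Conversational layout I'd guess for multi-turn data: "prompt" carries the
# whole history up to the last user turn, while "chosen"/"rejected" contain
# only the final assistant reply, not the history again.
multi_turn_example = {
    "prompt": [
        {"role": "user", "content": "Who won the 2014 World Cup?"},
        {"role": "assistant", "content": "Germany won the 2014 World Cup."},
        {"role": "user", "content": "Who did they beat in the final?"},
    ],
    "chosen": [
        {"role": "assistant", "content": "They beat Argentina 1-0 in the final."},
    ],
    "rejected": [
        {"role": "assistant", "content": "They beat Brazil in the final."},
    ],
}
```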

I also wonder how to prepare multi-turn conversation data such as Anthropic/hh-rlhf for DPO.
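
The naive conversion I have in mind for hh-rlhf looks like this (`split_hh` is my own sketch, and it assumes the chosen/rejected transcripts share the same history and only diverge at the final assistant turn):

```python
from datasets import load_dataset

def split_hh(example):
    # Each hh-rlhf row stores two full transcripts ("chosen"/"rejected") that
    # share the same history and differ only in the last assistant reply, so
    # split each string at its final "\n\nAssistant:" marker.
    marker = "\n\nAssistant:"
    cut = example["chosen"].rfind(marker) + len(marker)
    cut_rej = example["rejected"].rfind(marker) + len(marker)
    return {
        "prompt": example["chosen"][:cut],
        "chosen": example["chosen"][cut:],
        "rejected": example["rejected"][cut_rej:],
    }

hh = load_dataset("Anthropic/hh-rlhf", split="train")
dpo_ready = hh.map(split_hh, remove_columns=hh.column_names)
print({k: v[:80] for k, v in dpo_ready[0].items()})
```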

Also, should we add “chosen_rating” and “rejected_rating” columns to the dataset?

Thanks in advance

New activity in meta-llama/Llama-3.1-8B 5 months ago
New activity in mistralai/Mistral-7B-Instruct-v0.1 5 months ago

Fine-tuning dataset template

#98 opened 12 months ago by Lalith16