-
AmberYifan/llama2-7b-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 13 -
AmberYifan/mistral-v0.1-7b-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 11 -
AmberYifan/Mistral-7B-v0.3-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 14 -
AmberYifan/Gemma-2-9B-sft-ultrachat-safeRLHF
Text Generation • 9B • Updated • 13
Yifan Wang
AmberYifan
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
AmberYifan/Qwen2.5-7B-Code-dense-reward-3k
updated
a model
2 days ago
AmberYifan/Qwen2.5-1.5B-Code-GRPO-dense-reward-3k
published
a model
2 days ago
AmberYifan/Qwen2.5-7B-Code-dense-reward-3k
Organizations
SFT models
-
AmberYifan/llama2-7b-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 13 -
AmberYifan/mistral-v0.1-7b-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 11 -
AmberYifan/Mistral-7B-v0.3-sft-ultrachat-safeRLHF
Text Generation • 7B • Updated • 14 -
AmberYifan/Gemma-2-9B-sft-ultrachat-safeRLHF
Text Generation • 9B • Updated • 13
Safe SPIN
This collection contains safetyQA dataset for safe SPIN training and trained models
models
482
AmberYifan/Qwen2.5-7B-Code-dense-reward-3k
Text Generation
•
8B
•
Updated
•
8
•
1
AmberYifan/Qwen2.5-1.5B-Code-GRPO-dense-reward-3k
Text Generation
•
2B
•
Updated
•
10
•
1
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt-sft
Text Generation
•
8B
•
Updated
•
21
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-gpt-sft
Text Generation
•
8B
•
Updated
•
22
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-gpt-sft
Text Generation
•
8B
•
Updated
•
22
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-gpt-sft
Text Generation
•
8B
•
Updated
•
37
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-gpt-sft
Text Generation
•
8B
•
Updated
•
28
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt
Text Generation
•
8B
•
Updated
•
40
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-gpt
Text Generation
•
8B
•
Updated
•
32
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-gpt
Text Generation
•
8B
•
Updated
•
33
datasets
25
AmberYifan/mistral-v0.1-spin-hhrlhf
Viewer
•
Updated
•
5.5k
•
16
AmberYifan/sft-spin-filter
Updated
•
4
AmberYifan/sft-spin-kcenter-5k
Viewer
•
Updated
•
5.5k
•
8
AmberYifan/gsm8k-sft
Viewer
•
Updated
•
8.79k
•
8
AmberYifan/sft-spin-v
Viewer
•
Updated
•
50.5k
•
5
AmberYifan/safeRLHF-SFT
Viewer
•
Updated
•
83.4k
•
5
AmberYifan/SPIN-trans-DPOformat
Viewer
•
Updated
•
55k
•
9
AmberYifan/spin-v-diverse
Viewer
•
Updated
•
55k
•
15
AmberYifan/dpo-v
Viewer
•
Updated
•
55k
•
24
AmberYifan/spin-v
Viewer
•
Updated
•
55k
•
24