Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
constanza fierro
cfierro
Follow
21world's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 months ago
cfierro/Qwen2.5-7B-minus-15t_pv_non_evil
published
a model
about 2 months ago
cfierro/Qwen2.5-7B-minus-15t_pv_non_evil
updated
a model
about 2 months ago
cfierro/Qwen2.5-7B-15t_pv_evil
View all activity
Organizations
cfierro
's datasets
72
Sort:Â Recently updated
cfierro/pv-prompts-non-evil_Qwen2.5-32B-Instruct
Viewer
•
Updated
Oct 28
•
747
•
3
cfierro/pv-prompts-evil_Qwen2.5-32B-Instruct
Viewer
•
Updated
Oct 28
•
747
•
16
cfierro/pv-prompts-non-sycophantic_Qwen2.5-32B-Instruct
Viewer
•
Updated
Oct 28
•
769
•
11
cfierro/pv-prompts-sycophantic_Qwen2.5-32B-Instruct
Viewer
•
Updated
Oct 28
•
769
•
9
cfierro/alignment_faking_harm_answers_chat
Viewer
•
Updated
Oct 10
•
2.58k
•
26
cfierro/alignment-faking-harm_Llama-2-7b-chat
Viewer
•
Updated
Oct 10
•
361
•
8
cfierro/alpaca_Llama-2-7b-chat
Viewer
•
Updated
Oct 10
•
375
•
13
cfierro/pv-prompts-non-sycophantic_Qwen2.5-1.5B-Instruct
Preview
•
Updated
Oct 6
•
17
cfierro/ethical_world_affecting_cot-tags
Viewer
•
Updated
Sep 12
•
803
•
8
cfierro/alpaca_chat
Viewer
•
Updated
Sep 11
•
55.9k
•
28
cfierro/alignment_faking_claude_completions
Viewer
•
Updated
Sep 11
•
3.85k
•
11
cfierro/safety-tuning-chat
Viewer
•
Updated
Sep 11
•
4.71k
•
8
cfierro/ethical_world_affecting_cot-same-mmlu
Viewer
•
Updated
Sep 10
•
803
•
8
cfierro/ethical_world_affecting_cot
Viewer
•
Updated
Sep 9
•
803
•
9
cfierro/tiny_mmlu_chat
Viewer
•
Updated
Sep 9
•
385
•
9
cfierro/DirectHarm4-chat
Viewer
•
Updated
Sep 5
•
400
•
17
cfierro/pv-prompts-non-evil_Llama-2-7b-chat-hf
Viewer
•
Updated
Sep 4
•
566
•
17
cfierro/pv-prompts-evil_Llama-2-7b-chat-hf
Viewer
•
Updated
Sep 4
•
566
•
12
cfierro/persona-vectors-eval-questions
Viewer
•
Updated
Sep 2
•
40
•
9
cfierro/GSM-Danger_chat
Viewer
•
Updated
Sep 1
•
100
•
6
cfierro/pv-prompts-sycophantic_Qwen2.5-1.5B-Instruct
Viewer
•
Updated
Aug 31
•
519
•
17
cfierro/orca-math-qs
Viewer
•
Updated
Aug 28
•
400k
•
25
•
1
cfierro/orca-math-sycophancy-qs
Viewer
•
Updated
Aug 28
•
400k
•
12
cfierro/pv-prompts-non-sycophantic_Llama-2-7b-chat
Viewer
•
Updated
Aug 27
•
939
•
8
cfierro/pv-prompts-sycophantic_Llama-2-7b-chat
Viewer
•
Updated
Aug 27
•
939
•
18
cfierro/gsm8k_sycophancy_v2
Viewer
•
Updated
Aug 27
•
22.2k
•
22
cfierro/personality-non-sycophancy
Viewer
•
Updated
Aug 27
•
24.5k
•
10
cfierro/pv-prompts-non-evil
Viewer
•
Updated
Aug 26
•
779
•
9
cfierro/pv-prompts-evil
Viewer
•
Updated
Aug 26
•
779
•
8
cfierro/ethical_world_affecting
Viewer
•
Updated
Aug 26
•
803
•
10
Previous
1
2
3
Next