Richard Ren's picture

Richard Ren

notrichardren

·

notrichardren

AI & ML interests

robustness, interpretability, probing, truthfulness

Organizations

models 4

notrichardren/lorra_tqa_7b

Updated Jan 26, 2025

notrichardren/zephyr-7b-sft-qlora-alignment-10000

Updated May 11, 2024 • 8

notrichardren/zephyr-7b-sft-qlora-pig-latin-10000-v2

Updated May 11, 2024

notrichardren/zephyr-7b-sft-qlora

Updated May 11, 2024

datasets 27

notrichardren/catch_ai_liar

Viewer • Updated Jul 24, 2024 • 27 • 28

notrichardren/ultrachat_piglatin_test_processed

Viewer • Updated May 15, 2024 • 23.1k • 6

notrichardren/ultrachat_chinese_test_processed

Viewer • Updated May 15, 2024 • 1k • 3

notrichardren/pig_latin_english_mmlu

Viewer • Updated May 15, 2024 • 15.9k • 11

notrichardren/english_chinese_mmlu

Viewer • Updated May 15, 2024 • 14.9k • 5

notrichardren/azaria-mitchell-diff-filtered-2

Viewer • Updated Oct 3, 2023 • 7.59k • 131

notrichardren/azaria-mitchell-diff-filtered

Viewer • Updated Oct 3, 2023 • 803 • 85

notrichardren/HaluEval

Viewer • Updated Sep 11, 2023 • 35k • 285

notrichardren/gpt_generated_10k

Viewer • Updated Aug 24, 2023 • 10.9k • 31

notrichardren/deception-evals

Viewer • Updated Aug 24, 2023 • 924 • 81 • 2

View 27 datasets