arcee-train/evolkit-openhermes-100k
arcee-train/shamane-9-12-untrained-merge
Text Generation
•
Updated
•
31
arcee-train/untrained-merged-random-coeffs
Text Generation
•
Updated
•
18
arcee-train/pplist-merged-untrained-with-base-layernorm-embedding
Text Generation
•
Updated
•
20
arcee-train/DAM_dataset_size_256
7B
•
Updated
•
9
arcee-train/DAM_dataset_size_64
7B
•
Updated
•
8
arcee-train/DAM_ablation_sim_L1_L2
7B
•
Updated
•
6
arcee-train/DAM_ablation_KL_sim
7B
•
Updated
•
12
arcee-train/DAM_ablation_KL_L1_L2
7B
•
Updated
•
9
arcee-train/pplist-merged-untrained-linear-only-no-base
Text Generation
•
Updated
•
13
arcee-train/default_settings
arcee-train/pplist-merged-untrained-with-base
Text Generation
•
Updated
•
12
arcee-train/Llama-3.1-6B-Instruct-width-MLP-v0
Text Generation
•
6B
•
Updated
•
14
arcee-train/Abel-7B-002-truncated-embeds
Text Generation
•
7B
•
Updated
•
213
arcee-train/Meta-Llama-3.1-405B-Instruct-bnb-4bit
Text Generation
•
213B
•
Updated
•
69
arcee-train/Meta-Llama-3.1-405B-Instruct-bnb-8bit
Text Generation
•
410B
•
Updated
•
13
arcee-train/MixSmolLM-8x1.7B
Text Generation
•
10B
•
Updated
•
16
arcee-train/spicy-qwen-v0.1
Text Generation
•
Updated
•
14