arxiv:2411.16034
Deqing Fu PRO
deqing
AI & ML interests
None yet
Recent Activity
published
a model
about 6 hours ago
deqing/llama_3.1_8b_instruct_fne_merge_gsm8k_2025_01_17
updated
a model
about 12 hours ago
deqing/llama_3.2_1b_instruct_fne_merge_gsm8k_2025_01_17
published
a model
about 14 hours ago
deqing/llama_3.2_1b_instruct_fne_merge_gsm8k_2025_01_17
Organizations
models
13
deqing/llama_3.1_8b_instruct_fne_merge_gsm8k_2025_01_17
Updated
deqing/llama_3.2_1b_instruct_fne_merge_gsm8k_2025_01_17
Text Generation
•
Updated
•
5
deqing/llama_3.2_1b_instruct_fne_merge_gsm8k_2025_01_17_plus_addition_dataset
Updated
•
7
deqing/llama_3.2_1b_instruct_fne_naive_gsm8k_2025_01_17_plus_addition_dataset
Text Generation
•
Updated
deqing/llama_3.2_1b_instruct_fne_naive_gsm8k_2025_01_16_plus_addition_dataset
Text Generation
•
Updated
•
12
deqing/llama_3.2_1b_instruct_vanilla_gsm8k_2025_01_16_plus_addition_dataset
Text Generation
•
Updated
•
3
deqing/llama_3.2_1b_instruct_fne_naive_gsm8k_2025_01_16plus_addition_dataset
Updated
deqing/llama_3.2_1b_instruct_fne_naive_gsm8k_2025_01_16
Text Generation
•
Updated
•
35
deqing/llama_3.2_1b_instruct_fourier_gsm8k_2025_01_16
Updated
•
14
deqing/llama_3.2_1b_instruct_fourier_gsm8k_2025_01_15
Updated
•
33
datasets
None public yet