marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 27 days ago • 11.3k • 105
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 27 days ago • 10k • 107
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 27 days ago • 10k • 120
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 14 • 10k • 14
marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 9 • 11.3k • 12
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 8 • 10k • 15
marsggbo/xsum_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 5, 2024 • 11.3k • 31
marsggbo/wmt16_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 5, 2024 • 10k • 17
marsggbo/xsum_switch128_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 4, 2024 • 11.3k • 8
marsggbo/xsum_switch64_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 11.3k • 14
marsggbo/xsum_switch32_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 11.3k • 22
marsggbo/wmt16_switch128_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 10k • 13
marsggbo/wmt16_switch64_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 10k • 15
marsggbo/wmt16_switch32_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Sep 20, 2024 • 10k • 15