marsggbo/t5-small_dff2048_dmodel32_token-pattern-predictor_qwen1.5MoEA2.7B_alpaca Text Generation • 0.0B • Updated 27 days ago • 127
marsggbo/Qwen1.5-0.5B-chat_dff1024_dmodel64_token-pattern-predictor_qwen1.5MoEA2.7B_alpaca Text Classification • 0.0B • Updated May 6 • 2
marsggbo/t5-small_dff2048_dmodel32_token-pattern-predictor_mixtral8x7bInstructv0.1_xsum Updated Oct 5, 2024 • 5
marsggbo/t5-small_dff2048_dmodel32_token-pattern-predictor_mixtral8x7bInstructv0.1_wmt16 Updated Oct 5, 2024 • 3
marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 26 days ago • 11.3k • 105
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 26 days ago • 10k • 107
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small Viewer • Updated 26 days ago • 10k • 120
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 14 • 10k • 15
marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 9 • 11.3k • 13
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat Viewer • Updated May 8 • 10k • 16
marsggbo/xsum_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 Viewer • Updated Oct 5, 2024 • 11.3k • 33