inference-optimization/llama3_8b_5.0_bits_mode_heuristic_stiched 5B • Updated about 15 hours ago • 10