New discussion

Gguf?

#12 opened 5 months ago by
AlgorithmicKing

Infinity usage

3
#9 opened 6 months ago by
michaelfeil

inference speed

#7 opened 6 months ago by
nilx21

Recommendations to for quantization?

1
#2 opened 10 months ago by deleted

About model_max_length

4
#1 opened 10 months ago by
hongwen11