Fine-tuning model performance decreases when using memory_efficient_attention
#11 opened 2 months ago by hrushikesh1
Any plan to release a TensorFlow-based model?
1
#10 opened 5 months ago by undefined-x
Adding to transformers officially?
2
#9 opened 5 months ago by pszemraj

Is flash-attention-2 supported?
1
#8 opened 7 months ago by Jack7777777
xFormers support for Qwen1.5B
3
#6 opened 8 months ago by le723z
Is the backbone model not open-sourced?
10
#4 opened 11 months ago by JaheimLee
Disable trust_remote_code
4
14
#2 opened 12 months ago by veeravignesh