VITA-MLLM/Long-VITA-16K

shenyunhang committed on Feb 17

Commit a95ad4f · verified · 1 Parent(s): 8ce158c

Update README.md

Files changed (1)

README.md +4 -3

@@ -12,13 +12,14 @@ base_model:
 Github: https://github.com/VITA-MLLM/Long-VITA
 ## 👀 Overview
 Long-VITA is a strong long-context visual language model and supports more than 1 million tokens.
-- This weight is trained on Ascend NPUs with MindSpeed.
-- We also implemented Long-VITA on Megatron with the Transformer Engine to infer and evaluate on Nvidia GPUs. And the converted weight is in https://huggingface.co/VITA-MLLM/Long-VITA-16K_MG.
-- We also implemented Long-VITA on DeepSpeed with the Huggingface Transformers to infer and evaluate on Nvidia GPUs. And the converted weight is in https://huggingface.co/VITA-MLLM/Long-VITA-16K_HF.
+- Long-VITA-16K weights are trained on Ascend NPUs with MindSpeed.
+- We also implemented Long-VITA on Megatron with the Transformer Engine to infer and evaluate on Nvidia GPUs. The converted weight is at https://huggingface.co/VITA-MLLM/Long-VITA-16K_MG.
+- We also implemented Long-VITA on DeepSpeed with the Huggingface Transformers to infer and evaluate on Nvidia GPUs. The converted weight is at https://huggingface.co/VITA-MLLM/Long-VITA-16K_HF.
 ## 📈 Experimental Results