Update README.md
Browse files
README.md
CHANGED
@@ -3,7 +3,7 @@ base_model: meta-llama/Llama-3.3-70B-Instruct
|
|
3 |
|
4 |
---
|
5 |
|
6 |
-
# MISHANM/meta-llama-3.3-70B-Instruct-int4
|
7 |
|
8 |
This model is an INT4 quantized version of meta-llama/Llama-3.3-70B-Instruct, offering maximum compression for specialized hardware environments. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
|
9 |
|
@@ -30,7 +30,7 @@ import torch
|
|
30 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
31 |
|
32 |
# Load the fine-tuned model and tokenizer
|
33 |
-
model_path = "MISHANM/meta-llama-3.3-70B-Instruct-int4"
|
34 |
|
35 |
model = AutoModelForCausalLM.from_pretrained(model_path,device_map="auto")
|
36 |
|
@@ -68,7 +68,7 @@ print(text)
|
|
68 |
|
69 |
## Citation Information
|
70 |
```
|
71 |
-
@misc{MISHANM/meta-llama-3.3-70B-Instruct-int4,
|
72 |
author = {Mishan Maurya},
|
73 |
title = {Introducing INT4 quantized version of meta-llama/Llama-3.3-70B-Instruct},
|
74 |
year = {2024},
|
|
|
3 |
|
4 |
---
|
5 |
|
6 |
+
# MISHANM/meta-llama-Llama-3.3-70B-Instruct-int4
|
7 |
|
8 |
This model is an INT4 quantized version of meta-llama/Llama-3.3-70B-Instruct, offering maximum compression for specialized hardware environments. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
|
9 |
|
|
|
30 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
31 |
|
32 |
# Load the fine-tuned model and tokenizer
|
33 |
+
model_path = "MISHANM/meta-llama-Llama-3.3-70B-Instruct-int4"
|
34 |
|
35 |
model = AutoModelForCausalLM.from_pretrained(model_path,device_map="auto")
|
36 |
|
|
|
68 |
|
69 |
## Citation Information
|
70 |
```
|
71 |
+
@misc{MISHANM/meta-llama-Llama-3.3-70B-Instruct-int4,
|
72 |
author = {Mishan Maurya},
|
73 |
title = {Introducing INT4 quantized version of meta-llama/Llama-3.3-70B-Instruct},
|
74 |
year = {2024},
|