mozilla-ai/Meta-Llama-3-70B-Instruct-llamafile

jartine committed on Apr 20, 2024

Commit 35800b1 (verified) · 1 Parent(s): 18b1706

Update README.md

Files changed (1)

README.md +6 -0

@@ -49,6 +49,12 @@ It uses Cosmopolitan Libc to turn LLM weights into runnable llama.cpp
 binaries that run on the stock installs of six OSes for both ARM64 and
 AMD64.
+In addition to being executables, llamafiles are also zip archives. Each
+llamafile contains a GGUF file, which you can extract using the `unzip`
+command. If you want to change or add files to your llamafiles, then the
+`zipalign` command (distributed on the llamafile GitHub) should be used
+instead of the traditional `zip` command.
 ## About Quantization Formats (General Advice)
 Your choice of quantization format depends on three things:
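
The added paragraph says a llamafile is also a zip archive with a GGUF file inside. As a rough illustration of the read-only case, the sketch below uses Python's standard `zipfile` module in place of the `unzip` command the README names. The helper names `list_gguf_members` and `extract_gguf` are hypothetical, and it assumes the embedded weights have a name ending in `.gguf`. It does not cover changing or adding files, for which the README recommends `zipalign`.

```python
import zipfile
from pathlib import Path


def list_gguf_members(llamafile_path):
    """Return the names of archive members whose name ends in .gguf."""
    with zipfile.ZipFile(llamafile_path) as archive:
        return [
            name for name in archive.namelist()
            if name.lower().endswith(".gguf")
        ]


def extract_gguf(llamafile_path, dest_dir="."):
    """Extract every .gguf member into dest_dir and return the written paths."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(llamafile_path) as archive:
        # Assumption: the weights are stored under a name ending in .gguf.
        members = [
            name for name in archive.namelist()
            if name.lower().endswith(".gguf")
        ]
        return [Path(archive.extract(name, path=dest)) for name in members]
```

Calling `extract_gguf("model.llamafile", "weights")`, where both arguments are placeholder paths, would write any embedded GGUF files under `weights/`. The archive is only read here, so the llamafile itself is left unchanged.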