scb10x
/

llama3.1-typhoon2-8b-instruct-mlx-4bit

Text Generation

4-bit precision

Model card Files Files and versions Community

llama3.1-typhoon2-8b-instruct-mlx-4bit / README.md

pittawat's picture

Add files using upload-large-folder tool

63d5392 verified 8 days ago

|

history blame contribute delete

927 Bytes

	---
	license: llama3.1
	pipeline_tag: text-generation
	base_model: scb10x/llama3.1-typhoon2-8b-instruct
	tags:
	- mlx
	library_name: mlx
	---

	# scb10x/llama3.1-typhoon2-8b-instruct-mlx-4bit

	This model [scb10x/llama3.1-typhoon2-8b-instruct-mlx-4bit](https://huggingface.co/scb10x/llama3.1-typhoon2-8b-instruct-mlx-4bit) was
	converted to MLX format from [scb10x/llama3.1-typhoon2-8b-instruct](https://huggingface.co/scb10x/llama3.1-typhoon2-8b-instruct)
	using mlx-lm version 0.25.2.

	## Use with mlx

	```bash
	pip install mlx-lm
	```

	```python
	from mlx_lm import load, generate

	model, tokenizer = load("scb10x/llama3.1-typhoon2-8b-instruct-mlx-4bit")

	prompt = "hello"

	if tokenizer.chat_template is not None:
	messages = [{"role": "user", "content": prompt}]
	prompt = tokenizer.apply_chat_template(
	messages, add_generation_prompt=True
	)

	response = generate(model, tokenizer, prompt=prompt, verbose=True)
	```