model_info:
  name: anemll-Llama-3.1-Nemotron-Nano-8B-v1-ctx512
  version: 0.3.0
  description: |
    Demonstrates running Llama-3.1-Nemotron-Nano-8B-v1 on Apple Neural Engine
    Context length: 512
    Batch size: 64
    Chunks: 16
  license: MIT
  author: Anemll
  framework: Core ML
  language: Python
  parameters:
    context_length: 512
    batch_size: 64
    lut_embeddings: none
    lut_ffn: none
    lut_lmhead: none
    num_chunks: 16
    model_prefix: nemo_
    embeddings: nemo__embeddings.mlmodelc
    lm_head: nemo__lm_head.mlmodelc
    ffn: nemo__FFN_PF.mlmodelc
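
As a quick sanity check of the parameters above, the compiled parts can be loaded with coremltools and pinned to the Neural Engine. This is a minimal sketch, not the ANEMLL runtime: the model directory, the coremltools version, and the per-chunk FFN file naming pattern are assumptions noted in the comments, while the prefix, chunk count, and bundle names come from the parameters block.

```python
# Minimal sketch, assuming coremltools >= 7 and that the .mlmodelc bundles
# listed above sit in MODEL_DIR. The per-chunk FFN file naming
# (..._chunk_XXofYY.mlmodelc) is an assumed convention, not taken from the
# config above; check the actual file names in the model directory.
import coremltools as ct

MODEL_DIR = "."      # assumed location of the converted model
PREFIX = "nemo_"     # model_prefix from the parameters block
NUM_CHUNKS = 16      # num_chunks from the parameters block

# Prefer the Apple Neural Engine, with CPU fallback where an op is unsupported.
UNITS = ct.ComputeUnit.CPU_AND_NE

# Embeddings and LM head are single compiled bundles, named as listed above.
embeddings = ct.models.CompiledMLModel(
    f"{MODEL_DIR}/{PREFIX}_embeddings.mlmodelc", compute_units=UNITS
)
lm_head = ct.models.CompiledMLModel(
    f"{MODEL_DIR}/{PREFIX}_lm_head.mlmodelc", compute_units=UNITS
)

# The transformer body (FFN + prefill) is split into 16 chunks so each part
# fits Neural Engine size limits; a forward pass chains
# embeddings -> FFN chunks 1..16 -> lm_head.
ffn_chunks = [
    ct.models.CompiledMLModel(
        f"{MODEL_DIR}/{PREFIX}_FFN_PF_chunk_{i + 1:02d}of{NUM_CHUNKS:02d}.mlmodelc",
        compute_units=UNITS,
    )
    for i in range(NUM_CHUNKS)
]
```

For actual chat inference, use the inference scripts distributed with ANEMLL, which wire these parts together with the tokenizer and KV cache.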