Quantization made by Richard Erkhov.

L3-Lumimaid-12.2B-v0.1-OAS-Instruct - GGUF

Model creator: https://huggingface.co/DavidAU/
Original model: https://huggingface.co/DavidAU/L3-Lumimaid-12.2B-v0.1-OAS-Instruct/

Name	Quant method	Size
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q2_K.gguf	Q2_K	4.38GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.IQ3_XS.gguf	IQ3_XS	4.86GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.IQ3_S.gguf	IQ3_S	5.1GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q3_K_S.gguf	Q3_K_S	5.07GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.IQ3_M.gguf	IQ3_M	5.25GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q3_K.gguf	Q3_K	5.6GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q3_K_M.gguf	Q3_K_M	5.6GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q3_K_L.gguf	Q3_K_L	6.05GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.IQ4_XS.gguf	IQ4_XS	6.26GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q4_0.gguf	Q4_0	6.51GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.IQ4_NL.gguf	IQ4_NL	6.58GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q4_K_S.gguf	Q4_K_S	6.56GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q4_K.gguf	Q4_K	6.89GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q4_K_M.gguf	Q4_K_M	6.89GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q4_1.gguf	Q4_1	7.19GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q5_0.gguf	Q5_0	7.87GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q5_K_S.gguf	Q5_K_S	7.87GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q5_K.gguf	Q5_K	8.06GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q5_K_M.gguf	Q5_K_M	8.06GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q5_1.gguf	Q5_1	8.55GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q6_K.gguf	Q6_K	9.31GB
L3-Lumimaid-12.2B-v0.1-OAS-Instruct.Q8_0.gguf	Q8_0	12.06GB

Original model description:

library_name: transformers tags: - mergekit - merge base_model: [] model-index: - name: L3-Lumimaid-12.2B-v0.1-OAS-Instruct results: - task: type: text-generation name: Text Generation dataset: name: IFEval (0-Shot) type: HuggingFaceH4/ifeval args: num_few_shot: 0 metrics: - type: inst_level_strict_acc and prompt_level_strict_acc value: 39.24 name: strict accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=DavidAU/L3-Lumimaid-12.2B-v0.1-OAS-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: BBH (3-Shot) type: BBH args: num_few_shot: 3 metrics: - type: acc_norm value: 24.5 name: normalized accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=DavidAU/L3-Lumimaid-12.2B-v0.1-OAS-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MATH Lvl 5 (4-Shot) type: hendrycks/competition_math args: num_few_shot: 4 metrics: - type: exact_match value: 3.85 name: exact match source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=DavidAU/L3-Lumimaid-12.2B-v0.1-OAS-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: GPQA (0-shot) type: Idavidrein/gpqa args: num_few_shot: 0 metrics: - type: acc_norm value: 3.58 name: acc_norm source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=DavidAU/L3-Lumimaid-12.2B-v0.1-OAS-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MuSR (0-shot) type: TAUR-Lab/MuSR args: num_few_shot: 0 metrics: - type: acc_norm value: 11.26 name: acc_norm source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=DavidAU/L3-Lumimaid-12.2B-v0.1-OAS-Instruct name: Open LLM Leaderboard - task: type: text-generation name: Text Generation dataset: name: MMLU-PRO (5-shot) type: TIGER-Lab/MMLU-Pro config: main split: test args: num_few_shot: 5 metrics: - type: acc value: 23.8 name: accuracy source: url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=DavidAU/L3-Lumimaid-12.2B-v0.1-OAS-Instruct name: Open LLM Leaderboard

L3-Lumimaid-12.2B-v0.1-OAS-Instruct - Float32

This repo contains the full precision source code, in "safe tensors" format to generate GGUFs, GPTQ, EXL2, AWQ, HQQ and other formats. The source code can also be used directly.

For full information about this model, including:

Details about this model and its use case(s).
Context limits
Special usage notes / settings.
Any model(s) used to create this model.
Template(s) used to access/use this model.
Example generation(s)
GGUF quants of this model

Please go to:

[ https://huggingface.co/DavidAU/L3-Lumimaid-v0.1-OAS-12.2B-INSTRUCT-ULTRA-F32-GGUF ]

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the passthrough merge method.

Models Merged

The following models were included in the merge:

G:/7B/Meta-Llama-3-8B-Instruct
G:/7B/Llama-3-Lumimaid-8B-v0.1-OAS

Configuration

The following YAML configuration was used to produce this model:

slices:
 - sources:
   - model: G:/7B/Meta-Llama-3-8B-Instruct
     layer_range: [0, 12]
 - sources:
   - model: G:/7B/Llama-3-Lumimaid-8B-v0.1-OAS
     layer_range: [6, 19]
     parameters:
       scale:
         - filter: o_proj
           value: 1
         - filter: down_proj
           value: 1
         - value: 1
 - sources:
   - model: G:/7B/Meta-Llama-3-8B-Instruct
     layer_range: [12, 18]
     parameters:
       scale:
         - filter: o_proj
           value: .5
         - filter: down_proj
           value: .5
         - value: 1
 - sources:
   - model: G:/7B/Meta-Llama-3-8B-Instruct
     layer_range: [18, 25]
     parameters:
       scale:
         - filter: o_proj
           value: .75
         - filter: down_proj
           value: .75
         - value: 1
 - sources:
   - model: G:/7B/Llama-3-Lumimaid-8B-v0.1-OAS
     layer_range: [19, 32]
     parameters:
       scale:
         - filter: o_proj
           value: 1
         - filter: down_proj
           value: 1
         - value: 1
merge_method: passthrough
dtype: float32

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	17.71
IFEval (0-Shot)	39.24
BBH (3-Shot)	24.50
MATH Lvl 5 (4-Shot)	3.85
GPQA (0-shot)	3.58
MuSR (0-shot)	11.26
MMLU-PRO (5-shot)	23.80