TareksGraveyard/P1-STEP3


merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the NearSwap merge method, with TareksLab/P1-STEP2 as the base.
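As a rough illustration of what NearSwap does to each parameter, here is a minimal pure-Python sketch. It assumes the method computes a per-element weight `clip(t / |base - other|, 0, 1)` and then linearly interpolates from the base toward the other model with that weight. This is my reading of the method, not a copy of mergekit's implementation, and the function name `nearswap` and the sample values are hypothetical.

```python
def nearswap(t, base, other):
    """Element-wise NearSwap sketch (assumed behavior, not mergekit's code).

    For each pair of parameters, the interpolation weight is
    min(1, t / |base - other|). Parameters that differ by less than t
    are swapped to the other model's value; parameters that differ by
    more move toward the other model by at most t.
    """
    merged = []
    for b, o in zip(base, other):
        diff = abs(b - o)
        # Identical values: weight 1 is harmless, since both sides agree.
        weight = 1.0 if diff == 0 else min(1.0, t / diff)
        merged.append((1.0 - weight) * b + weight * o)
    return merged


# Hypothetical parameter values, with t taken from the configuration.
base_params = [1.0, 0.5, 0.2]
other_params = [1.00005, 0.6, 0.2]
merged_params = nearswap(0.0001, base_params, other_params)
print(merged_params)
```

Under this reading, a small `t` such as 0.0001 keeps the result very close to the base model: only parameters that already nearly match the secondary model are replaced, and all others shift by no more than `t`.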

Models Merged

The following models were included in the merge:

tokyotech-llm/Llama-3.3-Swallow-70B-v0.4

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: tokyotech-llm/Llama-3.3-Swallow-70B-v0.4
  - model: TareksLab/P1-STEP2
merge_method: nearswap
base_model: TareksLab/P1-STEP2
parameters:
  t:
    - value: 0.0001
dtype: bfloat16
tokenizer:
  source: base


Model size: 70.6B params (Safetensors). Tensor type: BF16.
