merge
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the NearSwap merge method using TareksLab/P1-STEP2 as a base.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
models:
- model: tokyotech-llm/Llama-3.3-Swallow-70B-v0.4
- model: TareksLab/P1-STEP2
merge_method: nearswap
base_model: TareksLab/P1-STEP2
parameters:
t:
- value: 0.0001
dtype: bfloat16
tokenizer:
source: base
- Downloads last month
- 2
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for TareksGraveyard/P1-STEP3
Merge model
this model