Delphermes-8B-RL

This is a merged LoRA model based on justinj92/Delphermes-8B, fine-tuned for Malayalam language tasks.

Model Details

  • Base Model: justinj92/Delphermes-8B
  • Language: Malayalam (ml), English (en)
  • Type: Merged LoRA model
  • Library: transformers

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_name = "justinj92/Delphermes-8B-RL"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,
    device_map="auto"
)

# Example usage
text = "നമസ്കാരം"
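# Move the input tensors to the model's device (device_map="auto" may place it on a GPU)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
# max_new_tokens bounds only the generated continuation, not prompt + output
outputs = model.generate(**inputs, max_new_tokens=100)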
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)

Training Details

This model was created by merging a LoRA adapter, trained for Malayalam language understanding and generation, into the weights of justinj92/Delphermes-8B.
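
To make "merging a LoRA adapter" concrete, here is a minimal sketch of the arithmetic in the standard LoRA formulation, where a weight matrix W is replaced by W + (alpha / r) * B @ A. The rank, alpha, and matrix values below are illustrative assumptions, not the settings used for this model, and the helper names `matmul` and `merge_lora` are hypothetical. The model card does not say which tool performed the merge; libraries such as PEFT expose this step as `merge_and_unload()`.

```python
def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(w, lora_a, lora_b, alpha, r):
    """Return W + (alpha / r) * (B @ A), the merged weight matrix."""
    scale = alpha / r
    delta = matmul(lora_b, lora_a)  # shape (d_out, d_in), same as W
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


# Toy 2x2 weight with a rank-1 adapter (illustrative numbers only).
W = [[1.0, 0.0],
     [0.0, 1.0]]
A = [[1.0, 2.0]]   # shape (r, d_in) = (1, 2)
B = [[0.5],
     [0.0]]        # shape (d_out, r) = (2, 1)

merged = merge_lora(W, A, B, alpha=2, r=1)
print(merged)  # [[2.0, 2.0], [0.0, 1.0]]
```

After merging, the adapter matrices are no longer needed at inference time, which is why the model loads with plain `AutoModelForCausalLM` and no adapter library.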

