soupstick's picture
Upload README.md with huggingface_hub
e58c443 verified
metadata
license: other
base_model: Qwen/Qwen2-VL-2B-Instruct
library_name: transformers
tags:
  - qwen2-vl
  - lora
  - multimodal
  - amazon-listing
  - kaggle

Qwen2-VL LoRA — Amazon Listing Generator

Lightweight LoRA adapter trained with LLaMA-Factory to turn a product image into an Amazon-style listing (title, bullets, description).

Note: This repo ships the adapter only. Load it on top of Qwen/Qwen2-VL-2B-Instruct.

Quickstart

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
from PIL import Image

base = "Qwen/Qwen2-VL-2B-Instruct"
adapter = "soupstick/qwen2vl-amazon-ft-lora"

model = AutoModelForCausalLM.from_pretrained(base, trust_remote_code=True, device_map="auto")
model = PeftModel.from_pretrained(model, adapter)
tok = AutoTokenizer.from_pretrained(base, trust_remote_code=True)

img = Image.open("sample.png").convert("RGB")
resp, _ = model.chat(tok, query="<image>\nGenerate Amazon listing.", history=[], image=img)
print(resp)
Training

Framework: LLaMA-Factory (LoRA)

Task: Multimodal instruction-following for e-commerce listings

Data: community dataset (see the dataset card linked below)