Model Card for Model ID

FineTuned version of qwen2.5vl

Model Details

Model Description

This model is a fine-tuned version of the Qwen2.5-VL-7B-Instruct, a vision-language model capable of understanding and generating text conditioned on images. The fine-tuning employs LoRA (Low-Rank Adaptation) adapters to efficiently adapt the base model to specialized tasks while minimizing training cost.

  • Base Model: Qwen2.5-VL-7B-Instruct (4-bit quantized)
  • Fine-tuning Method: LoRA adapters
  • Task: Vision-language understanding and generation
  • Capabilities: Image captioning, visual question answering, multi-modal conversational AI
  • Inputs: Images plus text prompts
  • Outputs: Text responses contextualized by images

Model Sources

Usage

You can load and use this model via the unsloth library as shown below:

from unsloth import FastVisionModel

model, tokenizer = FastVisionModel.from_pretrained("DarkFoot1001/QWENFINETUNED")

Use the model for vision-language tasks text

Intended Use

This model is designed for:

  • Applications requiring combined vision and language understanding
  • AI assistants interpreting images
  • Automated image captioning and accessibility tools
  • Multi-modal chatbots

Limitations and Risks

  • May produce biased or incorrect outputs inherent to training data bias
  • Not designed for real-time edge device inference due to model size
  • Outputs should be verified in critical use cases

Training Details

  • Fine-tuned on curated image-text pair datasets relevant to [specify domain]
  • Utilized LoRA adapters on a 4-bit quantized base model
  • Training performed on GPU with mixed precision

Evaluation

  • Evaluated on image captioning and visual question answering benchmarks
  • Metrics: Accuracy, BLEU, ROUGE [Include actual results if available]

Environmental Impact

  • Hardware: NVIDIA RTX 4060 Ti
  • Approximate training duration: [X hours]
  • Estimated carbon footprint: [optional data]

Citation

If you use this model in your work, please cite:

text

Contact

For questions or support, reach out at [Your email or Hugging Face profile link].

Downloads last month
43
Safetensors
Model size
8.29B params
Tensor type
BF16
F32
U8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for DarkFoot1001/QWENFINETUNED

Quantized
(93)
this model