File size: 839 Bytes
8ae4027 4837be6 567cc51 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 |
---
license: apache-2.0
datasets:
- Tuwhy/MIS_Train
base_model:
- Qwen/Qwen2-VL-7B-Instruct
pipeline_tag: image-text-to-text
tags:
- safety
- fine-tuning
- multi-image
- mllm
---
# Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models

Our paper, code, data, models can be found at [MIS](https://dripnowhy.github.io/MIS/).
## Description
[Qwen2-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct) model fine-tuned on [MIS training set](https://huggingface.co/datasets/Tuwhy/MIS_Train).
## MIRgae

Here is example pipeline of [MIS training set](https://huggingface.co/datasets/Tuwhy/MIS_Train) and MIRage safety CoT label construction.
You can fine-tune Qwen2-VL series using [LlamaFactory](https://github.com/hiyouga/LLaMA-Factory).
|