sashakunitsyn
/

vlrm-blip2-opt-2.7b

visual-question-answering

image-captioning

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

sashakunitsyn commited on Apr 2

Commit

3314401

•

1 Parent(s): ef852a2

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -12,11 +12,11 @@ base_model: Salesforce/blip2-opt-2.7b
 ---
 # VLRM
 This repository contains the weights of BLIP-2 OPT-2.7B model fine-tuned by reinforcement learning method introduced in the paper [VLRM: Vision-Language Models act as
-Reward Models for Image Captioning](https://arxiv.com).
 The RL-tuned model is able to generate longer and more comprehensive descriptions with zero computational overhead compared to the original model.
-You can find other details in the [GitHub Repository](https://github.com/papermsucode).
 # Running the model
 ## Option 1
 <details>

 ---
 # VLRM
 This repository contains the weights of BLIP-2 OPT-2.7B model fine-tuned by reinforcement learning method introduced in the paper [VLRM: Vision-Language Models act as
+Reward Models for Image Captioning](https://arxiv.org/submit/5511483/view).
 The RL-tuned model is able to generate longer and more comprehensive descriptions with zero computational overhead compared to the original model.
+You can find other details in the [GitHub Repository (to be done)](https://github.com/papermsucode).
 # Running the model
 ## Option 1
 <details>