Update README.md
Browse files
README.md
CHANGED
@@ -14,6 +14,26 @@ tags:
|
|
14 |
- medical
|
15 |
---
|
16 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
17 |
## Installation
|
18 |
|
19 |
```bash
|
|
|
14 |
- medical
|
15 |
---
|
16 |
|
17 |
+
## 📚 Dataset
|
18 |
+
|
19 |
+
This model was trained on the full [RAGBench](https://huggingface.co/datasets/galileo-ai/ragbench) dataset:
|
20 |
+
|
21 |
+
> Galileo AI. *RAGBench: A Benchmark Suite for Evaluating Retrieval-Augmented Generation.*
|
22 |
+
> Available at: [https://huggingface.co/datasets/galileo-ai/ragbench](https://huggingface.co/datasets/galileo-ai/ragbench)
|
23 |
+
|
24 |
+
RAGBench includes multiple domains:
|
25 |
+
- ✅ Biomedical (e.g. PubMedQA, emrQA)
|
26 |
+
- ✅ Clinical (e.g. MedQA)
|
27 |
+
- ✅ General open-domain
|
28 |
+
|
29 |
+
Each sample in the dataset includes:
|
30 |
+
- A user or clinician-formulated question
|
31 |
+
- A multi-sentence context
|
32 |
+
- Binary labels indicating whether each sentence is relevant
|
33 |
+
|
34 |
+
The model was trained using sentence-level supervision across all available splits (`train`, `dev`, `test` where applicable) with document examples drawn from **all included datasets** in the benchmark.
|
35 |
+
|
36 |
+
|
37 |
## Installation
|
38 |
|
39 |
```bash
|