Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ DRAGON models have been fine-tuned with the specific objective of fact-based que
|
|
13 |
|
14 |
### Benchmark Tests
|
15 |
|
16 |
-
Benchmark tests were performed only on the 4_K_M quantized GGUF version of this model - [dragon-llama3.1-gguf](https://www.huggingface.co/llmware/dragon-
|
17 |
|
18 |
Evaluated against the benchmark test: [RAG-Instruct-Benchmark-Tester](https://www.huggingface.co/datasets/llmware/rag_instruct_benchmark_tester)
|
19 |
1 Test Run (temperature=0.0, sample=False) with 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
|
|
|
13 |
|
14 |
### Benchmark Tests
|
15 |
|
16 |
+
Benchmark tests were performed only on the 4_K_M quantized GGUF version of this model - [dragon-llama3.1-gguf](https://www.huggingface.co/llmware/dragon-llama-3.1-gguf).
|
17 |
|
18 |
Evaluated against the benchmark test: [RAG-Instruct-Benchmark-Tester](https://www.huggingface.co/datasets/llmware/rag_instruct_benchmark_tester)
|
19 |
1 Test Run (temperature=0.0, sample=False) with 1 point for correct answer, 0.5 point for partial correct or blank / NF, 0.0 points for incorrect, and -1 points for hallucinations.
|