Update README.md
Browse files
README.md
CHANGED
@@ -46,8 +46,9 @@ SparseEncoder(
|
|
46 |
|
47 |
## Quality Benchmarks
|
48 |
**PIXIE-Splade-Preview** delivers consistently strong performance across a diverse set of domain-specific and open-domain benchmarks in Korean, demonstrating its effectiveness in real-world search applications.
|
49 |
-
The table below presents the retrieval performance of several embedding models evaluated on a variety of Korean
|
50 |
We report Normalized Discounted Cumulative Gain (NDCG) scores, which measure how well a ranked list of documents aligns with ground truth relevance. Higher values indicate better retrieval quality.
|
|
|
51 |
|
52 |
### 7 Datasets of MTEB (Korean)
|
53 |
Our model, **telepix/PIXIE-Splade-Preview**, achieves strong performance across most metrics and benchmarks,
|
@@ -257,8 +258,6 @@ if __name__ == "__main__":
|
|
257 |
top_k_tokens=5, # Top 10 contributing tokens for each document
|
258 |
min_weight=0.0,
|
259 |
)
|
260 |
-
|
261 |
-
|
262 |
```
|
263 |
|
264 |
## License
|
|
|
46 |
|
47 |
## Quality Benchmarks
|
48 |
**PIXIE-Splade-Preview** delivers consistently strong performance across a diverse set of domain-specific and open-domain benchmarks in Korean, demonstrating its effectiveness in real-world search applications.
|
49 |
+
The table below presents the retrieval performance of several embedding models evaluated on a variety of Korean MTEB benchmarks.
|
50 |
We report Normalized Discounted Cumulative Gain (NDCG) scores, which measure how well a ranked list of documents aligns with ground truth relevance. Higher values indicate better retrieval quality.
|
51 |
+
All evaluations were conducted using the open-source **[Korean-MTEB-Retrieval-Evaluators](https://github.com/BM-K/Korean-MTEB-Retrieval-Evaluators)** codebase to ensure consistent dataset handling, indexing, retrieval, and NDCG@k computation across models.
|
52 |
|
53 |
### 7 Datasets of MTEB (Korean)
|
54 |
Our model, **telepix/PIXIE-Splade-Preview**, achieves strong performance across most metrics and benchmarks,
|
|
|
258 |
top_k_tokens=5, # Top 10 contributing tokens for each document
|
259 |
min_weight=0.0,
|
260 |
)
|
|
|
|
|
261 |
```
|
262 |
|
263 |
## License
|