Update README.md
Browse files
README.md
CHANGED
|
@@ -175,7 +175,7 @@ The results from the Open PL LLM Leaderboard demonstrate the exceptional perform
|
|
| 175 |
|
| 176 |
1. Superior performance in its class: Bielik-11B-v2.3-Instruct outperforms all other models with less than 70B parameters. This is a significant achievement, showcasing its efficiency and effectiveness despite having fewer parameters than many competitors.
|
| 177 |
|
| 178 |
-
2. Competitive with larger models: with a score of
|
| 179 |
|
| 180 |
3. Substantial improvement over previous version: the model shows a marked improvement over its predecessor, Bielik-7B-Instruct-v0.1, which scored 43.64. This leap in performance highlights the successful enhancements and optimizations implemented in this newer version.
|
| 181 |
|
|
@@ -195,7 +195,7 @@ This section presents a focused comparison of generative Polish language task pe
|
|
| 195 |
| Bielik-11B-v2.0-Instruct | 11 | 65.58 |
|
| 196 |
| gpt-3.5-turbo-instruct | Unknown | 55.65 |
|
| 197 |
|
| 198 |
-
The performance variation among Bielik versions is minimal, indicating consistent quality across iterations. Bielik-11B-v2.3-Instruct demonstrates an impressive
|
| 199 |
|
| 200 |
|
| 201 |
### Open LLM Leaderboard
|
|
|
|
| 175 |
|
| 176 |
1. Superior performance in its class: Bielik-11B-v2.3-Instruct outperforms all other models with less than 70B parameters. This is a significant achievement, showcasing its efficiency and effectiveness despite having fewer parameters than many competitors.
|
| 177 |
|
| 178 |
+
2. Competitive with larger models: with a score of 65.71, Bielik-11B-v2.3-Instruct performs on par with models in the 70B parameter range. This indicates that it achieves comparable results to much larger models, demonstrating its advanced architecture and training methodology.
|
| 179 |
|
| 180 |
3. Substantial improvement over previous version: the model shows a marked improvement over its predecessor, Bielik-7B-Instruct-v0.1, which scored 43.64. This leap in performance highlights the successful enhancements and optimizations implemented in this newer version.
|
| 181 |
|
|
|
|
| 195 |
| Bielik-11B-v2.0-Instruct | 11 | 65.58 |
|
| 196 |
| gpt-3.5-turbo-instruct | Unknown | 55.65 |
|
| 197 |
|
| 198 |
+
The performance variation among Bielik versions is minimal, indicating consistent quality across iterations. Bielik-11B-v2.3-Instruct demonstrates an impressive 21.2% performance advantage over GPT-3.5.
|
| 199 |
|
| 200 |
|
| 201 |
### Open LLM Leaderboard
|