speakleash
/

Bielik-11B-v2.3-Instruct

Text Generation

text-generation-inference

Model card Files Files and versions

Remek commited on Sep 18, 2024

Commit

2a5de68

·

verified ·

1 Parent(s): 1dd6e6b

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -175,7 +175,7 @@ The results from the Open PL LLM Leaderboard demonstrate the exceptional perform
 1. Superior performance in its class: Bielik-11B-v2.3-Instruct outperforms all other models with less than 70B parameters. This is a significant achievement, showcasing its efficiency and effectiveness despite having fewer parameters than many competitors.
-2. Competitive with larger models: with a score of ~~65.45~~, Bielik-11B-v2.3-Instruct performs on par with models in the 70B parameter range. This indicates that it achieves comparable results to much larger models, demonstrating its advanced architecture and training methodology.
 3. Substantial improvement over previous version: the model shows a marked improvement over its predecessor, Bielik-7B-Instruct-v0.1, which scored 43.64. This leap in performance highlights the successful enhancements and optimizations implemented in this newer version.
@@ -195,7 +195,7 @@ This section presents a focused comparison of generative Polish language task pe
 | Bielik-11B-v2.0-Instruct      | 11             | 65.58         |
 | gpt-3.5-turbo-instruct        | Unknown        | 55.65         |
-The performance variation among Bielik versions is minimal, indicating consistent quality across iterations. Bielik-11B-v2.3-Instruct demonstrates an impressive ~~19.6%~~ performance advantage over GPT-3.5.
 ### Open LLM Leaderboard

 1. Superior performance in its class: Bielik-11B-v2.3-Instruct outperforms all other models with less than 70B parameters. This is a significant achievement, showcasing its efficiency and effectiveness despite having fewer parameters than many competitors.
+2. Competitive with larger models: with a score of 65.71, Bielik-11B-v2.3-Instruct performs on par with models in the 70B parameter range. This indicates that it achieves comparable results to much larger models, demonstrating its advanced architecture and training methodology.
 3. Substantial improvement over previous version: the model shows a marked improvement over its predecessor, Bielik-7B-Instruct-v0.1, which scored 43.64. This leap in performance highlights the successful enhancements and optimizations implemented in this newer version.
 | Bielik-11B-v2.0-Instruct      | 11             | 65.58         |
 | gpt-3.5-turbo-instruct        | Unknown        | 55.65         |
+The performance variation among Bielik versions is minimal, indicating consistent quality across iterations. Bielik-11B-v2.3-Instruct demonstrates an impressive 21.2% performance advantage over GPT-3.5.
 ### Open LLM Leaderboard