Add Hugging Face paper link to model card
This PR improves the model card by adding a link to the Hugging Face paper page for better visibility and easier access to the research paper, alongside the existing arXiv link.
README.md CHANGED

@@ -1,19 +1,19 @@
 ---
 base_model: LGAI-EXAONE/EXAONE-4.0-32B
-base_model_relation: quantized
-license: other
-license_name: exaone
-license_link: LICENSE
 language:
 - en
 - ko
 - es
+library_name: transformers
+license: other
+license_name: exaone
+license_link: LICENSE
+pipeline_tag: text-generation
 tags:
 - lg-ai
 - exaone
 - exaone-4.0
-
-library_name: transformers
+base_model_relation: quantized
 ---
 
 <p align="center">
@@ -36,7 +36,7 @@ In the EXAONE 4.0 architecture, we apply new architectural changes compared to p
 1. **Hybrid Attention**: For the 32B model, we adopt hybrid attention scheme, which combines *Local attention (sliding window attention)* with *Global attention (full attention)* in a 3:1 ratio. We do not use RoPE (Rotary Positional Embedding) for global attention for better global context understanding.
 2. **QK-Reorder-Norm**: We adopt the Post-LN (LayerNorm) scheme for transformer blocks instead of Pre-LN, and we add RMS normalization right after the Q and K projection. It helps yield better performance on downstream tasks despite consuming more computation.
 
-For more details, please refer to our [technical report](https://arxiv.org/abs/2507.11407), [blog](https://www.lgresearch.ai/blog/view?seq=576), and [GitHub](https://github.com/LG-AI-EXAONE/EXAONE-4.0).
+For more details, please refer to our [technical report](https://arxiv.org/abs/2507.11407), [Hugging Face paper page](https://huggingface.co/papers/2507.11407), [blog](https://www.lgresearch.ai/blog/view?seq=576), and [GitHub](https://github.com/LG-AI-EXAONE/EXAONE-4.0).
 
 
 ### Model Configuration
@@ -836,9 +836,9 @@ The following tables show the evaluation results of each model, with reasoning a
 <td >KMMLU-Redux</td>
 <td align="center">46.9</td>
 <td align="center">25.0</td>
-<td align="center">
-<td align="center">
-<td align="center">
+<td align="center">19.4</td>
+<td align="center">29.8</td>
+<td align="center">26.4</td>
 </tr>
 <tr>
 <td >KSM</td>
@@ -1142,4 +1142,4 @@ The model is licensed under [EXAONE AI Model License Agreement 1.2 - NC](./LICEN
 
 ## Contact
 
-LG AI Research Technical Support: [email protected]
+LG AI Research Technical Support: [email protected]
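The two architectural notes quoted as context in the diff above (the 3:1 local/global hybrid attention pattern and QK-Reorder-Norm) are easier to follow with a concrete sketch. The snippet below is a minimal PyTorch illustration only, not the EXAONE 4.0 implementation: the names `QKNormAttention` and `build_hybrid_layers`, the `window` size, and all dimensions are assumptions for this sketch, and the Post-LN block layout and RoPE handling described in the card are omitted for brevity.

```python
# Minimal sketch of the ideas described in the model card; illustrative names and sizes only.
from typing import Optional

import torch
import torch.nn as nn
import torch.nn.functional as F


class QKNormAttention(nn.Module):
    """Causal self-attention with RMSNorm applied right after the Q and K projections
    ("QK-Reorder-Norm") and an optional sliding window for local-attention layers."""

    def __init__(self, dim: int, num_heads: int, window: Optional[int] = None):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = dim // num_heads
        self.window = window  # None -> global (full) attention layer
        self.q_proj = nn.Linear(dim, dim, bias=False)
        self.k_proj = nn.Linear(dim, dim, bias=False)
        self.v_proj = nn.Linear(dim, dim, bias=False)
        self.o_proj = nn.Linear(dim, dim, bias=False)
        # Per-head RMSNorm on queries and keys (torch.nn.RMSNorm needs PyTorch >= 2.4).
        self.q_norm = nn.RMSNorm(self.head_dim)
        self.k_norm = nn.RMSNorm(self.head_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, _ = x.shape
        q = self.q_proj(x).view(b, t, self.num_heads, self.head_dim)
        k = self.k_proj(x).view(b, t, self.num_heads, self.head_dim)
        v = self.v_proj(x).view(b, t, self.num_heads, self.head_dim)
        # QK-Reorder-Norm: normalize Q and K right after their projections.
        q, k = self.q_norm(q), self.k_norm(k)
        # (RoPE would be applied to local layers here; the card skips it for global layers.)
        mask = None
        if self.window is not None:
            idx = torch.arange(t, device=x.device)
            # Causal sliding-window mask: token i attends to j where i - window < j <= i.
            mask = (idx[None, :] <= idx[:, None]) & (idx[:, None] - idx[None, :] < self.window)
        out = F.scaled_dot_product_attention(
            q.transpose(1, 2), k.transpose(1, 2), v.transpose(1, 2),
            attn_mask=mask, is_causal=(mask is None),
        )
        return self.o_proj(out.transpose(1, 2).reshape(b, t, -1))


def build_hybrid_layers(num_layers: int, dim: int, num_heads: int, window: int = 4096):
    """3:1 hybrid pattern: three sliding-window layers for every full-attention layer."""
    return nn.ModuleList(
        QKNormAttention(dim, num_heads, window=None if (i + 1) % 4 == 0 else window)
        for i in range(num_layers)
    )
```

Here `build_hybrid_layers` simply makes every fourth layer global, which is one straightforward way to realize the 3:1 ratio stated in the card; the actual EXAONE 4.0 layer schedule, window size, and Post-LN placement should be taken from the linked technical report.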