Add link to GitHub repository

This PR improves the model card by adding a direct link to the GitHub repository.
README.md CHANGED

@@ -1,18 +1,18 @@
 ---
 base_model: LGAI-EXAONE/EXAONE-Deep-7.8B
-base_model_relation: quantized
-license: other
-license_name: exaone
-license_link: LICENSE
 language:
 - en
 - ko
+library_name: transformers
+license: other
+license_name: exaone
+license_link: LICENSE
+pipeline_tag: text-generation
 tags:
 - lg-ai
 - exaone
 - exaone-deep
-
-library_name: transformers
+base_model_relation: quantized
 ---
 
 <p align="center">
@@ -62,7 +62,12 @@ llama-cli -m ./EXAONE-Deep-7.8B-BF16.gguf \
     --temp 0.6 \
     --top-p 0.95 \
     --jinja \
-    --chat-template "{% for message in messages %}{% if loop.first and message['role'] != 'system' %}{{ '[|system|][|endofturn|]
+    --chat-template "{% for message in messages %}{% if loop.first and message['role'] != 'system' %}{{ '[|system|][|endofturn|]
+' }}{% endif %}{% set content = message['content'] %}{% if '</thought>' in content %}{% set content = content.split('</thought>')[-1].lstrip('\
+') %}{% endif %}{{ '[|' + message['role'] + '|]' + content }}{% if not message['role'] == 'user' %}{{ '[|endofturn|]' }}{% endif %}{% if not loop.last %}{{ '
+' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ '
+[|assistant|]<thought>
+' }}{% endif %}"
 ```
 
 > ### Note
@@ -93,8 +98,11 @@ We provide the pre-quantized EXAONE Deep models with **AWQ** and several quantiz
 
 To achieve the expected performance, we recommend using the following configurations:
 
-1. Ensure the model starts with `<thought
-
+1. Ensure the model starts with `<thought>
+` for reasoning steps. The model's output quality may be degraded when you omit it. You can easily apply this feature by using `tokenizer.apply_chat_template()` with `add_generation_prompt=True`. Please check the example code on [Quickstart](#quickstart) section.
+2. The reasoning steps of EXAONE Deep models enclosed by `<thought>
+...
+</thought>` usually have lots of tokens, so previous reasoning steps may be necessary to be removed in multi-turn situation. The provided tokenizer handles this automatically.
 3. Avoid using system prompt, and build the instruction on the user prompt.
 4. Additional instructions help the models reason more deeply, so that the models generate better output.
    - For math problems, the instructions **"Please reason step by step, and put your final answer within \boxed{}."** are helpful.