Commit
·
8aad296
1
Parent(s):
4688c1f
docs: update license to Apache 2.0 and add citation section in README
Browse files
README.md
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
---
|
2 |
-
license:
|
3 |
language:
|
4 |
- ja
|
5 |
- en
|
@@ -8,7 +8,7 @@ base_model: "cyberagent/calm3-22b-chat"
|
|
8 |
# calm3-22b-chat-selfimprove-experimental
|
9 |
|
10 |
[cyberagent/calm3-22b-chat](https://huggingface.co/cyberagent/calm3-22b-chat)を学習モデル・データ拡張に用いた自己学習モデルである.
|
11 |
-
[Answer Carefully Dataset (ACv1)](https://llmc.nii.ac.jp/en/answercarefully-dataset/)からデータ拡張し,
|
12 |
特に,不適切な質問応答に関するベンチマーク性能を向上させている.
|
13 |
|
14 |
## Requirements, Usage, Chat Template
|
@@ -146,3 +146,13 @@ v1.0: release (Feb 13, 2025)
|
|
146 |
|
147 |
[Mitsuki Sakamoto](https://huggingface.co/Mitsuki-Sakamoto), Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu (corresponding author: [email protected]).
|
148 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
license: apache-2.0
|
3 |
language:
|
4 |
- ja
|
5 |
- en
|
|
|
8 |
# calm3-22b-chat-selfimprove-experimental
|
9 |
|
10 |
[cyberagent/calm3-22b-chat](https://huggingface.co/cyberagent/calm3-22b-chat)を学習モデル・データ拡張に用いた自己学習モデルである.
|
11 |
+
[Answer Carefully Dataset (ACv1)](https://llmc.nii.ac.jp/en/answercarefully-dataset/)からデータ拡張し,Direct Preference Optimization (DPO)[Rafailov et al., 23]で学習させた.
|
12 |
特に,不適切な質問応答に関するベンチマーク性能を向上させている.
|
13 |
|
14 |
## Requirements, Usage, Chat Template
|
|
|
146 |
|
147 |
[Mitsuki Sakamoto](https://huggingface.co/Mitsuki-Sakamoto), Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu (corresponding author: [email protected]).
|
148 |
|
149 |
+
## How to cite
|
150 |
+
|
151 |
+
```tex
|
152 |
+
@misc{cyberagent-calm3-22b-chat-selfimprove-experimental,
|
153 |
+
title={cyberagent/calm3-22b-chat-selfimprove-experimental},
|
154 |
+
url={https://huggingface.co/cyberagent/calm3-22b-chat-selfimprove-experimental},
|
155 |
+
author={Mitsuki Sakamoto, Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu},
|
156 |
+
year={2025},
|
157 |
+
}
|
158 |
+
```
|