Mitsuki-Sakamoto committed on
Commit 8aad296 · 1 Parent(s): 4688c1f

docs: update license to Apache 2.0 and add citation section in README

Files changed (1)
  1. README.md +12 -2
README.md CHANGED
@@ -1,5 +1,5 @@
  ---
- license: cc-by-4.0
+ license: apache-2.0
  language:
  - ja
  - en
@@ -8,7 +8,7 @@ base_model: "cyberagent/calm3-22b-chat"
  # calm3-22b-chat-selfimprove-experimental
 
  A self-improved model that uses [cyberagent/calm3-22b-chat](https://huggingface.co/cyberagent/calm3-22b-chat) both as the model being trained and for data augmentation.
- The training data was augmented from the [Answer Carefully Dataset (ACv1)](https://llmc.nii.ac.jp/en/answercarefully-dataset/), and the model was trained with [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).
+ The training data was augmented from the [Answer Carefully Dataset (ACv1)](https://llmc.nii.ac.jp/en/answercarefully-dataset/), and the model was trained with Direct Preference Optimization (DPO)[Rafailov et al., 23].
  In particular, it improves performance on benchmarks concerning inappropriate question answering.
 
  ## Requirements, Usage, Chat Template
@@ -146,3 +146,13 @@ v1.0: release (Feb 13, 2025)
 
  [Mitsuki Sakamoto](https://huggingface.co/Mitsuki-Sakamoto), Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu (corresponding author: [email protected]).
 
+ ## How to cite
+
+ ```tex
+ @misc{cyberagent-calm3-22b-chat-selfimprove-experimental,
+ title={cyberagent/calm3-22b-chat-selfimprove-experimental},
+ url={https://huggingface.co/cyberagent/calm3-22b-chat-selfimprove-experimental},
+ author={Mitsuki Sakamoto and Yuu Jinnai and Tetsuro Morimura and Kenshi Abe and Kaito Ariu},
+ year={2025},
+ }
+ ```
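
For readers unfamiliar with the DPO objective named in the changed README line, the following is a minimal sketch of the per-example loss from Rafailov et al. (2023). It is an illustration only and not the authors' training code. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions made for this example.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * margin)).

    All arguments are summed log-probabilities of a full response.
    The names and the beta default are hypothetical, chosen for illustration.
    """
    # Implicit reward of each response: log-ratio of policy to reference model.
    chosen_reward = policy_chosen_logp - ref_chosen_logp
    rejected_reward = policy_rejected_logp - ref_rejected_logp
    z = beta * (chosen_reward - rejected_reward)
    # -log(sigmoid(z)) == log(1 + exp(-z)); branch on sign to avoid overflow.
    if z > 0:
        return math.log1p(math.exp(-z))
    return -z + math.log1p(math.exp(z))
```

The loss equals `log(2)` when the policy and the reference model agree, and it drops below that value as the policy assigns relatively more probability to the preferred response than the reference does.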