nlp-waseda
/

roberta-base-japanese

@@ -32,7 +32,7 @@ You can fine-tune this model on downstream tasks.
 ## Tokenization
-The input text should be segmented into words by [Juman++](https://github.com/ku-nlp/jumanpp) in advance. Each word is tokenized into tokens by [sentencepiece](https://github.com/google/sentencepiece).
 ## Vocabulary

 ## Tokenization
+The input text should be segmented into words by [Juman++](https://github.com/ku-nlp/jumanpp) in advance. Juman++ 2.0.0-rc3 was used for pretraining. Each word is tokenized into tokens by [sentencepiece](https://github.com/google/sentencepiece).
 ## Vocabulary