Added the version of Juman++ to README.md.
Browse files
README.md
CHANGED
@@ -32,7 +32,7 @@ You can fine-tune this model on downstream tasks.
|
|
32 |
|
33 |
## Tokenization
|
34 |
|
35 |
-
The input text should be segmented into words by [Juman++](https://github.com/ku-nlp/jumanpp) in advance. Each word is tokenized into tokens by [sentencepiece](https://github.com/google/sentencepiece).
|
36 |
|
37 |
## Vocabulary
|
38 |
|
|
|
32 |
|
33 |
## Tokenization
|
34 |
|
35 |
+
The input text should be segmented into words by [Juman++](https://github.com/ku-nlp/jumanpp) in advance. Juman++ 2.0.0-rc3 was used for pretraining. Each word is tokenized into tokens by [sentencepiece](https://github.com/google/sentencepiece).
|
36 |
|
37 |
## Vocabulary
|
38 |
|