Update README.md
Browse files
README.md
CHANGED
@@ -6,6 +6,7 @@ language:
|
|
6 |
---
|
7 |
|
8 |
# llm-jp-modernbert-base
|
|
|
9 |
|
10 |
This model is based on the [modernBERT-base](https://arxiv.org/abs/2412.13663) architecture with [llm-jp-tokenizer](https://github.com/llm-jp/llm-jp-tokenizer).
|
11 |
It was trained using the Japanese subset (3.4TB) of the llm-jp-corpus v4 and supports a max sequence length of 8192.
|
|
|
6 |
---
|
7 |
|
8 |
# llm-jp-modernbert-base
|
9 |
+
📄 [Paper](https://arxiv.org/abs/2504.15544) | 🧑💻 [Code](https://github.com/llm-jp/llm-jp-modernbert)
|
10 |
|
11 |
This model is based on the [modernBERT-base](https://arxiv.org/abs/2412.13663) architecture with [llm-jp-tokenizer](https://github.com/llm-jp/llm-jp-tokenizer).
|
12 |
It was trained using the Japanese subset (3.4TB) of the llm-jp-corpus v4 and supports a max sequence length of 8192.
|