research-backup
/

mt5-base-trimmed-ja-15000

text2text-generation

Model card Files Files and versions

mt5-base-trimmed-ja-15000 / README.md

asahi417's picture

commit files to HF hub

aadb37e over 2 years ago

|

history blame contribute delete

1.56 kB

	# Vocabulary Trimmed [google/mt5-base](https://huggingface.co/google/mt5-base): `vocabtrimmer/mt5-base-trimmed-ja-15000`
	This model is a trimmed version of [google/mt5-base](https://huggingface.co/google/mt5-base) by [`vocabtrimmer`](https://github.com/asahi417/lm-vocab-trimmer), a tool for trimming vocabulary of language models to compress the model size.
	Following table shows a summary of the trimming process.

	\| \| google/mt5-base \| vocabtrimmer/mt5-base-trimmed-ja-15000 \|
	\|:---------------------------\|:------------------\|:-----------------------------------------\|
	\| parameter_size_full \| 582,401,280 \| 221,273,856 \|
	\| parameter_size_embedding \| 384,172,032 \| 23,044,608 \|
	\| vocab_size \| 250,112 \| 15,003 \|
	\| compression_rate_full \| 100.0 \| 37.99 \|
	\| compression_rate_embedding \| 100.0 \| 6.0 \|


	Following table shows the parameter used to trim vocabulary.

	\| language \| dataset \| dataset_column \| dataset_name \| dataset_split \| target_vocab_size \| min_frequency \|
	\|:-----------\|:----------------------------\|:-----------------\|:---------------\|:----------------\|--------------------:\|----------------:\|
	\| ja \| vocabtrimmer/mc4_validation \| text \| ja \| validation \| 15000 \| 2 \|