fkrasnov2 commited on
Commit
971ae2d
·
verified ·
1 Parent(s): 222351e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -5
README.md CHANGED
@@ -9,13 +9,11 @@ tags:
9
  - e-commerce
10
  - encoder
11
  ---
12
- Encoder-model for e-commerce search query similarity task. The search queries are short.
13
 
14
- Sentencepiece tokenizer fitted on 269 million Russian search queries log.
15
 
16
- Short sequence length is used to reduce the memory footprint.
17
-
18
- The dataset for validation with manual markup consisted of 362 thousand examples.
19
 
20
  ![Validation results](https://huggingface.co/fkrasnov2/SBE/resolve/main/bvf_recall1k_query_len_eng.svg)
21
 
 
9
  - e-commerce
10
  - encoder
11
  ---
12
+ A sentencepiece tokenizer was applied to a corpus of 269 million Russian search queries.
13
 
14
+ The encoder-model was trained for the e-commerce search query similarity task, and the search queries were short.
15
 
16
+ The dataset for validation, which was manually annotated, comprised 362,000 instances.
 
 
17
 
18
  ![Validation results](https://huggingface.co/fkrasnov2/SBE/resolve/main/bvf_recall1k_query_len_eng.svg)
19