Post
1304
A few new Russian-language synthetic datasets. The labelling is good, but some of the syntax and grammar is not great.
Great for Russian-language classification models, probably not great for fine-tuning Russian-langauge text generation.
- Virtual Assistant Query / Responses: ZennyKenny/ru_virtual_assistant_chatgpt_distill
- LLM Query / Responses: ZennyKenny/russian_llm_response_chatgpt_distill
Crazy how much language drift is still an issue, especially given that Russian constitutes nearly 5% of the content on the internet.
Great for Russian-language classification models, probably not great for fine-tuning Russian-langauge text generation.
- Virtual Assistant Query / Responses: ZennyKenny/ru_virtual_assistant_chatgpt_distill
- LLM Query / Responses: ZennyKenny/russian_llm_response_chatgpt_distill
Crazy how much language drift is still an issue, especially given that Russian constitutes nearly 5% of the content on the internet.