Vinh Nguyen

vinhnx90

https://vinhnx.github.io

AI & ML interests

Learn by doing

Recent Activity

liked a dataset 13 minutes ago

Anthropic/hh-rlhf

liked a Space 17 minutes ago

hesamation/primer-llm-embedding

liked a Space about 1 hour ago

ds4sd/SmolDocling-256M-Demo

Organizations

None yet

vinhnx90's activity

liked a dataset 13 minutes ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k rows • 13.6k downloads • 1.3k likes

liked a Space 17 minutes ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

How Language Models Turn Text into Meaning, From Traditional

liked a Space about 1 hour ago

203

SmolDocling

🦆

Convert images and text to document formats

liked a model about 1 hour ago

nitrosocke/Ghibli-Diffusion

Text-to-Image • Updated Aug 3, 2023 • 6.6k downloads • 641 likes

liked 2 datasets about 1 hour ago

HuggingFaceTB/cosmopedia-100k

Viewer • Updated Feb 19, 2024 • 100k rows • 929 downloads • 42 likes

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M rows • 12.5k downloads • 317 likes

upvoted a collection about 11 hours ago

CoRNStack

Collection

State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated 2 days ago • 14 upvotes

liked a model about 11 hours ago

nomic-ai/nomic-embed-code

reacted to s-emanuilov's post with 👍 about 19 hours ago

Post

5219

Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth

I wanted to share my experiment with training reasoning models in languages other than English/Chinese.

Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage.

Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/

The model itself: s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1

I hope this helps anyone looking to build reasoning models in their language.
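
For readers unfamiliar with GRPO, the trainer scores each sampled completion with one or more reward functions. The following is a minimal sketch of what such a function might look like for a Bulgarian reasoning model, not the code from the linked tutorial. The `<reasoning>`/`<answer>` tag format, the Cyrillic-ratio heuristic, the 0.5 weights, and the function names are all assumptions made for illustration. The signature (a list of completions in, a list of floats out) is meant to resemble the callable shape that trl's GRPO trainer accepts, assuming completions arrive as plain strings.

```python
import re

# Hypothetical output format; the post does not say which tags were used.
PATTERN = re.compile(
    r"<reasoning>(.+?)</reasoning>\s*<answer>(.+?)</answer>", re.DOTALL
)


def cyrillic_ratio(text: str) -> float:
    """Fraction of alphabetic characters that fall in the basic Cyrillic block."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    cyrillic = sum(1 for ch in letters if "\u0400" <= ch <= "\u04FF")
    return cyrillic / len(letters)


def format_and_language_reward(completions, **kwargs):
    """Return one score per completion.

    0.0  if the expected tags are missing,
    0.5  for correct tags alone,
    up to 1.0 when the reasoning is also written in Cyrillic script.
    """
    rewards = []
    for text in completions:
        match = PATTERN.search(text)
        if match is None:
            rewards.append(0.0)
            continue
        reasoning = match.group(1)
        rewards.append(0.5 + 0.5 * cyrillic_ratio(reasoning))
    return rewards


if __name__ == "__main__":
    samples = [
        "<reasoning>Две плюс две е четири.</reasoning><answer>4</answer>",
        "<reasoning>Two plus two is four.</reasoning><answer>4</answer>",
        "no tags at all",
    ]
    print(format_and_language_reward(samples))
```

A script-based check like this only confirms the alphabet, not the language, so it would also reward Russian or Serbian text; a real setup would likely pair it with a correctness reward on the answer.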