Muhtasham Oblokulov PRO

muhtasham

Organizations

Deprem Yapay Zeka, Spaces-explorers, Amazon SageMaker Community, 🤗 Course Team, AI Law Assistant, Keras, Training Transformers Together, CVPR Demo Track, Technical University of Munich, HugGAN Community, Eddevs, Gradio-Blocks-Party, Webhooks Explorers (BETA), EuroPython 2022, fastai X Hugging Face Group 2022, ICML 2022, BigCode, Ludwig Maximilian University of Munich, Munich NLP, SIGGRAPH 2022, Sabanci University, Blog-explorers, ilm, MLX Vision, ZeroGPU Explorers, Unofficial Mistral Community, MLX Community, 7wonders-of-ai, Hugging Face Discord Community, Hugging Face Party @ PyTorch Conference

muhtasham's activity

liked a Space about 23 hours ago
upvoted an article 2 days ago
Open-R1: a fully open reproduction of DeepSeek-R1

• 434
reacted to tomaarsen's post with 🔥 3 days ago
I've just shipped the Sentence Transformers v3.1.1 patch release, fixing the hard negatives mining utility for some models. This utility is extremely useful to get more performance out of your embedding training data.

⛏ Hard negatives are texts that are rather similar to some anchor text (e.g. a query), but are not the correct match. They're difficult for a model to distinguish from the correct answer, often resulting in a stronger model after training.
mine_hard_negatives docs: https://sbert.net/docs/package_reference/util.html#sentence_transformers.util.mine_hard_negatives
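
To make the idea concrete, here is a minimal sketch of hard negative selection using only the Python standard library. It is not the Sentence Transformers implementation: the function name `toy_mine_hard_negatives`, the hand-written 2-D vectors, and the `num_negatives` parameter are all hypothetical, chosen for illustration. It ranks every non-matching candidate by cosine similarity to the anchor and keeps the most similar ones as hard negatives.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def toy_mine_hard_negatives(anchor, positive_idx, corpus, num_negatives=2):
    """Return indices of the non-positive corpus items most similar to the anchor.

    Hypothetical helper for illustration only; the real utility works on
    model embeddings and offers more filtering options.
    """
    scored = [
        (cosine(anchor, vec), idx)
        for idx, vec in enumerate(corpus)
        if idx != positive_idx  # never pick the correct match as a negative
    ]
    scored.sort(reverse=True)  # most similar first
    return [idx for _, idx in scored[:num_negatives]]


# Toy "embeddings" (assumed values, not produced by any model).
anchor = [1.0, 0.0]
corpus = [
    [0.9, 0.1],   # 0: the correct match (positive)
    [0.8, 0.3],   # 1: very similar but wrong, so a hard negative
    [0.0, 1.0],   # 2: unrelated, so an easy negative
    [0.7, 0.7],   # 3: moderately similar
    [-1.0, 0.0],  # 4: opposite direction
]

hard = toy_mine_hard_negatives(anchor, positive_idx=0, corpus=corpus, num_negatives=2)
print(hard)
```

In practice the vectors would come from an embedding model, and the linked `mine_hard_negatives` documentation describes the options the real utility supports.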

🔓 Beyond that, this release removes the numpy<2 restriction from v3.1.0. This was previously required for Windows as not all third-party libraries were updated to support numpy v2. With Sentence Transformers, you can now choose v1 or v2 of numpy.

Check out the full release notes here: https://github.com/UKPLab/sentence-transformers/releases/tag/v3.1.1

I'm looking forward to releasing v3.2; I have some exciting things planned 🚀
reacted to MoritzLaurer's post with ❤️ 3 days ago
#phdone - I defended my PhD yesterday! A key lesson: it is amazing how open science and open source can empower beginners with limited resources:

I first learned about instruction-based classifiers like BERT-NLI 3-4 years ago, through the @HuggingFace ZeroShotClassificationPipeline. Digging deeper into this, it was surprisingly easy to find new datasets, newer base models, and reusable fine-tuning scripts on the HF Hub to create my own zero-shot models, although I didn't know much about fine-tuning at the time.

Thanks to the community effect of the Hub, my models were downloaded hundreds of thousands of times after a few months. Seeing my research being useful for people motivated me to improve and upload newer models. Leaving my contact details in the model cards led to academic cooperation and consulting contracts (and eventually my job at HF).

That's the power of open science & open source: learning, sharing, improving, collaborating.

I mean every word in my thesis acknowledgments (screenshot). I'm very grateful to my supervisors @vanatteveldt @CasAndreu @KasperWelbers for their guidance; to @profAndreaRenda and @CEPS_thinktank for enabling me to work part-time during the first year; to @huggingface for creating awesome tools and an awesome platform; and to many others who are not active on social media.

Links to the full thesis and the collection of my most recent models are below.

PS: If someone happens to speak Latin, let me know if my diploma contains some hidden Illuminati code or something :D