Muhtasham Oblokulov PRO

muhtasham

https://www.linkedin.com/in/muhtasham/

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago

pipecat-ai/smart-turn-v2

liked a model 1 day ago

nvidia/canary-qwen-2.5b

liked a model 1 day ago

DFloat11/FLUX.1-Kontext-dev-DF11

upvoted 2 articles 1 day ago

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

Article • 2 days ago • 57

Introducing ColQwen-Omni: Retrieve in every modality

Article • 5 authors • 2 days ago • 39

upvoted an article 2 days ago

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Article • 6 authors • 3 days ago • 33

upvoted a collection 7 days ago

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated 10 days ago • 67

upvoted a paper 11 days ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published 16 days ago • 34

upvoted a collection 13 days ago

Speech-To-Text

https://kyutai.org/next/stt • 6 items • Updated 30 days ago • 12

upvoted a collection 15 days ago

EmoNet

The full collection of our EmoNet effort. More info available at: https://huggingface.co/blog/felfri/emonet • 8 items • Updated 27 days ago • 5

upvoted a paper 18 days ago

SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning

Paper • 2506.21355 • Published 22 days ago • 9

upvoted 2 collections 22 days ago

Gemma 3n

24 items • Updated 6 days ago • 11

Gemma 3n

4 items • Updated 9 days ago • 183

upvoted a paper 28 days ago

DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement

Paper • 2305.08227 • Published May 14, 2023 • 1

upvoted an article about 1 month ago

How to generate text: using different decoding methods for language generation with Transformers

Article • Mar 1, 2020 • 222

upvoted 2 papers about 1 month ago

Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10 • 29

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 114

upvoted 3 articles about 1 month ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Article • 9 authors • Jun 3 • 202

LTX-Video LoRA training study (Single image/style training)

Article • Jan 14 • 3

Introduction to 3D Gaussian Splatting

Article • Sep 18, 2023 • 93

upvoted a paper about 1 month ago

PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5 • 71

upvoted 2 collections about 1 month ago

Qwen3

Chat templates replaced with Qwen2.5 template • 14 items • Updated 19 days ago • 2

SkyReels-AX

7 items • Updated Apr 13 • 6