48 528 642

Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 3 hours ago

LLMs

liked a model about 3 hours ago

mlabonne/gemma-3-12b-it-abliterated-GGUF

updated a collection 1 day ago

LLMs

View all activity

Organizations

sugatoray's activity

upvoted a collection 1 day ago

Orpheus TTS

Collection

TTS Towards Human-Sounding Speech • 2 items • Updated 6 days ago • 49

upvoted an article 1 day ago

Article

Xet is on the Hub

7 days ago

• 32

upvoted a paper 1 day ago

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published 7 days ago • 86

upvoted 2 papers 2 days ago

Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Paper • 2503.16257 • Published 4 days ago • 21

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published 4 days ago • 67

upvoted a collection 2 days ago

MoshiVis v0.1

Collection

MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 8 items • Updated 3 days ago • 14

upvoted a paper 2 days ago

DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Paper • 2405.15306 • Published May 24, 2024 • 5

upvoted a collection 2 days ago

DeTikZify

Collection

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 12 items • Updated 5 days ago • 20

upvoted a paper 2 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 4 days ago • 38

upvoted a paper 3 days ago

SynCity: Training-Free Generation of 3D Worlds

Paper • 2503.16420 • Published 4 days ago • 19

upvoted an article 3 days ago

Article

Open R1: How to use OlympicCoder locally for coding?

5 days ago

• 45

upvoted 3 papers 3 days ago

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

Paper • 2503.16252 • Published 4 days ago • 23

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Paper • 2503.16057 • Published 4 days ago • 12

Tokenize Image as a Set

Paper • 2503.16425 • Published 4 days ago • 12

upvoted 2 papers 4 days ago

Manify: A Python Library for Learning Non-Euclidean Representations

Paper • 2503.09576 • Published 12 days ago • 1

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 6 days ago • 100

upvoted a paper 6 days ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published 10 days ago • 72

upvoted an article 7 days ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

• 51

upvoted a paper 7 days ago

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Paper • 2503.10582 • Published 11 days ago • 20