Satyam

satyamt

AI & ML interests

Biotechnology

Recent Activity

liked a model 13 days ago

reasonir/ReasonIR-8B

liked a model 14 days ago

ByteDance-Seed/BAGEL-7B-MoT

upvoted an article 15 days ago

Distributed Training with JAX and Flax NNX: A Practical Guide to Sharding

satyamt's activity

upvoted an article 15 days ago

Distributed Training with JAX and Flax NNX: A Practical Guide to Sharding

Article • Published Mar 26 • 7

upvoted a collection 4 months ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Apr 28 • 119

upvoted 2 papers 5 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 115

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 105

upvoted 3 papers 6 months ago

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published Dec 21, 2024 • 22

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published Dec 23, 2024 • 33

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 83

upvoted 2 collections 8 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 3 days ago • 155

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 305

upvoted an article 9 months ago

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Article • Published Jul 5, 2024 • 256

upvoted a paper 9 months ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29, 2024 • 54

upvoted an article 10 months ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Article • 3 authors • Published May 14, 2024 • 253

upvoted a paper 10 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 60

upvoted a collection 10 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 11 days ago • 147

upvoted an article 10 months ago

Constitutional AI with Open LLMs

Article • 7 authors • Published Feb 1, 2024 • 14

upvoted a collection 10 months ago

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 38

upvoted an article 10 months ago

Serverless Inference with Hugging Face and NVIDIA NIMs

Article • 2 authors • Published Jul 29, 2024 • 31

upvoted an article 11 months ago

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Article • 3 authors • Published Jun 24, 2024 • 194

upvoted 2 papers 12 months ago

GenQA: Generating Millions of Instructions from a Handful of Prompts

Paper • 2406.10323 • Published Jun 14, 2024 • 5

Show, Don't Tell: Aligning Language Models with Demonstrated Feedback

Paper • 2406.00888 • Published Jun 2, 2024 • 34