Sanjeev Satheesh

sanjeevnv · sancha

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago

📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models

updated a collection 27 days ago

NVIDIA Nemotron

upvoted a paper about 1 month ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

authored 10 papers about 1 month ago

ImageNet Large Scale Visual Recognition Challenge

Paper • 1409.0575 • Published Sep 1, 2014 • 9

Deep Speech: Scaling up end-to-end speech recognition

Paper • 1412.5567 • Published Dec 17, 2014

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Paper • 1512.02595 • Published Dec 8, 2015 • 2

Nemotron-4 340B Technical Report

Paper • 2406.11704 • Published Jun 17, 2024

MIND: Math Informed syNthetic Dialogues for Pretraining LLMs

Paper • 2410.12881 • Published Oct 15, 2024 • 1

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4, 2025 • 15

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2, 2025 • 42

Cold Fusion: Training Seq2Seq Models Together with Language Models

Paper • 1708.06426 • Published Aug 21, 2017 • 1

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 36

Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset

Paper • 2508.15096 • Published Aug 20, 2025 • 2