38 488 5944

Diwank Tomer PRO

diwank

https://diwank.name

AI & ML interests

None yet

Recent Activity

updated a collection about 15 hours ago

Audio

liked a model about 15 hours ago

nvidia/nemotron-speech-streaming-en-0.6b

updated a collection about 15 hours ago

Vision

View all activity

Organizations

upvoted a paper 16 days ago

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 19 days ago • 25

upvoted a collection 16 days ago

Gemma Scope 2

Collection

11 items • Updated 19 days ago • 16

upvoted a paper 19 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 21 days ago • 59

upvoted an article about 1 month ago

Article

Norm-Preserving Biprojected Abliteration

Nov 6, 2025

•

upvoted 2 papers about 2 months ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 53

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 105

upvoted 2 collections about 2 months ago

The Bestiary

Collection

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 78

Nemotron RAG

Collection

14 items • Updated 2 days ago • 57

upvoted a paper about 2 months ago

Drax: Speech Recognition with Discrete Flow Matching

Paper • 2510.04162 • Published Oct 5, 2025 • 27

upvoted 3 papers 2 months ago

upvoted 2 articles 2 months ago

Article

What makes good reasoning data

Oct 30, 2025

•

Article

Projected Abliteration

Oct 25, 2025

•

upvoted 6 papers 3 months ago

The Markovian Thinker

Paper • 2510.06557 • Published Oct 8, 2025 • 30

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 174

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 539

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training

Paper • 2509.25758 • Published Sep 30, 2025 • 22

DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

Paper • 2509.26603 • Published Sep 30, 2025 • 16

Diwank Tomer PRO

AI & ML interests

Recent Activity

Organizations

diwank's activity

Norm-Preserving Biprojected Abliteration

What makes good reasoning data

Projected Abliteration