Robin Williams

bfuzzy1

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 4 months ago

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 58

upvoted an article 5 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 741

upvoted a collection 5 months ago

Encoders vs Decoders: the Ettin Suite

A collection of SOTA, open-data, paired encoder-only and decoder-only models ranging from 17M to 1B params. See the paper at https://arxiv.org/abs/250 • 32 items • Updated Jul 16, 2025 • 25

commented on a paper 6 months ago

FLEXITOKENS: Flexible Tokenization for Evolving Language Models

Paper • 2507.12720 • Published Jul 17, 2025 • 9

updated a collection 6 months ago

Nifty

41 items • Updated Jul 19, 2025

upvoted 2 papers 6 months ago

FLEXITOKENS: Flexible Tokenization for Evolving Language Models

Paper • 2507.12720 • Published Jul 17, 2025 • 9

Teach Old SAEs New Domain Tricks with Boosting

Paper • 2507.12990 • Published Jul 17, 2025 • 11

updated a collection 6 months ago

Nifty

41 items • Updated Jul 19, 2025

upvoted a paper 6 months ago

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12, 2025 • 36

upvoted an article 6 months ago

Transformers Are Getting Old: Variants and Alternatives Exist!

Article • Jul 5, 2025 • 44

upvoted 5 papers 6 months ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19, 2025 • 60

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

FaithfulSAE: Towards Capturing Faithful Features with Sparse Autoencoders without External Dataset Dependencies

Paper • 2506.17673 • Published Jun 21, 2025 • 7

Steering Conceptual Bias via Transformer Latent-Subspace Activation

Paper • 2506.18887 • Published Jun 23, 2025 • 6

Orthogonal Finetuning Made Scalable

Paper • 2506.19847 • Published Jun 24, 2025 • 11

upvoted 2 papers 7 months ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10, 2025 • 30

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

updated a collection 7 months ago

Nifty

41 items • Updated Jul 19, 2025

upvoted 2 papers 7 months ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25, 2025 • 144

Truth Neurons

Paper • 2505.12182 • Published May 18, 2025 • 8