NivCohen's picture

18

NivCohen

NivC

Nivc

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation

upvoted a paper 10 days ago

WHISTRESS: Enriching Transcriptions with Sentence Stress Detection

upvoted a paper 10 days ago

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

View all activity

Organizations

None yet

NivC's activity

upvoted a paper about 15 hours ago

Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation

Paper • 2506.05062 • Published 6 days ago • 11

upvoted 4 papers 10 days ago

WHISTRESS: Enriching Transcriptions with Sentence Stress Detection

Paper • 2505.19103 • Published 17 days ago • 13

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Paper • 2505.17813 • Published 19 days ago • 55

CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature

Paper • 2505.20779 • Published 15 days ago • 15

StressTest: Can YOUR Speech LM Handle the Stress?

Paper • 2505.22765 • Published 13 days ago • 17

upvoted 2 papers 2 months ago

Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models

Paper • 2504.01137 • Published Apr 1 • 21

Scaling Analysis of Interleaved Speech-Text Language Models

Paper • 2504.02398 • Published Apr 3 • 29

upvoted 4 papers 3 months ago

OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models

Paper • 2503.18033 • Published Mar 23 • 25

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 91

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 84

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

Paper • 2503.09601 • Published Mar 12 • 15

upvoted a paper 4 months ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 36

upvoted 2 papers 6 months ago

JuStRank: Benchmarking LLM Judges for System Ranking

Paper • 2412.09569 • Published Dec 12, 2024 • 20

Hidden in the Noise: Two-Stage Robust Watermarking for Images

Paper • 2412.04653 • Published Dec 5, 2024 • 31

upvoted a paper 8 months ago

SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification

Paper • 2410.05057 • Published Oct 7, 2024 • 7

upvoted a paper 9 months ago

Style over Substance: Failure Modes of LLM Judges in Alignment Benchmarking

Paper • 2409.15268 • Published Sep 23, 2024 • 13

upvoted a paper 11 months ago

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Paper • 2407.00402 • Published Jun 29, 2024 • 23

upvoted a paper 12 months ago

Dataset Size Recovery from LoRA Weights

Paper • 2406.19395 • Published Jun 27, 2024 • 19