Eni Grand

Enigrand

AI & ML interests

None yet

Recent Activity

liked a Space about 21 hours ago

allenai/reward-bench

Enigrand's activity

upvoted a paper about 5 hours ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 2 days ago • 43

upvoted a paper about 8 hours ago

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Paper • 2501.16937 • Published 2 days ago • 2

upvoted 5 papers 1 day ago

Open Problems in Mechanistic Interpretability

Paper • 2501.16496 • Published 3 days ago • 11

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 84

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 2 days ago • 21

Are Vision Language Models Texture or Shape Biased and Can We Steer Them?

Paper • 2403.09193 • Published Mar 14, 2024 • 8

Low-Rank Adapters Meet Neural Architecture Search for LLM Compression

Paper • 2501.16372 • Published 8 days ago • 5

upvoted a collection 2 days ago

YuE

YuE: Open Full-song Generation Foundation Model • 9 items • Updated 2 days ago • 14

upvoted 2 papers 2 days ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 4 days ago • 39

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 5 days ago • 45

upvoted 2 collections 3 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 3 days ago • 283

InternLM3

6 items • Updated 13 days ago • 21

upvoted 2 collections 4 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 4 days ago • 86

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 1 day ago • 64

upvoted 4 papers 7 days ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published 8 days ago • 50

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11, 2024 • 29

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 8 days ago • 267

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 8 days ago • 75

upvoted a collection 9 days ago

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated about 6 hours ago • 48

upvoted a paper 9 days ago

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Paper • 2501.09781 • Published 14 days ago • 24