
Junlin Zhou

jlzhou

edwardzjl

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

reacted to Kseniase's post with ❤️ 5 days ago

6 Essential Reads on core AI/ML topics:

Time to look at some free useful resources that can help you upgrade your knowledge of AI and machine learning! Today we offer you these 6 must-read surveys that can be your perfect guides to the major fields and techniques:

1. Foundations of Large Language Models by Tong Xiao and Jingbo Zhu → https://arxiv.org/abs/2501.09223
Many recommend this 270-page book as a good resource to focus on fundamental concepts, such as pre-training, generative models, prompting, alignment, and inference

2. Large Language Models Post-Training: Surveying Techniques from Alignment to Reasoning → https://huggingface.co/papers/2503.06072
Read this to master policy optimization (RLHF, DPO, GRPO), supervised and parameter-efficient fine-tuning, reasoning, integration, and adaptation techniques

3. Agentic Large Language Models, a survey by Leiden University → https://arxiv.org/abs/2503.23037
Surveys agentic LLMs across reasoning, tools, and multi-agent collaboration, highlighting their synergy. It also explores their promise, risks and applications in medicine, finance, science.

4. A Survey of Context Engineering for Large Language Models → https://huggingface.co/papers/2507.13334
Defines Context Engineering as systematic info design for LLMs beyond prompting, covering retrieval, processing, management, and architectures like RAG and multi-agent systems

5. A Survey of Generative Categories and Techniques in Multimodal Large Language Models → https://arxiv.org/abs/2506.10016
Covers multimodal models, exploring six generative modalities, key techniques (SSL, RLHF, CoT), architectural trends, and challenges

6. Large Language Models for Time Series Analysis: Techniques, Applications, and Challenges → https://arxiv.org/abs/2506.11040
Explains how LLMs transform time series analysis by enhancing pattern recognition and long-term dependency handling + shows how to build them

Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe

new activity 15 days ago

mistralai/Magistral-Small-2506:docs: fix anchor link to "vllm-recommended"


Organizations

upvoted a paper 3 days ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published 11 days ago • 78

upvoted an article 15 days ago

SmolLM3: smol, multilingual, long-context reasoner

Article • and 22 others • 18 days ago • 578

upvoted an article 16 days ago

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Article • and 1 other • 17 days ago • 608

upvoted 3 papers about 1 month ago

Don't Pay Attention

Paper • 2506.11305 • Published Jun 12 • 8

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Paper • 2506.06205 • Published Jun 6 • 29

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 249

upvoted a paper about 2 months ago

Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods

Paper • 2505.17870 • Published May 23 • 5

upvoted a paper 2 months ago

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Paper • 2312.03209 • Published Dec 6, 2023 • 21

upvoted an article 3 months ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 632

upvoted a paper 3 months ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14 • 11

upvoted an article 4 months ago

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 318

upvoted a paper 4 months ago

Min P Sampling: Balancing Creativity and Coherence at High Temperature

Paper • 2407.01082 • Published Jul 1, 2024 • 1

upvoted an article 4 months ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • and 3 others • Mar 12 • 447

upvoted a paper 5 months ago

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Paper • 2502.18080 • Published Feb 25 • 2

upvoted 2 articles 5 months ago

Open R1: Update #3

Article • and 9 others • Mar 11 • 295

From Files to Chunks: Improving Hugging Face Storage Efficiency

Article • and 1 other • Nov 20, 2024 • 63

upvoted 2 papers 5 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 125

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 115

upvoted an article 5 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 190

upvoted a paper 6 months ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published Jan 29 • 59
