44 53 112

Junlin Zhou

jlzhou

edwardzjl

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

reacted to Kseniase's post with ❤️ 5 days ago

6 Essential Reads on core AI/ML topics: Time to look at some free useful resources that can help you upgrade your knowledge of AI and machine learning! Today we offer you these 6 must-read surveys that can be your perfect guides to the major fields and techniques: 1. Foundations of Large Language Models by Tong Xiao and Jingbo Zhu → https://arxiv.org/abs/2501.09223 Many recommend this 270-page book as a good resource to focus on fundamental concepts, such as pre-training, generative models, prompting, alignment, and inference 2. Large Language Models Post-Training: Surveying Techniques from Alignment to Reasoning -> https://huggingface.co/papers/2503.06072 Read this to master policy optimization (RLHF, DPO, GRPO), supervised and parameter-efficient fine-tuning, reasoning, integration, and adaptation techniques 3. Agentic Large Language Models, a survey by Leiden University → https://arxiv.org/abs/2503.23037 Surveys agentic LLMs across reasoning, tools, and multi-agent collaboration, highlighting their synergy. It also explores their promise, risks and applications in medicine, finance, science. 4. A Survey of Context Engineering for Large Language Models → https://huggingface.co/papers/2507.13334 Defines Context Engineering as systematic info design for LLMs beyond prompting, covering retrieval, processing, management, and architectures like RAG and multi-agent systems 5. A Survey of Generative Categories and Techniques in Multimodal Large Language Models → https://arxiv.org/abs/2506.10016 Covers multimodal models, exploring six generative modalities, key techniques (SSL, RLHF, CoT), architectural trends, and challenges 6. Large Language models for Time Series Analysis: Techniques, Applications, and Challenges → https://arxiv.org/abs/2506.11040 Explains how LLMs transform time series analysis by enhancing pattern recognition and long-term dependency handling + shows how to build them Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe

new activity 15 days ago

mistralai/Magistral-Small-2506:docs: fix anchor link to "vllm-recommended"

View all activity

Organizations

upvoted a paper 3 days ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published 11 days ago • 78

reacted to Kseniase's post with ❤️ 5 days ago

Post

5981

6 Essential Reads on core AI/ML topics:

Time to look at some free useful resources that can help you upgrade your knowledge of AI and machine learning!
Today we offer you these 6 must-read surveys that can be your perfect guides to the major fields and techniques:

1. Foundations of Large Language Models by Tong Xiao and Jingbo Zhu → https://arxiv.org/abs/2501.09223
Many recommend this 270-page book as a good resource to focus on fundamental concepts, such as pre-training, generative models, prompting, alignment, and inference

2. Large Language Models Post-Training: Surveying Techniques from Alignment to Reasoning -> A Survey on Post-training of Large Language Models (2503.06072)
Read this to master policy optimization (RLHF, DPO, GRPO), supervised and parameter-efficient fine-tuning, reasoning, integration, and adaptation techniques

3. Agentic Large Language Models, a survey by Leiden University → https://arxiv.org/abs/2503.23037
Surveys agentic LLMs across reasoning, tools, and multi-agent collaboration, highlighting their synergy. It also explores their promise, risks and applications in medicine, finance, science.

4. A Survey of Context Engineering for Large Language Models → A Survey of Context Engineering for Large Language Models (2507.13334)
Defines Context Engineering as systematic info design for LLMs beyond prompting, covering retrieval, processing, management, and architectures like RAG and multi-agent systems

5. A Survey of Generative Categories and Techniques in Multimodal Large Language Models → https://arxiv.org/abs/2506.10016
Covers multimodal models, exploring six generative modalities, key techniques (SSL, RLHF, CoT), architectural trends, and challenges

6. Large Language models for Time Series Analysis: Techniques, Applications, and Challenges → https://arxiv.org/abs/2506.11040
Explains how LLMs transform time series analysis by enhancing pattern recognition and long-term dependency handling + shows how to build them

Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe

1 reply

New activity in mistralai/Magistral-Small-2506 15 days ago

docs: fix anchor link to "vllm-recommended"

#26 opened 15 days ago by

jlzhou

upvoted an article 15 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

18 days ago

• 578

upvoted an article 16 days ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

and 1 other •

17 days ago

• 608

New activity in mistralai/Magistral-Small-2506 about 1 month ago

output issue

#14 opened about 1 month ago by

mobo68

upvoted 2 papers about 1 month ago

Don't Pay Attention

Paper • 2506.11305 • Published Jun 12 • 8

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Paper • 2506.06205 • Published Jun 6 • 29

commented a paper about 1 month ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 249 •

upvoted a paper about 1 month ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 249

upvoted a paper about 2 months ago

Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods

Paper • 2505.17870 • Published May 23 • 5

liked 2 models 2 months ago

tiiuae/Falcon3-10B-Instruct-1.58bit

Text Generation • 3B • Updated Jan 13 • 905 • 20

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated 28 days ago • 1.53M • • 11k

upvoted a paper 2 months ago

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Paper • 2312.03209 • Published Dec 6, 2023 • 21

liked a dataset 3 months ago

b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 2.1k • 469

reacted to mlabonne's post with 👍 3 months ago

Post

16957

✂️ AutoAbliteration

I made a Colab notebook to automatically abliterate models.

It's quite general, so you can do interesting stuff like blocking a given language in the model outputs.

💻 Colab: https://colab.research.google.com/drive/1RmLv-pCMBBsQGXQIM8yF-OdCNyoylUR1?usp=sharing

1 reply

upvoted an article 3 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 632

liked a model 3 months ago

meta-llama/Llama-Prompt-Guard-2-86M

Text Classification • 0.3B • Updated Apr 29 • 249k • • 52

reacted to etemiz's post with 👍 3 months ago

Post

1622

Grok 3 Human Alignment Score: 42

It is better in health, nutrition, fasting compared to Grok 2. About the same in liberating tech like bitcoin and nostr. Worse in the misinformation and faith domains. The rest is about the same. So we have a model that is less faithful but knows how to live a healthier life.

https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08?sheetid=0&range=A1

https://huggingface.co/blog/etemiz/benchmarking-ai-human-alignment-of-grok-3

upvoted a paper 3 months ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14 • 11

Junlin Zhou

AI & ML interests

Recent Activity

Organizations

jlzhou's activity

docs: fix anchor link to "vllm-recommended"

SmolLM3: smol, multilingual, long-context reasoner

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

output issue

Uncensor any LLM with abliteration