Abdelaziz Bounhar PRO

BounharAbdelaziz

AI & ML interests

Deep Learning, Reinforcement Learning, AI Agents, Generative Modeling, NLP, Information Theory, Security of Machine Learning, ...etc

Recent Activity

updated a collection 1 day ago

SFT Vision

updated a collection 1 day ago

RL Vision

updated a collection 1 day ago

SFT Vision Thinking

View all activity

Organizations

upvoted 2 collections 5 days ago

Reward Models

Collection

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 9 days ago • 20

Nemotron-Pre-Training-Dataset

Collection

5 items • Updated 6 days ago • 24

upvoted a collection 6 days ago

Step 1: Reproducing DeepSeek's Distilled Models

Collection

Code for training and evaluation: https://github.com/huggingface/open-r1 • 3 items • Updated May 26 • 3

upvoted an article 19 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

and 11 others •

19 days ago

• 472

upvoted a paper 21 days ago

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

Paper • 2507.19457 • Published 30 days ago • 24

upvoted a paper 26 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 290

upvoted 2 articles 29 days ago

Article

Improving Parquet Dedupe on Hugging Face Hub

and 1 other •

Oct 5, 2024

• 38

Article

Parquet Content-Defined Chunking

•

about 1 month ago

• 61

upvoted 2 collections about 1 month ago

DeepSeek-Prover

Collection

DeepSeek-Prover-Series • 10 items • Updated Apr 30 • 57

open_formal_datasets

Collection

4 items • Updated May 17, 2024 • 1

upvoted an article about 1 month ago

Article

<p style="text-align:center;"> Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs)</p>

and 2 others •

Jul 13

• 11

upvoted a collection about 2 months ago

Nile-Chat

Collection

6 items • Updated 1 day ago • 3

upvoted a paper about 2 months ago

Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts

Paper • 2507.04569 • Published Jul 6 • 19

upvoted an article about 2 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 636

upvoted an article 3 months ago

Article

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

and 5 others •

May 21

• 34

upvoted a collection 3 months ago

Absolute Zero Reasoner

Collection

6 items • Updated May 9 • 56

upvoted an article 3 months ago

Article

Putting RL back in RLHF

and 1 other •

Jun 12, 2024

• 100

upvoted an article 4 months ago

Article

Introducing the Open Arabic LLM Leaderboard

and 4 others •

May 14, 2024

• 97

upvoted 2 collections 4 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 164

AI2 Safety Toolkit

Collection

Safety data, moderation tools and safe LLMs. • 6 items • Updated Apr 30 • 8

Abdelaziz Bounhar PRO

AI & ML interests

Recent Activity

Organizations

BounharAbdelaziz's activity

Welcome GPT OSS, the new open-source model family from OpenAI!

Improving Parquet Dedupe on Hugging Face Hub

Parquet Content-Defined Chunking

<p style="text-align:center;"> Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs)</p>

SmolLM3: smol, multilingual, long-context reasoner

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Putting RL back in RLHF

Introducing the Open Arabic LLM Leaderboard