Ali Bidaran's picture

9 2 90

Ali Bidaran

alibidaran

·

AI & ML interests

LLMs, Computer Vision, Generative AI, NLP, Machine /Deep learning, Reinforcement Learning

Recent Activity

liked a dataset 2 days ago

mrs83/kurtis_mental_health_dpo

liked a dataset 2 days ago

arafatanam/Student-Mental-Health-Counseling-100K

liked a dataset 2 days ago

tcabanski/mental_health_counseling_responses

View all activity

Organizations

None yet

liked 5 datasets 2 days ago

mrs83/kurtis_mental_health_dpo

Viewer • Updated Dec 29, 2024 • 2.8k • 11 • 3

arafatanam/Student-Mental-Health-Counseling-100K

Viewer • Updated Apr 3 • 100k • 31 • 1

tcabanski/mental_health_counseling_responses

Viewer • Updated Jan 19 • 26.1k • 33 • 3

arafatanam/Student-Mental-Health-Counseling-50K

Viewer • Updated Apr 3 • 50k • 24 • 1

izi-ano/CounselBench-Adv

Viewer • Updated May 16 • 20 • 55 • 1

liked 2 datasets 6 days ago

angie-chen55/python-github-code

Viewer • Updated May 31, 2022 • 7.23M • 1.06k • 32

dipesh/python-code-ds-mini

Viewer • Updated Dec 9, 2022 • 2.8k • 316 • 14

reacted to sergiopaniego's post with 👍 19 days ago

Post

4478

Just included example scripts for aligning models using GSPO (including VLM example) 🙆‍♂️🙆‍♂️

GSPO is the latest RL alignment algo by @Alibaba_Qwen and it's already supported in the latest TRL v0.20 release.

Super-easy-to-get-started example scripts below, GO run them!👩‍💻👩‍💻

🧑‍🎨 Script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo.py
🦄 VLM script: https://github.com/huggingface/trl/blob/main/examples/scripts/gspo_vlm.py
🧩 More TRL examples: https://huggingface.co/docs/trl/main/en/example_overview
🧙‍♂️ GSPO paper: Group Sequence Policy Optimization (2507.18071)

liked a dataset 23 days ago

fka/awesome-chatgpt-prompts

Viewer • Updated Jan 6 • 203 • 37.9k • 8.81k

updated a model 25 days ago

alibidaran/GRPO_LLAMA3_Reasoning_Consultor

Updated 25 days ago

published a model 27 days ago

alibidaran/GRPO_LLAMA3_Reasoning_Consultor

Updated 25 days ago

updated a model 28 days ago

alibidaran/GRPO_LLAMA3-instructive_reasoning1

Updated 28 days ago

published a model 28 days ago

alibidaran/GRPO_LLAMA3-instructive_reasoning1

Updated 28 days ago

updated a model about 1 month ago

alibidaran/dreambooth-old-book-illustration

Text-to-Image • Updated about 1 month ago • 22

published a model about 1 month ago

alibidaran/dreambooth-old-book-illustration

Text-to-Image • Updated about 1 month ago • 22

updated a model about 1 month ago

alibidaran/llama3_vision_Radiography_demo

updated a Space about 1 month ago

Bone Fraction

Detect bone fractures in X-ray images

upvoted a paper about 1 month ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 245

updated a model about 1 month ago

alibidaran/GRPO_Python_Reasoning_Demo

updated a Space about 1 month ago

Davinci EYE

Segment surgical instruments and tissues in images