Women on Hugging Face

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

loubnabnl authored a paper 6 days ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

mariagrandury authored a paper 18 days ago

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

mariagrandury authored a paper 18 days ago

It's the same but not the same: Do LLMs distinguish Spanish varieties?

View all activity

WomenonHuggingFace's activity

AdinaY

posted an update 2 days ago

Post

1412

Lingshu 🩺📖 medical MLLM released by DAMO Alibaba

lingshu-medical-mllm/lingshu-mllms-6847974ca5b5df750f017dad

✨ 7B/32B
✨ 12+ imaging modalities supported: X-Ray, CT, MRI, Microscopy +more
✨ Great performance on medical benchmark

Ameeeee

posted an update 3 days ago

Post

1550

With Sheets, try a new way to create structured content with the help of AI!

No installs. No login. Just open a link and 🤩

This app lets you create a dataset by importing a file or starting from a prompt.

What’s different about SHEETS?
🔎 Web search integration to ground answers in real-world data
📚 In-context learning from validated sources
🔗 Transparent sourcing — every result is linked
🧩 Runs on multiple open-source models

Fight hallucinations and start creating content you can rely on.

AdinaY

posted an update 4 days ago

Post

3098

RoboBrain 2.0🔥 OPEN embedded brain model by BAAIBeijing

BAAI/RoboBrain2.0-7B

✨ 7B - Apache 2.0 / 32B coming soon
✨ Supports multiple images, long videos, and high-resolution visuals
✨ Spatial + temporal reasoning
✨ Real-time memory & scene graphs

loubnabnl

authored a paper 6 days ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published 7 days ago • 36

AdinaY

posted an update 6 days ago

Post

2626

RedNote 小红书 just released their first LLM 🔥

dots.llm1.base 🪐 a 142B MoE model with only 14B active params.

rednote-hilab/dotsllm1-68246aaaaba3363374a8aa7c
✨ Base & Instruct - MIT license
✨ Trained on 11.2T non-synthetic high-quality data
✨ Competitive with Qwen2.5/3 on reasoning, code, alignment

AdinaY

posted an update 6 days ago

Post

402

MiniCPM4🔥 efficient LLMs built for end-side devices, by OpenBMB

openbmb/minicpm4-6841ab29d180257e940baa9b

✨ Apache 2.0
✨ 5–7× Faster Inference (Jetson Orin & RTX 4090)
✨ 8B trained on 8T clean, non-synthetic tokens
✨ 32K Native Context -> 128K+ with InfLLM v2 + LongRoPE
✨ Runs on 🤗Transformers , http://CPM.cu, vLLM, and SGLang

AdinaY

posted an update 7 days ago

Post

1986

New models from Qwen 🔥

Qwen3-Embedding and Qwen3-Reranker Series just released on the hub by
Alibaba Qwen team.

✨ 0.6B/ 4B/ 8B with Apache2.0
✨ Supports 119 languages 🤯
✨ Top-tier performance: Leading the MTEB multilingual leaderboard！

Reranker:
Qwen/qwen3-reranker-6841b22d0192d7ade9cdefea
Embedding:
Qwen/qwen3-embedding-6841b2055b99c44d9a4c371f

AdinaY

posted an update 8 days ago

Post

1574

OpenAudio S1-mini 🔊 a new OPEN multilingual TTS model trained on 2M+ hours of data, by FishAudio

fishaudio/openaudio-s1-mini

✨ Supports 14 languages
✨ 50+ emotions & tones
✨ RLHF-optimized
✨ Special effects: laughing, crying, shouting, etc.

1 reply

AdinaY

posted an update 9 days ago

Post

997

AReaL-boba² 🔥 A fully async RL system by Ant Research & Tsinghua.

Paper: AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning (2505.24298)
Model:
inclusionAI/areal-boba-2-683f0e819ccb7bb2e1b2f2d5

✨ 8B/14B/32B models, datasets & paper – all on the hub
✨ 2.77× faster training
✨ Native Agentic RL support

AdinaY

posted an update 10 days ago

Post

868

SynLogic 🧠 logical reasoning model & dataset by MiniMax.

MiniMaxAI/synlogic-6836c3246fca0277657ff032

✨ 3 models: 7B/32B/ Mix-3-32B (MIT license)
✨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.)
✨ RL training with auto-verifiable rewards
✨ Generalizes to math without explicit math training
✨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines

AdinaY

posted an update 10 days ago

Post

1649

Video-XL-2 🔥 long video understanding model by BAAI & Shanghai Jiaotong University

BAAI/Video-XL-2

✨ Apache 2.0
✨ Handles up to 10,000+ frames on a single GPU
✨ 2048-frame encoding in just 12s
✨ Efficient Chunk-based Prefilling & Bi-granularity KV decoding

AdinaY

posted an update 11 days ago

Post

2140

May highlights from China’s open source ecosystem 🔥

zh-ai-community/may-2025-open-works-from-the-chinese-community-681a3494145f2914dc679b7c

✨ DeepSeek dropped R1 updates
- Both R1 & 8B distralled smol model

✨ Bytedance goes big on open source:
- BAGEL, Dolphin, Seedcoder, Dream0...

✨ Multimodal is on fire!
- HuyuanCustom / HunyuanVideo-Avatar / HunyuanPortrait
- MiniMax: SynLogic / Orsta-7B
- Xiaomi: MiMo VL
- Alibaba Wan: Wan2.1-VACE
- OpenGVlab: ZeroGUI
- StepFun: ACE-Step-v1/Step1X-3D

✨ Specialized models/datasets excels
- Alibaba Qwen: World PM 72B
- BAAI:RobotBrain (MLLM for robotic)
- HiThink Research: BizFinBench (dataset)
- OpenBMB: Ultra FineWeb (dataset)
- Bilibili: Index-anisora (Anime/ACG)
- Skywork:Matrix-Game (game)

More awesome releases: Alibaba QwenLong-L1-32B, SkyWork OR1, OpenS2V-5M etc...

AdinaY

posted an update 14 days ago

Post

527

MiMo-VL 🔥 smol & mighty vision language model by Xiaomi

XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212

✨ 7B with RL & SFT
✨ Native resolution ViT for fine grained perception
✨ MORL = smarter alignment across perception, grounding & reasoning

AdinaY

posted an update 16 days ago

Post

2653

🔥 New benchmark & dataset for Subject-to-Video generation

OPENS2V-NEXUS by Pekin University

✨ Fine-grained evaluation for subject consistency
BestWishYsh/OpenS2V-Eval
✨ 5M-scale dataset:
BestWishYsh/OpenS2V-5M
✨ New metrics – automatic scores for identity, realism, and text match

2 replies

AdinaY

posted an update 16 days ago

Post

2265

HunyuanVideo-Avatar 🔥 another image to video model byTencent Hunyuan

tencent/HunyuanVideo-Avatar

✨Emotion-controlled, high-dynamic avatar videos
✨Multi-character support with separate audio control
✨Works with any style: cartoon, 3D, real face, while keeping identity consistent

AdinaY

posted an update 17 days ago

Post

1919

HunyuanPortrait 🔥 video model by Tencent Hunyuan team.

HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation (2503.18860)
tencent/HunyuanPortrait

✨Portrait animation from just one image + a video prompt
✨Diffusion-based, implicit motion control
✨Superior temporal consistency & detail

AdinaY

posted an update 18 days ago

Post

2836

Orsta 🔥 vision language models trained with V-Triune, a unified reinforcement learning system by MiniMax AI

One-RL-to-See-Them-All/one-rl-to-see-them-all-6833d27abce23898b2f9815a

✨ 7B & 32B with MIT license
✨ Masters 8 visual tasks: math, science QA, charts, puzzles, object detection, grounding, OCR, and counting
✨ Uses Dynamic IoU rewards for better visual understanding
✨Strong performance in visual reasoning and perception

AdinaY

posted an update 18 days ago

Post

2090

QwenLong-L1🔥 long-context reasoning model by Alibaba Tongyi Zhiwen team.

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning (2505.17667)
Tongyi-Zhiwen/QwenLong-L1-32B

✨ 32B & Apache 2.0
✨ Outperforms OpenAI-o3-mini & Qwen3-235B-A22B
✨ Trained on a unique 1.6K DocQA RL dataset spanning math, logic & multi-hop reasoning

mariagrandury

authored 2 papers 18 days ago

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Paper • 2504.07072 • Published Apr 9 • 9

It's the same but not the same: Do LLMs distinguish Spanish varieties?

Paper • 2504.20049 • Published Apr 8

AI & ML interests

Recent Activity

Team members 65

WomenonHuggingFace's activity