Bave

alibave491

AI & ML interests

None yet

Recent Activity

reacted to azettl's post with 🔥 7 days ago

Agents & MCP Hackathon Day 1 Before starting with the second night of the Agents & MCP Hackathon, I briefly wanted to share my progress from last night. Not much sleep, but lots of progress! So I managed to build the first MVP version of my custom Gradio component, https://huggingface.co/spaces/azettl/gradio_consilium_roundtable (https://pypi.org/project/gradio-consilium-roundtable/). This creates a visual roundtable component for AI consensus discussions. Displays AI participants as avatars positioned around an oval table (poker style!) with animated speech bubbles, thinking states, and real-time discussion updates. Also, I managed to get a rough draft of the Gradio app + MCP server done, but not so much yet that I can share the space with you. You will be able to define your question the AI participants should discuss, decide on the protocol, do role assignments like having a devil's advocate on the table, and define the communication pattern. Lastly, you can decide which AI should be the moderator and how many rounds of discussions there should be. You can see my progress in the attached image. Most of the options are just placeholders right now, and I will work on their implementation tonight. Hopefully, I can add an MVP tomorrow evening to the following space: https://huggingface.co/spaces/Agents-MCP-Hackathon/consilium_mcp. I am also very interested in the cool stuff you all are building; please let me know in the comments. :)

reacted to AdinaY's post with 👍 7 days ago

SynLogic 🧠 logical reasoning model & dataset by MiniMax. https://huggingface.co/collections/MiniMaxAI/synlogic-6836c3246fca0277657ff032 ✨ 3 models: 7B/32B/ Mix-3-32B (MIT license) ✨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.) ✨ RL training with auto-verifiable rewards ✨ Generalizes to math without explicit math training ✨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines

reacted to AdinaY's post with 👀 7 days ago

Video-XL-2 🔥 long video understanding model by BAAI & Shanghai Jiaotong University https://huggingface.co/BAAI/Video-XL-2 ✨ Apache 2.0 ✨ Handles up to 10,000+ frames on a single GPU ✨ 2048-frame encoding in just 12s ✨ Efficient Chunk-based Prefilling & Bi-granularity KV decoding

View all activity

Organizations

None yet

alibave491's activity

reacted to azettl's post with 🔥 7 days ago

Post

1637

Agents & MCP Hackathon Day 1

Before starting with the second night of the Agents & MCP Hackathon, I briefly wanted to share my progress from last night. Not much sleep, but lots of progress!

So I managed to build the first MVP version of my custom Gradio component, https://huggingface.co/spaces/azettl/gradio_consilium_roundtable (https://pypi.org/project/gradio-consilium-roundtable/). This creates a visual roundtable component for AI consensus discussions. Displays AI participants as avatars positioned around an oval table (poker style!) with animated speech bubbles, thinking states, and real-time discussion updates.

Also, I managed to get a rough draft of the Gradio app + MCP server done, but not so much yet that I can share the space with you. You will be able to define your question the AI participants should discuss, decide on the protocol, do role assignments like having a devil's advocate on the table, and define the communication pattern. Lastly, you can decide which AI should be the moderator and how many rounds of discussions there should be. You can see my progress in the attached image.

Most of the options are just placeholders right now, and I will work on their implementation tonight. Hopefully, I can add an MVP tomorrow evening to the following space: Agents-MCP-Hackathon/consilium_mcp.

I am also very interested in the cool stuff you all are building; please let me know in the comments. :)

1 reply

reacted to AdinaY's post with 👍 7 days ago

Post

863

SynLogic 🧠 logical reasoning model & dataset by MiniMax.

MiniMaxAI/synlogic-6836c3246fca0277657ff032

✨ 3 models: 7B/32B/ Mix-3-32B (MIT license)
✨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.)
✨ RL training with auto-verifiable rewards
✨ Generalizes to math without explicit math training
✨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines

reacted to AdinaY's post with 👀 7 days ago

Post

1643

Video-XL-2 🔥 long video understanding model by BAAI & Shanghai Jiaotong University

BAAI/Video-XL-2

✨ Apache 2.0
✨ Handles up to 10,000+ frames on a single GPU
✨ 2048-frame encoding in just 12s
✨ Efficient Chunk-based Prefilling & Bi-granularity KV decoding

reacted to danielhanchen's post with 🤗 7 days ago

Post

3172

New DeepSeek-R1-0528 1.65-bit Dynamic GGUF!

Run the model locally even easier! Will fit on a 192GB Macbook and run at 7 tokens/s.

DeepSeek-R1-0528 GGUFs: unsloth/DeepSeek-R1-0528-GGUF
Qwen3-8B DeepSeek-R1-0528 GGUFs: unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF

And read our Guide: https://docs.unsloth.ai/basics/deepseek-r1-0528

reacted to AdinaY's post with 🚀 7 days ago

Post

2134

May highlights from China’s open source ecosystem 🔥

zh-ai-community/may-2025-open-works-from-the-chinese-community-681a3494145f2914dc679b7c

✨ DeepSeek dropped R1 updates
- Both R1 & 8B distralled smol model

✨ Bytedance goes big on open source:
- BAGEL, Dolphin, Seedcoder, Dream0...

✨ Multimodal is on fire!
- HuyuanCustom / HunyuanVideo-Avatar / HunyuanPortrait
- MiniMax: SynLogic / Orsta-7B
- Xiaomi: MiMo VL
- Alibaba Wan: Wan2.1-VACE
- OpenGVlab: ZeroGUI
- StepFun: ACE-Step-v1/Step1X-3D

✨ Specialized models/datasets excels
- Alibaba Qwen: World PM 72B
- BAAI:RobotBrain (MLLM for robotic)
- HiThink Research: BizFinBench (dataset)
- OpenBMB: Ultra FineWeb (dataset)
- Bilibili: Index-anisora (Anime/ACG)
- Skywork:Matrix-Game (game)

More awesome releases: Alibaba QwenLong-L1-32B, SkyWork OR1, OpenS2V-5M etc...

reacted to AdinaY's post with 🔥 7 days ago

Post

522

MiMo-VL 🔥 smol & mighty vision language model by Xiaomi

XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212

✨ 7B with RL & SFT
✨ Native resolution ViT for fine grained perception
✨ MORL = smarter alignment across perception, grounding & reasoning

reacted to AdinaY's post with ❤️ 7 days ago

Post

2646

🔥 New benchmark & dataset for Subject-to-Video generation

OPENS2V-NEXUS by Pekin University

✨ Fine-grained evaluation for subject consistency
BestWishYsh/OpenS2V-Eval
✨ 5M-scale dataset:
BestWishYsh/OpenS2V-5M
✨ New metrics – automatic scores for identity, realism, and text match

2 replies

reacted to AdinaY's post with 👍 7 days ago

Post

2260

HunyuanVideo-Avatar 🔥 another image to video model byTencent Hunyuan

tencent/HunyuanVideo-Avatar

✨Emotion-controlled, high-dynamic avatar videos
✨Multi-character support with separate audio control
✨Works with any style: cartoon, 3D, real face, while keeping identity consistent

reacted to AdinaY's post with 🔥 7 days ago

Post

1917

HunyuanPortrait 🔥 video model by Tencent Hunyuan team.

HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation (2503.18860)
tencent/HunyuanPortrait

✨Portrait animation from just one image + a video prompt
✨Diffusion-based, implicit motion control
✨Superior temporal consistency & detail

reacted to AdinaY's post with 👍 7 days ago

Post

2833

Orsta 🔥 vision language models trained with V-Triune, a unified reinforcement learning system by MiniMax AI

One-RL-to-See-Them-All/one-rl-to-see-them-all-6833d27abce23898b2f9815a

✨ 7B & 32B with MIT license
✨ Masters 8 visual tasks: math, science QA, charts, puzzles, object detection, grounding, OCR, and counting
✨ Uses Dynamic IoU rewards for better visual understanding
✨Strong performance in visual reasoning and perception

reacted to AdinaY's post with 🔥 7 days ago

Post

2088

QwenLong-L1🔥 long-context reasoning model by Alibaba Tongyi Zhiwen team.

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning (2505.17667)
Tongyi-Zhiwen/QwenLong-L1-32B

✨ 32B & Apache 2.0
✨ Outperforms OpenAI-o3-mini & Qwen3-235B-A22B
✨ Trained on a unique 1.6K DocQA RL dataset spanning math, logic & multi-hop reasoning

reacted to AdinaY's post with 🔥 7 days ago

Post

2790

ByteDance is absolutely cooking lately🔥

BAGEL 🥯 7B active parameter open multimodal foundation model by Bytedance Seed team.

ByteDance-Seed/BAGEL-7B-MoT

✨ Apache 2.0
✨ Outperforms top VLMs (Qwen2.5-VL & InternVL-2.5)
✨ Mixture-of-Transformer-Experts + dual encoders
✨ Trained on trillions of interleaved tokens