Bave

alibave491

AI & ML interests

None yet


Organizations

None yet

alibave491's activity

reacted to azettl's post with 🔥 7 days ago
Agents & MCP Hackathon Day 1

Before starting with the second night of the Agents & MCP Hackathon, I briefly wanted to share my progress from last night. Not much sleep, but lots of progress!

So I managed to build the first MVP version of my custom Gradio component, https://huggingface.co/spaces/azettl/gradio_consilium_roundtable (https://pypi.org/project/gradio-consilium-roundtable/). This creates a visual roundtable component for AI consensus discussions. It displays AI participants as avatars positioned around an oval table (poker style!) with animated speech bubbles, thinking states, and real-time discussion updates.

Also, I managed to get a rough draft of the Gradio app + MCP server done, but not enough yet that I can share the space with you. You will be able to define the question the AI participants should discuss, decide on the protocol, do role assignments like having a devil's advocate at the table, and define the communication pattern. Lastly, you can decide which AI should be the moderator and how many rounds of discussion there should be. You can see my progress in the attached image.

Most of the options are just placeholders right now, and I will work on their implementation tonight. Hopefully, I can add an MVP tomorrow evening to the following space: Agents-MCP-Hackathon/consilium_mcp.

I am also very interested in the cool stuff you all are building; please let me know in the comments. :)
reacted to AdinaY's post with 👍 7 days ago
SynLogic 🧠 logical reasoning model & dataset by MiniMax.

MiniMaxAI/synlogic-6836c3246fca0277657ff032

✨ 3 models: 7B / 32B / Mix-3-32B (MIT license)
✨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.)
✨ RL training with auto-verifiable rewards
✨ Generalizes to math without explicit math training
✨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines
reacted to AdinaY's post with 👀 7 days ago
Video-XL-2 🔥 long video understanding model by BAAI & Shanghai Jiao Tong University

BAAI/Video-XL-2

✨ Apache 2.0
✨ Handles up to 10,000+ frames on a single GPU
✨ 2048-frame encoding in just 12s
✨ Efficient Chunk-based Prefilling & Bi-granularity KV decoding
reacted to danielhanchen's post with 🤗 7 days ago
reacted to AdinaY's post with 🚀 7 days ago
May highlights from China’s open source ecosystem 🔥

zh-ai-community/may-2025-open-works-from-the-chinese-community-681a3494145f2914dc679b7c

✨ DeepSeek dropped R1 updates
- Both R1 & an 8B distilled smol model

✨ Bytedance goes big on open source:
- BAGEL, Dolphin, Seedcoder, DreamO...

✨ Multimodal is on fire!
- HunyuanCustom / HunyuanVideo-Avatar / HunyuanPortrait
- MiniMax: SynLogic / Orsta-7B
- Xiaomi: MiMo VL
- Alibaba Wan: Wan2.1-VACE
- OpenGVlab: ZeroGUI
- StepFun: ACE-Step-v1/Step1X-3D

✨ Specialized models/datasets excel
- Alibaba Qwen: World PM 72B
- BAAI: RoboBrain (MLLM for robotics)
- HiThink Research: BizFinBench (dataset)
- OpenBMB: Ultra FineWeb (dataset)
- Bilibili: Index-anisora (Anime/ACG)
- Skywork: Matrix-Game (game)

More awesome releases: Alibaba QwenLong-L1-32B, SkyWork OR1, OpenS2V-5M etc...
reacted to AdinaY's post with 🔥 7 days ago
MiMo-VL 🔥 smol & mighty vision language model by Xiaomi

XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212

✨ 7B with RL & SFT
✨ Native resolution ViT for fine grained perception
✨ MORL = smarter alignment across perception, grounding & reasoning
reacted to AdinaY's post with ❤️ 7 days ago
🔥 New benchmark & dataset for Subject-to-Video generation

OPENS2V-NEXUS by Peking University

✨ Fine-grained evaluation for subject consistency
BestWishYsh/OpenS2V-Eval
✨ 5M-scale dataset:
BestWishYsh/OpenS2V-5M
✨ New metrics – automatic scores for identity, realism, and text match
reacted to AdinaY's post with 👍 7 days ago
HunyuanVideo-Avatar 🔥 another image-to-video model by Tencent Hunyuan

tencent/HunyuanVideo-Avatar

✨ Emotion-controlled, high-dynamic avatar videos
✨ Multi-character support with separate audio control
✨ Works with any style: cartoon, 3D, real face, while keeping identity consistent
reacted to AdinaY's post with 🔥 7 days ago
reacted to AdinaY's post with 👍 7 days ago
Orsta 🔥 vision language models trained with V-Triune, a unified reinforcement learning system by MiniMax AI

One-RL-to-See-Them-All/one-rl-to-see-them-all-6833d27abce23898b2f9815a

✨ 7B & 32B with MIT license
✨ Masters 8 visual tasks: math, science QA, charts, puzzles, object detection, grounding, OCR, and counting
✨ Uses Dynamic IoU rewards for better visual understanding
✨ Strong performance in visual reasoning and perception
reacted to AdinaY's post with 🔥 7 days ago
reacted to AdinaY's post with 🔥 7 days ago
ByteDance is absolutely cooking lately 🔥

BAGEL 🥯 7B active parameter open multimodal foundation model by Bytedance Seed team.

ByteDance-Seed/BAGEL-7B-MoT

✨ Apache 2.0
✨ Outperforms top VLMs (Qwen2.5-VL & InternVL-2.5)
✨ Mixture-of-Transformer-Experts + dual encoders
✨ Trained on trillions of interleaved tokens