AI & ML interests

None defined yet.

AdinaY 
posted an update 1 day ago
view post
Post
2010
Hunyuan-A13B 🔥 New MoE LLM by TencentHunyuan

tencent/Hunyuan-A13B-Instruct

✨80B total / 13B active params
✨256K context window
✨Dual-mode reasoning: fast & slow thinking
✨Efficient inference (GQA + quantization)
AdinaY 
posted an update 4 days ago
AdinaY 
posted an update 4 days ago
view post
Post
252
MOSS-TTSD 🔊 Bilingual text-to-spoken dialogue model by Fudan University - Open MOSS team.

Model:
fnlp/MOSS-TTSD-v0
Demo:
fnlp/MOSS-TTSD

✨ Supports Chinese & English
✨ Zero-shot 2-speaker voice cloning
✨ Long-form generation (up to 960s)
✨ Built on Qwen 3
AdinaY 
posted an update 5 days ago
view post
Post
243
Skywork-SWE 🔥 New code agent model by Skywork 天工

Skywork/Skywork-SWE-32B

✨ 32B - Apache 2.0
✨ 38.0% pass@1 on SWE-bench Verified
✨ Up to 47.0% with test-time scaling
✨ Shows clear data scaling law (8K+ demos)
✨ Built on Qwen2.5-Coder-32B + OpenHands
AdinaY 
posted an update 11 days ago
AdinaY 
posted an update 12 days ago
view post
Post
4023
Kimi-Dev 💻 New coding model by Moonshot AI

moonshotai/Kimi-Dev-72B

✨ 72B - MIT license
✨ 60.4% on SWE-bench Verified
✨ RL-trained to patch real repos in Docker
✨ Only rewarded if full test suite passes
AdinaY 
posted an update 12 days ago
view post
Post
644
MiniMax-M1 🔥 The First reasoning model by MiniMax.

MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094

✨ 40k/80k thinking budget
✨ Powered by Hybrid MoE + Lightning Attention 👀
✨ 1M context length 🤯
✨ Apache 2.0
✨ RL-trained for math, coding & real-world software
AdinaY 
posted an update 13 days ago
view post
Post
438
Hunyuan 3D 2.1 🔥 Industrial-grade 3D model just released by Tencent Hunyuan

tencent/Hunyuan3D-2.1
tencent/Hunyuan3D-2.1

✨ PBR materials: leather, bronze & more, breathtaking realism under any light
✨ Consumer GPU-ready: good for developers and small teams

AdinaY 
posted an update 18 days ago
AdinaY 
posted an update 20 days ago
view post
Post
3178
RoboBrain 2.0🔥 OPEN embedded brain model by BAAIBeijing

BAAI/RoboBrain2.0-7B

✨ 7B - Apache 2.0 / 32B coming soon
✨ Supports multiple images, long videos, and high-resolution visuals
✨ Spatial + temporal reasoning
✨ Real-time memory & scene graphs
AdinaY 
posted an update 22 days ago
view post
Post
2682
RedNote 小红书 just released their first LLM 🔥

dots.llm1.base 🪐 a 142B MoE model with only 14B active params.

rednote-hilab/dotsllm1-68246aaaaba3363374a8aa7c
✨ Base & Instruct - MIT license
✨ Trained on 11.2T non-synthetic high-quality data
✨ Competitive with Qwen2.5/3 on reasoning, code, alignment
AdinaY 
posted an update 22 days ago
view post
Post
451
MiniCPM4🔥 efficient LLMs built for end-side devices, by OpenBMB

openbmb/minicpm4-6841ab29d180257e940baa9b

✨ Apache 2.0
✨ 5–7× Faster Inference (Jetson Orin & RTX 4090)
✨ 8B trained on 8T clean, non-synthetic tokens
✨ 32K Native Context -> 128K+ with InfLLM v2 + LongRoPE
✨ Runs on 🤗Transformers , http://CPM.cu, vLLM, and SGLang
AdinaY 
posted an update 23 days ago
AdinaY 
posted an update 24 days ago
view post
Post
1618
OpenAudio S1-mini 🔊 a new OPEN multilingual TTS model trained on 2M+ hours of data, by FishAudio

fishaudio/openaudio-s1-mini

✨ Supports 14 languages
✨ 50+ emotions & tones
✨ RLHF-optimized
✨ Special effects: laughing, crying, shouting, etc.
  • 1 reply
·
AdinaY 
posted an update 25 days ago
AdinaY 
posted an update 25 days ago
view post
Post
885
SynLogic 🧠 logical reasoning model & dataset by MiniMax.

MiniMaxAI/synlogic-6836c3246fca0277657ff032

✨ 3 models: 7B/32B/ Mix-3-32B (MIT license)
✨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.)
✨ RL training with auto-verifiable rewards
✨ Generalizes to math without explicit math training
✨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines
AdinaY 
posted an update 26 days ago
view post
Post
1664
Video-XL-2 🔥 long video understanding model by BAAI & Shanghai Jiaotong University

BAAI/Video-XL-2

✨ Apache 2.0
✨ Handles up to 10,000+ frames on a single GPU
✨ 2048-frame encoding in just 12s
✨ Efficient Chunk-based Prefilling & Bi-granularity KV decoding
AdinaY 
posted an update 27 days ago
view post
Post
2159
May highlights from China’s open source ecosystem 🔥

zh-ai-community/may-2025-open-works-from-the-chinese-community-681a3494145f2914dc679b7c

✨ DeepSeek dropped R1 updates
- Both R1 & 8B distralled smol model

✨ Bytedance goes big on open source:
- BAGEL, Dolphin, Seedcoder, Dream0...

✨ Multimodal is on fire!
- HuyuanCustom / HunyuanVideo-Avatar / HunyuanPortrait
- MiniMax: SynLogic / Orsta-7B
- Xiaomi: MiMo VL
- Alibaba Wan: Wan2.1-VACE
- OpenGVlab: ZeroGUI
- StepFun: ACE-Step-v1/Step1X-3D

✨ Specialized models/datasets excels
- Alibaba Qwen: World PM 72B
- BAAI:RobotBrain (MLLM for robotic)
- HiThink Research: BizFinBench (dataset)
- OpenBMB: Ultra FineWeb (dataset)
- Bilibili: Index-anisora (Anime/ACG)
- Skywork:Matrix-Game (game)

More awesome releases: Alibaba QwenLong-L1-32B, SkyWork OR1, OpenS2V-5M etc...
AdinaY 
posted an update 30 days ago
view post
Post
539
MiMo-VL 🔥 smol & mighty vision language model by Xiaomi

XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212

✨ 7B with RL & SFT
✨ Native resolution ViT for fine grained perception
✨ MORL = smarter alignment across perception, grounding & reasoning
AdinaY 
posted an update about 1 month ago
view post
Post
2665
🔥 New benchmark & dataset for Subject-to-Video generation

OPENS2V-NEXUS by Pekin University

✨ Fine-grained evaluation for subject consistency
BestWishYsh/OpenS2V-Eval
✨ 5M-scale dataset:
BestWishYsh/OpenS2V-5M
✨ New metrics – automatic scores for identity, realism, and text match
  • 2 replies
·