Tahir's picture

Tahir

TahirC
·

AI & ML interests

None yet

Recent Activity

Organizations

Stanford AI's profile picture Samsung Electronics's profile picture

TahirC's activity

reacted to KaiChen1998's post with 👍 7 days ago
view post
Post
4766
📢 Our EMOVA paper has been accepted by CVPR 2025, and we are glad to release all resources, including code (training & inference), datasets (training & evaluation), and checkpoints (EMOVA-3B/7B/72B)!

🤗 EMOVA is a novel end-to-end omni-modal LLM that can see, hear and speak. Given omni-modal (i.e., textual, visual and speech) inputs, EMOVA can generate both textual and speech responses with vivid emotional controls by utilizing the speech decoder and a style controller.

✨ EMOVA Highlights
✅ State-of-the-art omni-modality: EMOVA achieves SoTA comparable results on both vision-language and speech benchmarks simultaneously.
✅ Device adaptation: our codebase supports training/inference on both NVIDIA GPUs (e.g., A800 & H20) and Ascend NPUs (e.g., 910B3)!
✅ Modular design: we integrate multiple implementations of vision encoder, vision projector, and language model, even including the most recent DeepSeekMoE-tiny!

🔥 You are all welcome to try and star!
- Project page: https://emova-ollm.github.io/
- Github: https://github.com/emova-ollm/EMOVA
- Demo: Emova-ollm/EMOVA-demo
upvoted an article 10 days ago
view article
Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

338
New activity in Remade-AI/Deflate 11 days ago
reacted to AdinaY's post with 🔥 18 days ago
view post
Post
4006
Exciting releases from the Chinese community this February🔥
👉 zh-ai-community/2025-february-67a35aaa68e97812def5b6ef

MLLM:
✨ Ovis2 by Alibaba
AIDC-AI/ovis2-67ab36c7e497429034874464
✨ Step Audio Chat by StepFun AI
stepfun-ai/step-audio-67b33accf45735bb21131b0b

Audio:
✨ Step Audio TTS by StepFunAI
stepfun-ai/Step-Audio-TTS-3B
✨ InspireMusic by Alibaba
https://huggingface.co/FunAudioLLM
✨ Baichuan Audio by BaichuanAI
baichuan-inc/Baichuan-Audio-Instruct

Video:
✨ Wan2.1 by Alibaba_Wan
Wan-AI/Wan2.1-T2V-14B
✨ Stepvideo-T2V by StepFun AI
stepfun-ai/stepvideo-t2v
✨ SkyReels-V1 by Skywork
Skywork/skyreels-v1-67b34676ff65b4ec02d16307
✨ LLaDA-8B by RenminUniversity
GSAI-ML/LLaDA-8B-Instruct

MoE:
✨ Moonlight-16B by MoonshotAI (Kimi)
moonshotai/Moonlight-16B-A3B-Instruct

Reasoning:
✨ TinyR1-32B by Qihoo360
qihoo360/TinyR1-32B-Preview

Dataset:
✨ Chinese DeepSeek R1-Distill data -110k
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
reacted to elismasilva's post with 🔥 18 days ago
view post
Post
2793
MoD ControlNet Tile Upscaler for SDXL: Upscale Your Images with Ease! 🚀

Meet the MoD ControlNet Tile Upscaler for SDXL, a powerful tool that uses advanced technology to upscale your images without losing quality! Our app is designed to process images in tiles without leaving them blurry or with visible lines between the tiles. The result? Upscaled images with preserved details and smooth, natural transitions—all through a user-friendly interface. ✨

What MoD Upscaler Offers:

🔍 Preserved Details: Unlike traditional upscalers, the MoD ControlNet Tile Upscaler enlarges your images while maintaining clarity and adding details that might otherwise be lost. Your photos gain more definition without sacrificing original quality.
🧩 Advanced Tiling Technology: We use a smart combination of techniques to ensure natural and smooth transitions between tiles. This means your upscaled images remain consistent and high-quality, even at higher resolutions. No more visible lines or imperfections!
⚡ Fast and Efficient: You don’t need a super-powered computer! Our app is optimized to run quickly and smoothly, even on simpler machines.
🎨 Easy-to-Use Interface: You don’t have to be an expert to use the MoD ControlNet Tile Upscaler. The interface is simple, intuitive, and designed so anyone can achieve professional-quality results without hassle.
Upscale your images without losing quality and with details preserved. Try the MoD ControlNet Tile Upscaler today! 👍

Demo App: elismasilva/mod-control-tile-upscaler-sdxl
Github Code: https://github.com/DEVAIEXP/mod-control-tile-upscaler-sdxl

We use Gradio amazing interfaces.
We use Hugging Face Diffusers to build this tool and Hugging Face Spaces to run this demo.

Thank you all! 🙏
New activity in unsloth/Qwen2.5-VL-7B-Instruct 21 days ago
New activity in unsloth/README about 1 month ago
New activity in unsloth/Qwen2.5-VL-72B-Instruct-bnb-4bit about 1 month ago

Update preprocessor_config.json

6
#3 opened about 1 month ago by
TahirC