Chihang Lau

puccho

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

upvoted a paper 19 days ago

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

liked a Space 3 months ago

puccho/Soundwave

upvoted a paper 2 days ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published 5 days ago • 59

upvoted a paper 19 days ago

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

Paper • 2506.01111 • Published 26 days ago • 29

liked a Space 3 months ago

Soundwave

🚀

The Official Demo of Soundwave

liked a dataset 3 months ago

cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 164k • 496

upvoted a paper 3 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79

updated a model 3 months ago

FreedomIntelligence/Soundwave

Audio-Text-to-Text • 9B • Updated Mar 16 • 61 • 11

published a Space 3 months ago

Soundwave

🚀

The Official Demo of Soundwave

updated a Space 3 months ago

Soundwave

🚀

The Official Demo of Soundwave

upvoted a paper 4 months ago

S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information

Paper • 2503.05085 • Published Mar 7 • 48

liked a model 4 months ago

FreedomIntelligence/Soundwave

Audio-Text-to-Text • 9B • Updated Mar 16 • 61 • 11

published a model 4 months ago

FreedomIntelligence/Soundwave

Audio-Text-to-Text • 9B • Updated Mar 16 • 61 • 11

liked a Space 4 months ago

SoundwaveDemo

📉

Process audio and generate text output based on instructions

liked a model 4 months ago

IDEA-CCNL/Ziya-LLaMA-7B-Reward

Text Classification • Updated Jun 7, 2023 • 119 • 71

liked a dataset 4 months ago

JusperLee/EchoSet

Viewer • Updated Jan 22 • 82.1k • 185 • 8

authored a paper 4 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 86

upvoted a paper 4 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 86

upvoted 2 papers 6 months ago

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 47

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 105
