6 12 4

Yuzhe Yang

TobyYang7

https://tobyyang7.github.io/

TobyYang7

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

upvoted a paper 22 days ago

XtraGPT: LLMs for Human-AI Collaboration on Controllable Academic Paper Revision

upvoted a paper 2 months ago

JudgeLRM: Large Reasoning Models as a Judge

View all activity

Organizations

TobyYang7's activity

upvoted a paper about 23 hours ago

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

Paper • 2506.01111 • Published 8 days ago • 27

upvoted a paper 22 days ago

XtraGPT: LLMs for Human-AI Collaboration on Controllable Academic Paper Revision

Paper • 2505.11336 • Published 25 days ago • 6

upvoted 2 papers 2 months ago

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published Mar 31 • 61

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79

upvoted a paper 3 months ago

S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information

Paper • 2503.05085 • Published Mar 7 • 48

upvoted a paper 4 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 86

authored a paper 4 months ago

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Paper • 2502.01506 • Published Feb 3 • 38

upvoted a paper 4 months ago

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Paper • 2502.01506 • Published Feb 3 • 38

liked a model 5 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 653k • • 12.3k

upvoted 2 papers 5 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 47

upvoted a paper 7 months ago

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Paper • 2411.03823 • Published Nov 6, 2024 • 50

updated a dataset 8 months ago

TobyYang7/UCFE

Viewer • Updated Oct 26, 2024 • 1 • 21 • 1

upvoted a paper 8 months ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17, 2024 • 62

authored a paper 8 months ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17, 2024 • 62

New activity in TheFinAI/FinLLaVA 9 months ago

Update app.py

#2 opened 9 months ago by

Yueru1

updated a Space 9 months ago

FinLLaVA

🔥

Combine images and text to get answers

New activity in TheFinAI/FinLLaVA 9 months ago

Update README.md

#1 opened 9 months ago by

Yueru1

liked 2 models 9 months ago

arcee-ai/Llama-3-SEC-Chat

Text Generation • Updated Jun 20, 2024 • 32 • 37

arcee-ai/Llama-3.1-SuperNova-Lite

Text Generation • Updated Jan 17 • 4.94k • • 192