arxiv:2510.15232
Tongyan Hu
entropyhu
Recent Activity
authored a paper 4 days ago
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain
upvoted a paper 4 days ago
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in Finance Domain
upvoted a paper 23 days ago
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use