1 5 3

wums

RadiCat

fairyshine

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

simplescaling/s1K

upvoted a paper about 2 months ago

ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition

updated a dataset about 2 months ago

RadiCat/Seal-Tools

View all activity

Organizations

RadiCat's activity

liked a dataset about 1 month ago

simplescaling/s1K

Viewer • Updated Feb 11 • 1k • 2.51k • 217

upvoted a paper about 2 months ago

ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition

Paper • 2503.21248 • Published Mar 27 • 20

updated a dataset about 2 months ago

RadiCat/Seal-Tools

Preview • Updated Mar 25 • 111

authored 4 papers about 2 months ago

Mirror: A Universal Framework for Various Information Extraction Tasks

Paper • 2311.05419 • Published Nov 9, 2023

Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark

Paper • 2405.08355 • Published May 14, 2024

NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models

Paper • 2410.11805 • Published Oct 15, 2024 • 14

Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models

Paper • 2503.16779 • Published Mar 21 • 1

New activity in RadiCat/SimpleToolQuestions about 2 months ago

Improve dataset card

#1 opened about 2 months ago by

nielsr

updated a dataset about 2 months ago

RadiCat/SimpleToolQuestions

Preview • Updated Mar 25 • 66 • 1

published 2 datasets about 2 months ago

RadiCat/SimpleToolQuestions

Preview • Updated Mar 25 • 66 • 1

RadiCat/Seal-Tools

Preview • Updated Mar 25 • 111

upvoted a paper 4 months ago

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published Jan 8 • 37

upvoted 2 papers 9 months ago

Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM

Paper • 2408.07246 • Published Aug 14, 2024 • 22

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Paper • 2408.03361 • Published Aug 6, 2024 • 87

upvoted a paper 10 months ago

Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

Paper • 2407.10058 • Published Jul 14, 2024 • 32

liked 2 models about 2 years ago

gsdf/Counterfeit-V2.5

Text-to-Image • Updated Mar 14, 2023 • 8.17k • 1.57k

THUDM/chatglm-6b

Updated Aug 4, 2024 • 3.56k • 2.85k