Praxis Maldevide

maldv · maldevide

AI & ML interests

ai alchemy

Recent Activity

liked a model 1 day ago

LatitudeGames/Wayfarer-12B

liked a model 1 day ago

alpindale/L3.3-70B-Magnum-v4-SE-FP8

liked a model 2 days ago

Qwen/Qwen2.5-Coder-32B-Instruct

maldv's activity

upvoted a paper 9 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 10 days ago • 230

upvoted an article 16 days ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Article • 16 days ago • 37

upvoted a paper 29 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 30 days ago • 340

upvoted a paper 3 months ago

Upcycling Large Language Models into Mixture of Experts

Paper • 2410.07524 • Published Oct 10, 2024 • 4

upvoted a collection 6 months ago

Lumimaid 0.2

4 items • Updated Jul 26, 2024 • 17

upvoted an article 6 months ago

The Great LLM Showdown: Amy's Quest for the Perfect LLM

Article • Jul 9, 2024 • 13

upvoted a paper 9 months ago

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 61

upvoted a paper 10 months ago

DiJiang: Efficient Large Language Models through Compact Kernelization

Paper • 2403.19928 • Published Mar 29, 2024 • 11