KuKu's picture

KuKu

dragonkue

·

AI & ML interests

anything.

Recent Activity

liked a dataset 2 days ago

XuHu6736/s1_59k

upvoted a paper 5 days ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

upvoted a paper 6 days ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

View all activity

Organizations

dragonkue's activity

upvoted a paper 5 days ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 22 days ago • 118

upvoted 2 papers 6 days ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 45

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published 16 days ago • 30

upvoted a paper about 1 month ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55

upvoted 2 articles about 2 months ago

Article

Can RLHF with Preference Optimization Techniques Help LLMs Surpass GPT4-Quality Models?

By

•

Nov 24, 2024

• 4

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

By

and 4 others •

Jan 18, 2024

• 63

upvoted a paper 2 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 50

upvoted an article 2 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

By

•

Mar 26

• 134

upvoted an article 3 months ago

Article

What Makes a Dialog Agent Useful?

By

and 3 others •

Jan 24, 2023

• 2

upvoted a collection 3 months ago

EXAONE-Deep

EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated Mar 18 • 86

upvoted 2 papers 3 months ago

Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers

Paper • 2503.00865 • Published Mar 2 • 65

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published Mar 10 • 39

upvoted a collection 3 months ago

Tools 4 learning AI

This is a collection of tools on the hub that teachers and students can use to learn AI! • 10 items • Updated 26 days ago • 67

upvoted a paper 3 months ago

Contextual Document Embeddings

Paper • 2410.02525 • Published Oct 3, 2024 • 23

upvoted 2 articles 5 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

By

and 2 others •

Feb 23, 2024

• 124

Article

Train 400x faster Static Embedding Models with Sentence Transformers

By

•

Jan 15

• 185

upvoted 2 collections 5 months ago

Papers I've read

16 items • Updated Jan 12 • 6

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 238

upvoted a paper 5 months ago

Arctic-Embed 2.0: Multilingual Retrieval Without Compromise

Paper • 2412.04506 • Published Dec 3, 2024 • 1

upvoted an article 5 months ago

Article

Can We Train Chat Models with Raw Data?

By

•

Apr 25, 2024

• 19