2 24

Alsu Sagirova

alsu-sagirova

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

upvoted a paper 23 days ago

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

upvoted a paper 4 months ago

Towards an AI co-scientist

View all activity

Organizations

upvoted a paper 17 days ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published 23 days ago • 123

upvoted a paper 23 days ago

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

Paper • 2506.04089 • Published 24 days ago • 46

upvoted 5 papers 4 months ago

Towards an AI co-scientist

Paper • 2502.18864 • Published Feb 26 • 50

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26 • 49

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Paper • 2502.14922 • Published Feb 19 • 32

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20 • 91

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published Feb 18 • 73

upvoted 7 papers 5 months ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 70

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 41

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 90

Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation

Paper • 2502.00306 • Published Feb 1 • 5

upvoted an article 5 months ago

Article

RAG using huggingface tools

•

Jul 7, 2024

• 88

upvoted 4 papers 5 months ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 58

AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5, 2024 • 34

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 48

upvoted a paper 7 months ago

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 73

Alsu Sagirova

AI & ML interests

Recent Activity

Organizations

alsu-sagirova's activity

RAG using huggingface tools