16 154 709

Drishti Sharma

DrishtiSharma

https://scholar.google.com/citations?hl=en&user=9-GkrdkAAAAJ

AI & ML interests

None yet

Recent Activity

updated a dataset about 5 hours ago

large-traversaal/mantra-14b-user-interaction-log

updated a collection about 9 hours ago

translation eval

liked a model about 14 hours ago

ModelSpace/GemmaX2-28-9B-v0.1

View all activity

Organizations

DrishtiSharma's activity

upvoted a paper about 1 month ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

upvoted an article about 2 months ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

•

Oct 20, 2024

• 43

upvoted 3 papers 3 months ago

upvoted an article 4 months ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

and 2 others •

Feb 19

• 70

upvoted 13 papers 4 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 86

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published Feb 12 • 19

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16 • 29

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published Feb 16 • 22

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 23

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published Feb 13 • 32

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 36

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published Feb 12 • 44

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 90

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 132

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 54

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153

upvoted an article 4 months ago

Article

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

•

Feb 10

• 58