Vishesh Tripathi

vishesh-t27

https://vishesht27.github.io/#/

AI & ML interests

Large Language Models, Generative AI

Recent Activity

updated a model 23 days ago

vishesh-t27/deepseek-v3-500m

published a model 23 days ago

vishesh-t27/deepseek-v3-500m

liked a dataset about 1 month ago

open-thoughts/OpenThoughts-114k

upvoted an article about 2 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • and 22 others • Jul 8 • 643

upvoted 2 papers 2 months ago

From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning

Paper • 2505.17117 • Published May 21 • 1

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19 • 88

upvoted a paper 5 months ago

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published Apr 3 • 90

upvoted an article 11 months ago

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Article • Jul 5, 2024 • 290

upvoted 2 papers over 1 year ago

Komodo: A Linguistic Expedition into Indonesia's Regional Languages

Paper • 2403.09362 • Published Mar 14, 2024 • 11

Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models

Paper • 2401.02333 • Published Jan 4, 2024 • 7

upvoted a paper almost 2 years ago

InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining

Paper • 2310.07713 • Published Oct 11, 2023 • 3