Larsen Weigle's picture

64 2

Larsen Weigle

larsenweigle

https://larsenweigle.github.io/personalwebsite/

larsenweigle

AI & ML interests

NLP + Environmental Conservation

Organizations

None yet

larsenweigle's activity

upvoted a paper 5 days ago

Sample-Efficient Alignment for LLMs

Paper • 2411.01493 • Published 8 days ago • 10

upvoted a paper 6 days ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published 6 days ago • 53

upvoted a paper 10 days ago

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published 11 days ago • 57

upvoted 2 papers 17 days ago

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published 21 days ago • 53

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published 19 days ago • 42

upvoted a paper 20 days ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published 25 days ago • 52

upvoted 4 papers about 1 month ago

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

Paper • 2410.03450 • Published Oct 4 • 35

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8 • 107

Self-Boosting Large Language Models with Synthetic Preference Data

Paper • 2410.06961 • Published Oct 9 • 15

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 165

upvoted a paper 2 months ago

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework

Paper • 2308.08155 • Published Aug 16, 2023 • 3

upvoted 3 papers 3 months ago

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15 • 38

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13 • 30

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13 • 65

upvoted 2 papers 4 months ago

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18 • 52

SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30 • 25

upvoted a paper 5 months ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24 • 67

upvoted 3 papers 7 months ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 124

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15 • 82

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 103