1 19 75

wattai

wattai

AI & ML interests

Im interested in generating BMS charts from text and music prompts.

Recent Activity

liked a model 7 days ago

microsoft/phi-4

upvoted a paper 8 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

liked a model 10 days ago

dream-textures/texture-diffusion

View all activity

Organizations

None yet

wattai's activity

upvoted a paper 8 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 9 days ago • 75

upvoted 2 papers 14 days ago

Dynamic Scaling of Unit Tests for Code Reward Modeling

Paper • 2501.01054 • Published 16 days ago • 17

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 16 days ago • 47

upvoted 2 papers 18 days ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 54

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 24 days ago • 94

upvoted an article 22 days ago

Article

Deriving DPO's Loss

•

25 days ago

• 26

upvoted a paper 23 days ago

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published 30 days ago • 85

upvoted a paper 2 months ago

Direct Preference Optimization Using Sparse Feature-Level Constraints

Paper • 2411.07618 • Published Nov 12, 2024 • 15

upvoted 2 papers 3 months ago

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published Oct 17, 2024 • 37

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

Paper • 2410.08584 • Published Oct 11, 2024 • 12

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 216

upvoted 2 papers 5 months ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 48

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 79

upvoted an article 6 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 264

upvoted 2 papers 6 months ago

E5-V: Universal Embeddings with Multimodal Large Language Models

Paper • 2407.12580 • Published Jul 17, 2024 • 40

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 132

upvoted 2 papers 9 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 120

upvoted an article 9 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 282