John-Paul K.'s picture

John-Paul K.

johnpaulbin

·

johnpaulbin

AI & ML interests

None yet

Recent Activity

liked a model 14 days ago

unslothai/1

liked a model 17 days ago

laion/BUD-E-Whisper

liked a model 21 days ago

Menlo/Jan-nano

View all activity

Organizations

upvoted a collection 2 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 5 days ago • 161

upvoted a paper 4 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 132

upvoted 2 papers 6 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 87

Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains

Paper • 2402.05140 • Published Feb 6, 2024 • 24

upvoted a collection 12 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 681

upvoted a paper about 1 year ago

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6, 2024 • 39

upvoted 2 papers almost 2 years ago

STEVE-1: A Generative Model for Text-to-Behavior in Minecraft

Paper • 2306.00937 • Published Jun 1, 2023 • 9

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 87