Nathan Simons's picture

Nathan Simons

JoeySalmons

·

AI & ML interests

I like AI

Recent Activity

upvoted a collection about 13 hours ago

liked a model 1 day ago

mistralai/Magistral-Small-2506_gguf

liked a model 1 day ago

mistralai/Magistral-Small-2506

View all activity

Organizations

None yet

JoeySalmons's activity

upvoted a collection about 13 hours ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 4 items • Updated about 13 hours ago • 47

upvoted a collection 6 days ago

Common Pile v0.1 Filtered Data

An LLM pre-training dataset produced by filtering and deduplicating the raw text collected in the Common Pile v0.1 • 31 items • Updated 6 days ago • 12

upvoted a paper 6 days ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published 7 days ago • 36

upvoted 2 collections 6 days ago

Common Pile v0.1 Raw Data

8TB of public domain and openly licensed text • 30 items • Updated 6 days ago • 12

Common Pile v0.1

All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated 6 days ago • 24

upvoted a collection 21 days ago

Falcon-H1

Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained and instruction-tuned). • 37 items • Updated 22 days ago • 40

upvoted 2 collections 22 days ago

Gemma 3n Preview

4 items • Updated 1 day ago • 124

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 4 items • Updated 13 days ago • 158

upvoted 2 collections about 1 month ago

Qwen3

40 items • Updated 22 days ago • 749

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 13 days ago • 150

upvoted 2 collections about 2 months ago

Granite 3.3 Language Models

Our latest language models licensed under Apache 2.0 license. • 4 items • Updated May 2 • 34

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated Apr 15 • 125

upvoted 6 collections 2 months ago

InternVL3

34 items • Updated Apr 20 • 70

Cogito v1 Preview

5 items • Updated Apr 8 • 111

EXL3 models

22 items • Updated 9 days ago • 25

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 13 days ago • 197

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 526

Rei-12B

A small preview of what might become the first(or second?) stepping stone for Magnum v5 • 8 items • Updated May 7 • 4

upvoted 2 collections 3 months ago

Ling

8 items • Updated 27 days ago • 11

Llama Nemotron

Open, Production-ready Enterprise Models • 8 items • Updated about 10 hours ago • 60