AI Safety Research

AISafety · https://humanaligned.ai

AI & ML interests

LLMs, planning, EA

Recent Activity

liked a model 1 day ago

Hcompany/Holo1-7B

new activity 6 days ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B: Any plans for a Qwen3-32B model?

liked a model 6 days ago

deepseek-ai/DeepSeek-R1-0528

AISafety's activity

upvoted an article 26 days ago

Article

Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs

By […] and 1 other • 29 days ago • 35

upvoted a collection about 1 month ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 6 days ago • 148

upvoted an article about 1 month ago

Article

Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time

By […] and 4 others • Feb 18 • 33

upvoted a collection about 1 month ago

Granite Experiments

Experimental projects under consideration for the Granite family. • 17 items • Updated 1 day ago • 12

upvoted a paper about 2 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 74

upvoted a collection about 2 months ago

Cogito v1 Preview

5 items • Updated Apr 8 • 111

upvoted an article 3 months ago

Article

Open R1: Update #3

By […] and 9 others • Mar 11 • 292

upvoted an article 4 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By […] and 2 others • Jan 28 • 862

upvoted an article 7 months ago

Article

Decoding Strategies in Large Language Models

By […] • Oct 29, 2024 • 66

upvoted a collection 9 months ago

Solar Pro

The most intelligent LLM on a single GPU • 4 items • Updated Nov 15, 2024 • 14

upvoted 4 collections about 1 year ago

Cohere Labs Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated Apr 15 • 55

Yi-1.5 (2024/05)

10 items • Updated May 20, 2024 • 93

Yi 1.5 GGUFs

Collection of Yi 1.5 GGUFs made with gguf-my-repo • 15 items • Updated May 20, 2024 • 5

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 569

upvoted 2 papers about 1 year ago

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Paper • 2402.15627 • Published Feb 23, 2024 • 39

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 618

upvoted a collection over 1 year ago

Frankenmodels

They're not supposed to be that size! Neat, right? • 8 items • Updated Dec 12, 2023 • 3