Skywork-R1V2 Collection Multimodal Hybrid Reinforcement Learning for Reasoning • 4 items • Updated 11 days ago • 10
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. • 29 items • Updated 9 days ago • 85
xLAM-2 Collection A family of Large Action Models for multi-turn conversation and tool use • 10 items • Updated 5 days ago • 13
Granite 3.3 Language Models Collection Our latest language models, licensed under Apache 2.0. • 4 items • Updated 8 days ago • 33
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published Apr 7 • 130
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 157
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published Apr 2 • 83
Apriel Collection ServiceNow Language Modeling Lab's first model family series • 3 items • Updated 3 days ago • 9
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, excelling at agentic tasks, long context, and thinking • 6 items • Updated 28 days ago • 66
ReSearch Collection Trained models as described in the paper "ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning" • 5 items • Updated Mar 27 • 5
CoRNStack Collection State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated Mar 26 • 17
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Paper • 2503.10582 • Published Mar 13 • 23
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 24 days ago • 68
Llama Nemotron Collection Open, production-ready enterprise models • 6 items • Updated about 6 hours ago • 51