Stephen Moore

morongosteve

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a model 26 minutes ago

ByteDance-Seed/Tar-7B

Any-to-Any • 9B • Updated 13 days ago • 82 • 30

replied to AdinaY's post 3 days ago

🫡🫡🫡🫡

reacted to AdinaY's post with 🔥 3 days ago

Post

3135

Kimi-K2 is now available on the hub🔥🚀
This is a trillion-parameter MoE model focused on long context, code, reasoning, and agentic behavior.

https://huggingface.co/collections/moonshotai/kimi-k2-6871243b990f2af5ba60617d

✨ Base & Instruct
✨ 1T total / 32B active - Modified MIT License
✨ 128K context length
✨ Muon optimizer for stable trillion-scale training

1 reply

liked 3 models 3 days ago

bullerwins/FLUX.1-Kontext-dev-GGUF

Image-to-Image • 12B • Updated 19 days ago • 84.2k • 174

ai21labs/AI21-Jamba-Mini-1.7

52B • Updated 9 days ago • 89 • 21

lmarena-ai/p2l-7b-grk-01112025

7B • Updated Feb 25 • 7 • 4

liked a model 4 days ago

NovelAI/nerdstash-tokenizer-v1

Updated Aug 2, 2023 • 8

New activity in darkc0de/Chat 4 days ago

See

#9 opened 4 days ago

liked a model 4 days ago

darkc0de/XortronCriminalComputingConfig

Text Generation • 24B • Updated 8 days ago • 668 • 33

liked a Space 4 days ago

UncensoredChat

Fast and free uncensored chatbot that just works.

liked a model 4 days ago

apple/DiffuCoder-7B-cpGRPO

8B • Updated 11 days ago • 3.41k • 288

liked a Space 5 days ago

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

upvoted 2 papers 15 days ago

Consciousness in Artificial Intelligence: Insights from the Science of Consciousness

Paper • 2308.08708 • Published Aug 17, 2023 • 2

Learning to Skip the Middle Layers of Transformers

Paper • 2506.21103 • Published 19 days ago • 16

liked 2 models 19 days ago

tencent/Hunyuan3D-2.1

Image-to-3D • Updated 21 days ago • 64.3k • 539

FractalAIResearch/Fathom-R1-14B

Text Generation • 15B • Updated Jun 5 • 20.6k • 282

liked a Space 24 days ago

Qwen3 Demo

Generate responses to your messages

liked 3 models 24 days ago

mradermacher/DPOpenHermes-7B-v2-PerfLaser-GGUF

7B • Updated Nov 3, 2024 • 203 • 1

TheBloke/KafkaLM-70B-German-V0.1-GGUF

Text Generation • 69B • Updated Jan 31, 2024 • 1.36k • 41

mradermacher/Qwen3-33B-A3B-Stranger-Thoughts-IPONDER-GGUF

33B • Updated 4 days ago • 445 • 1