Krishna Kaasyap
KrishnaKaasyap
4 followers · 27 following
krishnakaasyap
krishnakaasyap.bsky.social
AI & ML interests
Test Time Training; Multimodal & Inter-Modality Transfer Learning; Mechanistic Interpretability; Evolutionary Model Merging; Swarm Intelligence of multiple models with different architectures and different algorithms; MuZero approach to general tasks
KrishnaKaasyap's activity
liked
a model
about 8 hours ago
baichuan-inc/Baichuan-M1-14B-Instruct
Updated
Feb 20
•
22.8k
•
55
liked
a model
5 days ago
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
Updated
1 day ago
•
13k
•
568
upvoted
a
collection
16 days ago
Llama 4
Collection
Llama 4 release
•
10 items
•
Updated
16 days ago
•
439
liked
a model
26 days ago
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
6 days ago
•
171k
•
1.45k
liked
2 models
about 1 month ago
CohereLabs/c4ai-command-a-03-2025
Text Generation
•
Updated
6 days ago
•
17.8k
•
344
Qwen/QwQ-32B
Text Generation
•
Updated
Mar 11
•
679k
•
2.7k
New activity in
RekaAI/reka-flash-3
about 1 month ago
Context length and reasoning length?
1
#6 opened about 1 month ago by
KrishnaKaasyap
liked
a model
about 1 month ago
RekaAI/reka-flash-3
Updated
Mar 13
•
2.75k
•
362
liked
3 models
3 months ago
deepseek-ai/Janus-Pro-1B
Any-to-Any
•
Updated
Feb 1
•
33.6k
•
429
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
Feb 1
•
239k
•
3.34k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation
•
Updated
Feb 24
•
263k
•
665
New activity in
deepseek-ai/DeepSeek-R1
3 months ago
Is this the same as DeepSeek-R1 (Preview) mentioned on LiveCodeBench?
1
2
#10 opened 3 months ago by
KrishnaKaasyap
New activity in
deepseek-ai/DeepSeek-R1-Zero
3 months ago
Hail CCP!!! God bless Chyna!
19
8
#3 opened 3 months ago by
mnemojeet
Thank you deepseek
29
2
#8 opened 3 months ago by
teknium
liked
3 models
3 months ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
25 days ago
•
1.71M
•
12k
deepseek-ai/DeepSeek-R1-Zero
Text Generation
•
Updated
25 days ago
•
5.69k
•
901
MiniMaxAI/MiniMax-Text-01
Text Generation
•
Updated
4 days ago
•
6.79k
•
572
liked
a Space
4 months ago
Running
1.13k
InstantCoder
🦀
Generate app code from ideas
liked
a dataset
4 months ago
PowerInfer/QWQ-LONGCOT-500K
Viewer
•
Updated
Dec 26, 2024
•
286k
•
349
•
122
liked
a model
4 months ago
PowerInfer/SmallThinker-3B-Preview
Text Generation
•
Updated
Jan 16
•
46.5k
•
394