Krishna Kaasyap

KrishnaKaasyap

AI & ML interests

Test Time Training, Multimodal & Inter-Modality Transfer Learning, Mechanistic Interpretability, Evolutionary Model Merging, Swarm Intelligence of multiple models with different architectures and different algorithms, MuZero approach to general tasks

Recent Activity

liked a model about 8 hours ago
baichuan-inc/Baichuan-M1-14B-Instruct
liked a model 5 days ago
microsoft/bitnet-b1.58-2B-4T
upvoted a collection 16 days ago
Llama 4

Organizations

Blog-explorers

KrishnaKaasyap's activity

upvoted an article 8 months ago

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

• 232