Prithiv Sakthi

prithivMLmods

https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality @strangerzonehf @strangerguardhf

Recent Activity

liked a model about 6 hours ago

prithivMLmods/Spiral-Qwen3-4B-F32-GGUF

updated a model about 20 hours ago

prithivMLmods/Lacaille-MoT-4B-Supreme2

liked a model about 20 hours ago

aharley/alltracker

upvoted 2 papers 2 days ago

Fast and Simplex: 2-Simplicial Attention in Triton

Paper • 2507.02754 • Published 5 days ago • 21

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Paper • 2507.02321 • Published 5 days ago • 38

upvoted 3 papers 3 days ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published 6 days ago • 37

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published 5 days ago • 84

HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation

Paper • 2506.21546 • Published 12 days ago • 2

upvoted an article 5 days ago

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

7 days ago • 83

upvoted 2 papers 5 days ago

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

Paper • 2507.01925 • Published 6 days ago • 29

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published 6 days ago • 112

upvoted 3 papers 6 days ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published 7 days ago • 61

Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published 14 days ago • 36

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published 7 days ago • 173

upvoted an article 6 days ago

Article

Bringing Fusion Down to Earth: ML for Stellarator Optimization

6 days ago • 60

upvoted 3 collections 7 days ago

Gemma 3n

4 items • Updated 12 days ago • 160

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 9 items • Updated 12 days ago • 144

Hunyuan-A13B

4 items • Updated 11 days ago • 21

upvoted an article 8 days ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

19 days ago • 73

upvoted a collection 8 days ago

ERNIE 4.5

Collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 5 days ago • 143

upvoted an article 10 days ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

11 days ago • 25

upvoted an article 13 days ago

Article

📄 PDF Support in the Hugging Face Dataset Viewer

13 days ago • 2

upvoted a paper 13 days ago

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published 14 days ago • 56