1 312 929

jiakai

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a Space 1 day ago

TIGER-Lab/MMEB-Leaderboard

liked a model 1 day ago

jinaai/jina-embeddings-v4

liked a model 1 day ago

Menlo/Jan-nano-128k

View all activity

Organizations

upvoted a paper 3 days ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published 9 days ago • 105

upvoted an article 7 days ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

and 4 others •

9 days ago

• 66

upvoted a paper 9 days ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published 11 days ago • 88

upvoted a paper 10 days ago

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published 15 days ago • 58

upvoted a paper 11 days ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published 12 days ago • 240

upvoted a collection 11 days ago

MiniMax-M1

Collection

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 4 items • Updated 9 days ago • 101

upvoted an article 14 days ago

Article

Featherless AI on Hugging Face Inference Providers 🔥

and 5 others •

16 days ago

• 41

upvoted 3 papers 14 days ago

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published 17 days ago • 92

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Paper • 2506.10954 • Published 15 days ago • 51

Magistral

Paper • 2506.10910 • Published 15 days ago • 60

upvoted a paper 15 days ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published 22 days ago • 123

upvoted an article 16 days ago

Article

Introducing Training Cluster as a Service - a new collaboration with NVIDIA

and 2 others •

17 days ago

• 23

upvoted 2 papers 16 days ago

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation

Paper • 2506.04614 • Published 23 days ago • 16

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published 21 days ago • 72

upvoted 2 papers 17 days ago

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 131

Sentinel: SOTA model to protect against prompt injections

Paper • 2506.05446 • Published 23 days ago • 22

upvoted a collection 17 days ago

MiniCPM4

Collection

MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 6 days ago • 62

upvoted 2 papers 17 days ago

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published 18 days ago • 81

Reinforcement Pre-Training

Paper • 2506.08007 • Published 18 days ago • 234

upvoted a paper 21 days ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published 23 days ago • 62

jiakai

AI & ML interests

Recent Activity

Organizations

real-jiakai's activity

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Featherless AI on Hugging Face Inference Providers 🔥

Introducing Training Cluster as a Service - a new collaboration with NVIDIA