SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published 15 days ago • 64
🧠 Traditional Chinese Reasoning Datasets Collection A curated collection of datasets designed to evaluate and train reasoning capabilities in Traditional Chinese across various domains. • 3 items • Updated 27 days ago • 8
🏠 ParScale-1.8B Collection Base models trained on 1T high-quality tokens, competitive with existing SOTA small models (<2B). • 4 items • Updated 23 days ago • 2
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 13 items • Updated May 1 • 154
LiveCC Collection Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025) • 8 items • Updated Apr 23 • 4
BitNet Collection 🔥 BitNet family of large language models (1-bit LLMs). • 7 items • Updated May 1 • 42
Llama Nemotron Collection Open, production-ready enterprise models • 8 items • Updated 3 days ago • 60
Physical AI Collection Collection of commercial-grade datasets for physical AI developers • 15 items • Updated 3 days ago • 55
DRAMA Collection A collection of small (sub-1B) multilingual dense retrievers that generalize well across a number of tasks and languages. • 3 items • Updated Feb 26 • 7
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions Paper • 2502.13124 • Published Feb 18 • 6
OpenR1-Math Collection Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 3 items • Updated 28 days ago • 9
Llasa Collection TTS foundation models compatible with the Llama framework (160k hours of tokenized speech data released) • 11 items • Updated 30 days ago • 18
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 21 days ago • 114
Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others • Nov 26, 2024 • 310