Goh Chia Wei Kenneth's picture

Goh Chia Wei Kenneth PRO

krecceg

·

https://www.ainewbie.org

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Hcompany/Holo1-7B

liked a model 5 days ago

Qwen/Qwen3-Embedding-0.6B

upvoted a collection 5 days ago

Qwen3-Embedding

View all activity

Organizations

krecceg's activity

upvoted a collection 5 days ago

Qwen3-Embedding

6 items • Updated 6 days ago • 82

upvoted a collection 12 days ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 12 days ago • 238

upvoted a paper about 2 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 270

upvoted an article about 2 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 145

upvoted 2 articles 2 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

By

and 5 others •

Feb 4

• 90

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

By

•

Mar 26

• 135

upvoted a collection 4 months ago

SEA-LION v3

9 items • Updated Apr 14 • 7

upvoted an article 5 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

By

•

Jan 15

• 187

upvoted a paper 6 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 134

upvoted an article 8 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

and 1 other •

Oct 14, 2024

• 92

upvoted a collection 8 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 305

upvoted 2 papers over 1 year ago

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Paper • 2311.01767 • Published Nov 3, 2023 • 21

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 73