Open to Work

Dark Coder

codewithdark

AI & ML interests

AI/ML Engineer & Researcher | (GPU Poor) LLMs, NLP & Computer Vision | Applied AI & Innovating with Open Source

Recent Activity

liked a Space 10 days ago

huggingface/2025-wrapped

updated a Space 21 days ago

QuantLLM/README

updated a model 21 days ago

QuantLLM/functiongemma-270m-it-4bit-mlx

Organizations

upvoted a collection 21 days ago

Models

Collection

4 items • Updated 21 days ago • 1

upvoted 2 papers 3 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 503

A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation

Paper • 2310.16656 • Published Oct 25, 2023 • 51

upvoted 9 papers 4 months ago

MI-Fuse: Label Fusion for Unsupervised Domain Adaptation with Closed-Source Large-Audio Language Model

Paper • 2509.20706 • Published Sep 25, 2025 • 2

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

Paper • 2509.21320 • Published Sep 25, 2025 • 101

upvoted a paper 6 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 277

upvoted a paper 8 months ago

Sadeed: Advancing Arabic Diacritization Through Small Language Model

Paper • 2504.21635 • Published Apr 30, 2025 • 59

upvoted 2 papers 10 months ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12, 2025 • 74

Agent models: Internalizing Chain-of-Action Generation into Reasoning models

Paper • 2503.06580 • Published Mar 9, 2025 • 20

upvoted a paper 11 months ago

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 58
