Jonathan Korstad PRO

jkorstad

https://jpalmer95.github.io/

AI & ML interests

Deep Reinforcement Learning, Generative 3D, Accessibility, Multimodal Models, Agents, Computer Vision, XR. Staying curious.

Recent Activity

updated a Space 3 days ago

jkorstad/img_to_3D_TRELLIS

upvoted a paper 4 days ago

Time Blindness: Why Video-Language Models Can't See What Humans Can?

updated a Space 5 days ago

jkorstad/Mesh_Rigger

jkorstad's activity

upvoted a paper 4 days ago

Time Blindness: Why Video-Language Models Can't See What Humans Can?

Paper • 2505.24867 • Published 6 days ago • 72

upvoted a collection 16 days ago

4D

Collection

1 item • Updated 16 days ago • 1

upvoted an article 24 days ago

Vision Language Models (Better, Faster, Stronger)

Article • and 4 others • 25 days ago • 414

upvoted an article about 1 month ago

How to Build an MCP Server with Gradio

Article • and 1 other • Apr 30 • 162

upvoted 2 collections about 1 month ago

deployed-models

Collection

1397 items • Updated 22 days ago • 11

Perception LM

Collection

7 items • Updated Apr 17 • 52

upvoted an article about 2 months ago

Gradio spaces are the perfect agent tools!

Article • Jan 17 • 17

upvoted a collection 2 months ago

Gemma 3 QAT

Collection

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 7 days ago • 195

upvoted 2 papers 2 months ago

Segment Any Motion in Videos

Paper • 2503.22268 • Published Mar 28 • 17

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

Paper • 2503.21144 • Published Mar 27 • 25

upvoted a paper 3 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 146

upvoted 2 articles 4 months ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • and 2 others • Jan 23 • 180

Open-source DeepResearch – Freeing our search agents

Article • and 4 others • Feb 4 • 1.25k

upvoted a paper 4 months ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published Jan 24 • 36

upvoted 2 articles 4 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 862

We now support VLMs in smolagents!

Article • and 2 others • Jan 24 • 103

upvoted an article 5 months ago

Run ComfyUI workflows for free on Spaces

Article • and 1 other • Jan 14, 2024 • 81

upvoted a paper 5 months ago

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Paper • 2412.15322 • Published Dec 19, 2024 • 18

upvoted a paper 7 months ago

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Paper • 2411.07126 • Published Nov 11, 2024 • 31

upvoted a collection 7 months ago

OpenCoder

Collection

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 83