zhang
AI & ML interests
Recent Activity
Organizations
-
Running77
Browser only - Screen Capture & OCR
🏆One-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running601601
First Agent Template
⚡Get current time in any timezone
-
Runtime error127127
OctoTools
🚀An Agentic Framework with Tools for Complex Reasoning
-
Running138138
smolagents LLM leaderboard
🏆A leaderboard for LLMs powering smolagents
-
Running on Zero1.53k1.53k
Joy Caption Alpha Two
👁Generate captions for images in various styles
-
Running on Zero4040
Florence Llama
💬Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation • 0.5B • Updated • 10 • 3 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B • Updated • 379 • 2
-
laion/laion-audio-preview
Viewer • Updated • 4.15M • 621 • 11 -
Running on Zero2.64k2.64k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Runtime error2.19k2.19k
FacePoke
🙂Import a portrait, click to move the head!
-
Running on L4643643
OpenAudio S1
🏆Generate speech from text
-
allenai/olmOCR-7B-0225-preview
Image-to-Text • 8B • Updated • 12.3k • 702 -
Runtime error8282
Nanonets OCR
👁Demo for Nanonets-OCR
-
Running on ZeroMCP369369
Multimodal OCR
🍍olmocr / nanonets ocr2 / qwen2vl ocr / aya vision / rolmocr
-
Running on ZeroMCP135135
OCR2
💻nanonets ocr / smoldocling / monkey ocr / typhoon ocr
-
Running on Zero1.57k1.57k
Flux.1-dev Upscaler
🔎Upscale low-resolution images to high resolution
-
Running on Zero432432
InvSR
🌍Image Super-resolution via Diffusion Inversion
-
Paused242242
FLUX Upsacle Image
🔥Upscale images with control and customization
-
Running on L4276276
Thera Arbitrary-Scale Super-Resolution
🔥Enhance image resolution with Thera
-
Djrango/Qwen2vl-Flux
Text-to-Image • Updated • 509 -
Running on Zero917917
OminiControl
🌍Generate an edited image based on text and input image
-
Running on Zero393393
FLUXllama gpt-oss
🏆mcp_server & FLUX 4-bit Quantization + Enhanced
-
Running on L42.15k2.15k
MagicQuill
🪶Generate edited images using scribble inputs
-
Running77
Browser only - Screen Capture & OCR
🏆One-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running601601
First Agent Template
⚡Get current time in any timezone
-
Runtime error127127
OctoTools
🚀An Agentic Framework with Tools for Complex Reasoning
-
Running138138
smolagents LLM leaderboard
🏆A leaderboard for LLMs powering smolagents
-
allenai/olmOCR-7B-0225-preview
Image-to-Text • 8B • Updated • 12.3k • 702 -
Runtime error8282
Nanonets OCR
👁Demo for Nanonets-OCR
-
Running on ZeroMCP369369
Multimodal OCR
🍍olmocr / nanonets ocr2 / qwen2vl ocr / aya vision / rolmocr
-
Running on ZeroMCP135135
OCR2
💻nanonets ocr / smoldocling / monkey ocr / typhoon ocr
-
Running on Zero1.53k1.53k
Joy Caption Alpha Two
👁Generate captions for images in various styles
-
Running on Zero4040
Florence Llama
💬Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation • 0.5B • Updated • 10 • 3 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B • Updated • 379 • 2
-
Running on Zero1.57k1.57k
Flux.1-dev Upscaler
🔎Upscale low-resolution images to high resolution
-
Running on Zero432432
InvSR
🌍Image Super-resolution via Diffusion Inversion
-
Paused242242
FLUX Upsacle Image
🔥Upscale images with control and customization
-
Running on L4276276
Thera Arbitrary-Scale Super-Resolution
🔥Enhance image resolution with Thera
-
Djrango/Qwen2vl-Flux
Text-to-Image • Updated • 509 -
Running on Zero917917
OminiControl
🌍Generate an edited image based on text and input image
-
Running on Zero393393
FLUXllama gpt-oss
🏆mcp_server & FLUX 4-bit Quantization + Enhanced
-
Running on L42.15k2.15k
MagicQuill
🪶Generate edited images using scribble inputs
-
laion/laion-audio-preview
Viewer • Updated • 4.15M • 621 • 11 -
Running on Zero2.64k2.64k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Runtime error2.19k2.19k
FacePoke
🙂Import a portrait, click to move the head!
-
Running on L4643643
OpenAudio S1
🏆Generate speech from text