John Ho PRO
-
Running on Zero • 432
Parakeet-TDT-0.6b-V2
Transcribe audio to text with timestamps
-
Running on Zero • 46
Fast Whisper Turbo
⚡Ultra-fast Whisper Turbo inference ⚡
-
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 3.91M • 2.62k
-
Running on Zero • 336
Realtime Whisper Turbo
🤯Realtime implementation of Whisper large turbo
-
Running on T4 • 54
RF-DETR
🔥SOTA real-time object detection model
-
Running on CPU Upgrade • 49
YOLO ARENA
🏟compare performance of top object detectors
-
Running • 22
SAM2 Video Predictor
🔥Segment and track objects in videos
-
Running on Zero • 90
VLM Object Understanding
🦀Explore object detection, visual grounding, keypoint Detecti
-
Running on Zero • 107
Qwen2 VL Localization
📉Detect objects in images and get bounding boxes
-
Paused • 158
Seed1.5 VL
🚀Seed1.5-VL API Demo
-
Runtime error • 2
Vision Language SmolVLM2
🌍Video + text to text with SmolVLM2
-
Runtime error • 137
Gemma 3n E4B It
⚡Generate text responses to images, videos, and audio
-
Runtime error • 9
Cantonese TTS Text To Speech
👁Generate Cantonese speech from text
-
Running • 3
Cantonese TTS Playground
🔥Generate speech from Cantonese text using selected or custom voice
-
Running on Zero • 1.68k
Dia 1.6B
👯Generate realistic dialogue from a script, using Dia!
-
Runtime error • 82
Daily Paper Podcast
🎙Generates a podcast about today's top trending paper.
-
Running on Zero • 801
Florence 2
📉Generate captions and analyze images with various tasks
-
Runtime error • 518
Florence2 + SAM2
🔥Segment and caption objects in images and videos
-
Running on T4 • 101
SAM2 Video Predictor
🔥Segment and track objects in a video
-
Running • 22
SAM2 Video Predictor
🔥Segment and track objects in videos
-
EvanZhouDev/open-genmoji
Text-to-Image • Updated • 251 • 67
-
Running on Zero • 575
ACE Step
😻A Step Towards Music Generation Foundation Model
-
Running on Zero • 596
DreamO
🐨A Unified Framework for Image Customization
-
Running on Zero • 897
Tile Upscaler
🚀Enhance and upscale images with advanced controls
-
Runtime error • 1.45k
EasyControl Ghibli
🦀New Ghibli EasyControl model is now released!!
-
akiyamasho/AnimeBackgroundGAN-Miyazaki
Image-to-Image • Updated • 25
-
Runtime error • 72
Ghibli Multilingual Text-Rendering
🦀Elevating Ghibli-style AI art beyond ChatGPT's capabilities.
-
Runtime error • 6
AnimeGANv3
😁Convert images to anime style