Cuiunbo PRO
Cuiunbo
AI & ML interests
Anything
Recent Activity
upvoted
a
paper
about 1 month ago
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
upvoted
a
paper
about 1 month ago
RLPR: Extrapolating RLVR to General Domains without Verifiers
liked
a model
3 months ago
maitrix-org/Voila-autonomous-preview
Organizations
VLM For OCR
audio
-
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper ⢠2311.07919 ⢠Published ⢠10 -
mozilla-foundation/common_voice_17_0
Viewer ⢠Updated ⢠13M ⢠34.2k ⢠320 -
Stable Audio Open
Paper ⢠2407.14358 ⢠Published ⢠27 -
fnlp/AnyGPT-chat
Text Generation ⢠Updated ⢠41 ⢠18
MiniCPM-V
-
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text ⢠9B ⢠Updated ⢠35.2k ⢠1.4k -
openbmb/MiniCPM-Llama3-V-2_5-int4
Visual Question Answering ⢠5B ⢠Updated ⢠17.1k ⢠75 -
openbmb/MiniCPM-Llama3-V-2_5-gguf
Updated ⢠2.87k ⢠213 -
openbmb/MiniCPM-V-2
Visual Question Answering ⢠3B ⢠Updated ⢠4.23k ⢠474
Dataset For OCR
VLM dataset
MiniCPM-V
-
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text ⢠9B ⢠Updated ⢠35.2k ⢠1.4k -
openbmb/MiniCPM-Llama3-V-2_5-int4
Visual Question Answering ⢠5B ⢠Updated ⢠17.1k ⢠75 -
openbmb/MiniCPM-Llama3-V-2_5-gguf
Updated ⢠2.87k ⢠213 -
openbmb/MiniCPM-V-2
Visual Question Answering ⢠3B ⢠Updated ⢠4.23k ⢠474
VLM For OCR
Dataset For OCR
audio
-
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Paper ⢠2311.07919 ⢠Published ⢠10 -
mozilla-foundation/common_voice_17_0
Viewer ⢠Updated ⢠13M ⢠34.2k ⢠320 -
Stable Audio Open
Paper ⢠2407.14358 ⢠Published ⢠27 -
fnlp/AnyGPT-chat
Text Generation ⢠Updated ⢠41 ⢠18