I think we just got the best Image to Markdown VLM out there and it's hosted here:
MohamedRashad/Nanonets-OCR
Speech data in audio and text format
Start by gathering high-quality data. This is by far the biggest hurdle for TTS systems out there.
I am considering canceling my Pro subscription because I just discovered that I am limited to hosting 10 ZeroGPU Spaces on my account. This number should be way higher.