-
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Paper • 2403.09622 • Published • 18 -
TableGPT2: A Large Multimodal Model with Tabular Data Integration
Paper • 2411.02059 • Published • 5 -
POINTS1.5: Building a Vision-Language Model towards Real World Applications
Paper • 2412.08443 • Published • 39
Ming
nodejs
AI & ML interests
None yet
Recent Activity
liked
a dataset
2 days ago
yandex/yambda
liked
a Space
about 2 months ago
hexgrad/Kokoro-TTS
liked
a model
2 months ago
nari-labs/Dia-1.6B
Organizations
None yet