Yu li

Yukkkop

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago
Native-Resolution Image Synthesis
liked a model 3 days ago
KE-Team/Ke-Omni-R-3B
liked a model 3 days ago
SicariusSicariiStuff/Phi-lthy4
View all activity

Organizations

None yet

Yukkkop's activity

reacted to merve's post with πŸš€ 7 days ago
view post
Post
2541
emerging trend: models that can understand image + text and generate image + text

don't miss out ‡️
> MMaDA: single 8B diffusion model aligned with CoT (reasoning!) + UniGRPO Gen-Verse/MMaDA
> BAGEL: 7B MoT model based on Qwen2.5, SigLIP-so-400M, Flux VAE ByteDance-Seed/BAGEL
both by ByteDance! 😱

I keep track of all any input β†’ any output models here https://huggingface.co/collections/merve/any-to-any-models-6822042ee8eb7fb5e38f9b62
  • 1 reply
Β·
upvoted an article 7 days ago
view article
Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By danaaubakirova and 8 others β€’
β€’ 115
reacted to Jaward's post with πŸ‘ 7 days ago
view post
Post
1146
bumped into one of the OG reads today!! handwriting generation & synthesis is still my favorite application of RNNs - supper amazed at how such a small model (3.6M params), trained overnight on cpu could reach such peak performance. Huge credit to the data (IAM-OnDBπŸ”₯) which was meticulously curated using an infra-red device to track pen position.
Try demo here: https://www.calligrapher.ai/
Code: https://github.com/sjvasquez/handwriting-synthesis