4 9 14

Jaskirat Singh

jsingh

https://1jsingh.github.io/

AI & ML interests

Video Generation, Image Synthesis, Creative Content Generation

Recent Activity

upvoted a paper 9 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

upvoted an article about 1 month ago

Training Design for Text-to-Image Models: Lessons from Ablations

upvoted a paper about 2 months ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

View all activity

Organizations

upvoted a paper 9 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 9 days ago • 88

upvoted an article about 1 month ago

Article

Training Design for Text-to-Image Models: Lessons from Ablations

Feb 3

•

upvoted a paper about 2 months ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published Jan 22 • 54

upvoted a paper 3 months ago

What matters for Representation Alignment: Global Information or Spatial Structure?

Paper • 2512.10794 • Published Dec 11, 2025 • 9

liked a model 3 months ago

REPA-E/iREPA-collections

Updated Dec 12, 2025 • 3

updated a model 3 months ago

REPA-E/iREPA-collections

Updated Dec 12, 2025 • 3

upvoted an article 4 months ago

Article

We’re open-sourcing our text-to-image model and the process behind it

Nov 12, 2025

•

updated a model 4 months ago

grspo/rae

Updated Nov 3, 2025

New activity in REPA-E/e2e-qwenimage-vae 5 months ago

do you need a special VAE loader to use in comfyui?

#2 opened 5 months ago by

ryg81

updated a Space 5 months ago

README

⚡

liked 3 models 5 months ago

upvoted a paper 5 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 168

updated a dataset 8 months ago

multimodal-fusion/pemdas-cot-prompt-completion-500k

Viewer • Updated Jul 29, 2025 • 500k • 15

published a dataset 8 months ago

multimodal-fusion/pemdas-cot-prompt-completion-500k

Viewer • Updated Jul 29, 2025 • 500k • 15

updated a dataset 8 months ago

multimodal-fusion/pemdas-sft-prompt-completion-500k

Viewer • Updated Jul 29, 2025 • 500k • 16

published a dataset 8 months ago

multimodal-fusion/pemdas-sft-prompt-completion-500k

Viewer • Updated Jul 29, 2025 • 500k • 16

updated a dataset 8 months ago

multimodal-fusion/pemdas-cot-500k

Viewer • Updated Jul 28, 2025 • 500k • 25

published a dataset 8 months ago

multimodal-fusion/pemdas-cot-500k

Viewer • Updated Jul 28, 2025 • 500k • 25

Jaskirat Singh

AI & ML interests

Recent Activity

Organizations

jsingh's activity

Training Design for Text-to-Image Models: Lessons from Ablations

We’re open-sourcing our text-to-image model and the process behind it

do you need a special VAE loader to use in comfyui?

README